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Preface 


The Algorithms department of Communications of the 
ACM (CACM) was established in February 1960, with J. 
H. Wegstein as editor, for the purpose of publishing algo- 
rithms, consisting of procedures and programs, in the Algol 
language. In 1975 the publication of ACM algorithms 
material was transferred to ACM Transactions on Mathe- 
matical Software (TOMS). A wide variety of algorithms 
have been published and many of them have been used 
heavily—either in original form or as translated into other 
languages. Recognizing the general acceptance of the al- 
gorithm material published in CACM and TOMS, the 
Association for Computing Machinery (ACM) has collected 
and reprinted the algorithms to make them more readily 
accessible and more serviceable to a larger group of users. 

This collection contains Algorithms 221-492; these ap- 
peared in the Algorithms department of CACM from 1964— 
1974. 

Algorithms 221-492 were originally published as re- 
ceived—without any refereeing whatever. Many of these 
have since been certified and/or corrected by their authors 
or by other contributors. 

To facilitate the updating and to make this volume 
convenient to use, an understanding of the page numbering 
scheme for the algorithms is helpful. The page designation 
is in a three-part format: the left part is the algorithm 


number; the middle part is the page number within the 
algorithm (the first page of each algorithm is P1); and the 
right part is the number of the revision of that page. All 
sheets in the original, or first, insertion of an algorithm have 
“0” for the right part. The first revision of a page will have 


_a page number having the left and middle parts identical 


with those on the page to be replaced, but the right part 
will be “R1” instead of “0.” The second revision of the 
same page would read R2, and so on. For example, 123-P2- 
R1 would mean the first revision of page 2 of Algorithm 
123. 

Information on submitting algorithms for publication 
may be found in the introductory section located in the 
front of the current loose-leaf collection. Included in this 
material is a cumulative index to all the algorithms pub- 


_ lished since 1960 as well as the ACM Algorithms Policy, 


which guides the publication of all algorithms submitted to 
ACM. 


Webb Miller 

ACM Algorithms Editor 

Department of Mathematics 

University of California, Santa Barbara 
Santa Barbara, CA 93106 
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ALGORITHM 221 
GAMMA FUNCTION 
Water Gautscur (Recd 10 Aug. 63) 


Oak Ridge National Laboratory,* Oak Ridge, Tenn. 
* Now at Purdue University, Lafayette, Ind. 


real procedure gamma (z); value z;_ real z; 

comment This is an auxiliary procedure which evaluates a P(e) 
for0 < z S$ 3to 10 significant digits. It is based on a polynomial 
approximation given in H. Werner and R. Collinge, Math. 
Comput. 15 (1961), 195-197. This procedure must be replaced 
by a more accurate one if more than 10 significant digits are de- 
sired in Algorithm 222 below. Approximations to the gamma 
function, accurate up to 18 significant digits, raay be found in 
the paper quoted above; 


begin 
integer k; real p,t; array A[0:10]; 
A[0] := 1.0; A[l] := .4227843370; A[2] := .4118402518; 
A[3] := .0815782188; A[4] := .0742379076; 
A[5] := — .0002109075; A[6] := .0109736958; 
A[7] := — .0024667480; A[8] := .0015397681; 
A[9] := —.0003442342; A[10] := .0000677106; 
t:= if 2 S$ 1 then zelseif z S 2 then z—1 else z—2; 
= A[10}; 
for k := 9 step —1 until 0 do p:= 1X p+ Alf[k]; 


gamma := if 2 S 1 then p/(z & (z +1)) else if z S 2 then 
p/z else p 
end gamma 


CERTIFICATION OF ALGORITHM 221 [814] 
GAMMA FUNCTION [Walter Gautschi, Comm. ACM 7 

(Mar. 1964), 143] 

Van K. McComss (Recd. 10 Apr. 1964 and 1 Jun. 1964) 
General Electric Co., Huntsville, Ala. 

The algorithm was translated into Fortran IV for the IBM 
7094. Computations were performed in double precision to take 
advantage of the ten significant digits given by ‘the polynomial 
coefficients. The function I'(z) was evaluated for the range 0 < z S$ 
10 with an increment of 0.1, and the results were checked with the 
values published in Table of the Gamma Function for Complex 
Arguments, NBS Applied Mathematics Series 34 (1954). The algo- 
rithm gave ten-digit accuracy for the range indicated. 


REMARKS ON: 
ALGORITHM 34 [S14] 
GAMMA FUNCTION 

[M. F. Lipp, Comm. ACM 4 (Feb. 1961), 106] 
ALGORITHM 54 [S14] 
GAMMA FUNCTION FOR RANGE 1 TO 2 

(John R. Herndon, Comm. ACM 4 pr 1961), 180] 
ALGORITHM 80 [814] 
RECIPROCAL GAMMA FUNCTION OF REAL 
ARGUMENT 
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[William Holsten, Comm. ACM 5 (Mar. 1962), 166] 
ALGORITHM 221 [S14] 
GAMMA FUNCTION 
[Walter Gautschi, Comm. ACM 7 (Mar. 1964), 143] 
ALGORITHM 291 [S14] 
LOGARITHM OF GAMMA FUNCTION 
[M. C. Pike and I. D. Hill, Comm. ACM 9 (Sept. 1966), 
684] 
M. C. Pix anp I. D. Hitt (Recd. 12 Jan. 1966) 
Medical Research Council’s Statistical Research Unit, 
University College Hospital Medical School, 
London, England 


Algorithms 34 and 54 both use the same Hastings approxima- 
tion, accurate to about 7 decimal places. Of these two, Algorithm 
54 is to be preferred on grounds of speed. 


Algorithm 80 has the following errors: 
(1) RGAM should be in the parameter list of RGR. 
(2) The lines 

if x = 0 then begin RGR := 0; go to EXIT end 
and 

if z = 1 then begin RGR := 1; goto EXIT end 
should each be followed either by a semicolon or preferably by an 
else. 
(8) The lines 

if ¢ = 1 then begin RGR := 1/y; 
and 

if z < — 1 then begin y := y X xz; goto CC end 
should each be followed by a semicolon. 
(4). The lines 

BB: if x = —1 then begin RGR := 0; goto EXIT end 
and 

if xz > —1 then begin RGR := RGAM (2); 


go to EXIT end 


go to EXIT end 


should be separated either by else or by a semicolon and this 
second line needs terminating with a semicolon. 


(5) The declarations of integer 7 and real array B[0:13] in RGAM 
are in the wrong place; they should come immediately after 
begin real z; 


With these modifications (and the replacement of the array B 
in RGAM by the obvious nested multiplication) Algorithm 80 ran 
successfully on the ICT Atlas computer with the ICT Atlas 
ALGOL compiler and gave answers correct to 10 significant digits. 


Algorithms 80, 221 and 291 all work to an accuracy of about 10 
decimal places and to evaluate the gamma function it is therefore 
on grounds of speed that a choice should be made between them. 
Algorithms 80 and 221 take virtually the same amount of comput- 
ing time, being twice as fast as 291 at x = 1, but this advantage 
decreases steadily with increasing x so that at z = 7 the speeds are 
about equal and then from this point on 291 is faster—taking only 
about a third of the time at x = 25 and about a tenth of the time 
at x = 78. These timings include taking the exponential of log- 
gamma. 

For many applications a ratio of gamma functions is required 
(e.g. binomial coefficients, incomplete beta function ratio) and the 
use of algorithm 291 allows such a ratio to be calculated for much 
larger arguments without overflow difficulties. 
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ALGORITHM 222 
INCOMPLETE BETA FUNCTION RATIOS 
WALTER Gautscui (Reed 10 Aug. 63) 


Oak Ridge National Laboratory,* Oak Ridge, Tenn. 
* Now at Purdue University, Lafayette Ind. 


comment Let Bz(p, q) = Jj v7! (1 — t)t 1 dt (p > 0, q > 0, 
0 S x S 1) denote the incomplete beta function. The objective 
of this algorithm is to evaluate a sequence of ratios I,(p, q) = 
B.(p, q)/B:(p, q), a8 one of the parameters p, q varies in steps of 
unity while the other remains fixed. The procedure incomplete beta 
q fized evaluates I,(p + n, q) for n = 0,1, --- , nmaz, assum- 
ing 0 < p S 1, q > O, whereas the procedure incomplete 
beta p fixed evaluates I;(p, qg + n) for n == 0,1, -+: , nmaz, as- 
suming 0 < q S$ 1, p > 0. The number d of significant digits 
desired can be specified, but is only guaranteed when x S 3. 
When x > 3, the complements 1 — J, will be accurate to d sig- 
nificant digits. In the region 0 < p S$ 1,0 < q S 2, I,(p, q) is 
calculated from a power series expansion. The sequences f(n) 
= I,(p +n, q) and g(n) = I,(p,q +n), including initial values, 
are generated recursively by means of the recurrence relations 
fut) -A+ n+p4+aq-12e/nt+pn)fm+(nm+pt+¢ 
—1)a/n+ p))fm-)=0gm+1)-A+m+p+¢q-1)- 
(1 ~ z)/(n+q)) g(r) + (n+ p+ q- 1) (1 —2)/(n + Q)) 
-g(n ~ 1) = O. Since the former is mildly unstable, a variant 
of the backward recurrence algorithm of J. C. P. Miller is ap- 
plied to it. A global real procedure gamma (z} must be avail- 
able (see Algorithm 221); 

real procedure [subz p and q small (2, p, q, d); 
value x, p, q, d; 
integer d; real 2%, p, q; 
comment This procedure evaluates /,(p, gq) to d significant 
digits when 0 < p S'1 and 0<qS 2. It first calculates 
B.(p, q) by a series expansion in powers of x, and then divides 
the result by Bi(p,q) = T(p)M(q)/"(p + q), using the real 
procedure gamma; 
begin integer k; real epsilon, u, v, 8; 
epsilon := 6 X 10T (—d); 
ui=a2tp; sit=u/p; k:=0; 
LO: uw := (k-q+l1) X (k+p) X x X u/(k+1); 
vis uf(ktptl); s:=sto; ki=k+1; 
if abs(v)/s > epsilon then go to LO; 
Isubx p and q small := s X gamma(p+q)/(gamma(p) X 
gamma (q)) 
end Isubx p and q small; 
procedure forward (x, p,q, 10, I1, nmaz, I); 
value z, p, q, 10, I, nmaz; 
integer nmax; real x, p,q, 10,11; array J; 
comment Given J0 = I,(p, q), [1 = I:(p, q+1), this procedure 
generates I,(p, g+n) for n = 0, 1, 2,--- , nmaz, and stores the 
results in the array J; 
begin integer n; 
I[0] := 10; if nmax > 0 then J[{1] := 11; 
for n := 1 step 1 until nmaz — 1 do 
I[n+1] := (1+(n+p+q—1) X (—2)/(n+q)) X Ln] 
— (nt+p+q-1) X (1-2) X I[n—1]/(n+q) 
end forward; 
procedure backward (x, p, q, 10, nmaz, d, I); 
value x, p, q, 10, nmaz, d; 


integer nmaz,d; real xz, p,q, 10; array I; 
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comment Given JU = I,(p, q), this procedure generates 
I.(p+n,q) forn = 0,1, 2, --: ,nmaz to d significant digits, 
using 2 variant of J. C. P. Miller’s backward recurrence al- 
gorithm. The results are stored in the array [; 

begin 
integer n, nu, m; real epsilon, r; array lapproz, 
Rr [0:nmazx}; 


I{(0)} := 10; if nmaz > 0 then 
begin 
epsilon := .5 X 10T (—d); 
for n := 1 step 1 until nmaz do Iapproz[n] : = 0; 


nui=2xX* nmax + 5; 
Dl: ni= nu; ri=0; 
L2: ri= (ntpt+q-1) X 2/(nt+pt+(nt+pt+q-l) X « 


— (n+p) Xr); 

if n S nmaz then Rr[n—1]:= 7; n:= n—-1; 
if n 2 1 then go to £2; 

for n := 0 step 1 until nmax — 1 do 


I[n+1] := Rr{n] X I[n]; 


for n := 1 step 1 until nmazx do 
if abs (I[n] — Iapproz[n])/I[n]) > epsilon then 
begin 
for m:=1step1 until nmaz do lapprox{m|:= I|m); 
nui= nu+5; goto Ll 
end 


end 
end backward; 
procedure [subz qfixed(z, p,q, nmax,d,1); 
integer nmaz,d; realx,p,q; array I; 
comment This procedure generates I,(p+n,q),0 < p S 1, for 
n=0,1,--- , nmazx to d significant digits, using the procedure 
backward. In order to calculate the initial value J0=/,(p,q), it 
first reduces gq modulo 1 to gq, where 0 < gq XS 1, then obtains 
Tz(p,qo) and Iz(p,qgo+1) by the real procedure Isubz p and q small, 
and finally uses these as initial values for the procedure for- 
ward, which connects with /,(p,q) by the recurrence in q; 
begin integer m, mmaz; reals, g0,[q0,Igl; 
m:= entier(q); Ss i:='¢ — m; 
gO := if s > 0 then s else s + 1; 
mmax := if s > 0 then m else m — 1; 
Iq) := Isubz p and q small(x, p, q0, a); 
if mmaz > 0 then [ql := Isubz p and q small(z, p, gO+1, d); 
begin array [q[0:mmaz]; 
forward (x, p, q0, [q0, Iq1, mmazx, Iq); 
backward (zx, p, q, [q{mmaz], nmaz, d, I) 
end 
end [subz q fixed; 
procedure [subz p fixed(z, p,q, nmaz,d,1); 
integer nmaz,d; realz,p,q; array I; 
comment This procedure generates I;(p, q+n),0°< q S 1, for 
n=0,1,--- , nmaz to d significant digits, using the procedure 
forward. The initial values JO=I/,(p,q), [1=I:(p,q+1) are ob- 
tained by twice applying the procedure backward. The initial 
values for the latter are provided by the real procedure Isubx p 
and q small; 
begin integer m, mmaz; 


value z, p,q, nmaz, d; 


valuex, p,q, nmaz,d; 


real s, p0, J0, I1, I@0, Iq1; 


m:=entter(p); s:=p—™m; 
p0.:= ifs > O then s else s + 1; 
mmax := if s > 0 then m else m — 1; 


10 := Isubx p and q small(x, p0, q, d); 
I := Isubz p and q small(x, p0, q+1, d); 
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begin array [p(0:mmaz); 
backward (x, p0, q, 10, mmaz, d, Ip); Iq) := I'p[mmaz]; 
backward (x, p0, g+1, 11, mmaz, d, Ip); Iql := Ip[mmaz] 

end; 
forward(x, p, g, I[q0, Iqg1, nmaz, I) 

end Isubx p fixed; 

procedure incomplete beia q fixed(z, p, q, nmaz, d, I); 
value x, p, g, nmaz, d; 
integer nmaz, d; real x, p, q; array J; 

comment This procedure obtains the final results [,(p-+n,q), 
0 < p S 1, n=0, 1,--: , nmaz, directly from the procedure 
Isubx q fixed, if « S 4, or via the relation J,(p+n,q) = 
1 ~— I,_2(q,p-t+n) and the procedure [subz p fixed, if x > 4. The 
indicated substitution in the case z > 4 is made to ensure fast 
convergence of both the power series used in the real procedure 
Isubzx p and q small, and the backward recurrence algorithm used 
in the procedure backward. If the parameters x, p, 7, nmax are 
not in the intended range, control is transferred to a nonlocal 
label called alarm; 

begin integer n; 
ifz<0Ve >1Vp SOV p > 1V¢q 8S OV nmaz < 0 then go to 
alarm; 


ifz=0\V/xz=1 then forn : = Ostep 1 until nmaz do 1[n]: = z else 
begin . 

ifz S 5 then [subs q fixed(x, p, q, nmaz, d, I) else 

begin 

Isubx p fixed --x,q,p, nmax,d, I); 

for n = 0 step 1 until nmaz do J{n] := 1 — I[n] 

end 
end 


end incomplete beta q fixed; 

procedure incomplete beta p fixed(x, p, q, nmaz,d, I); 
valuexz,p,q,nmaz,d; integer nmaz,d; realz,p,q; array I; 

comment This procedure, the exact analogue to the procedure 
incomplete beta q fixed, generates the final results J,(p,q+n), 
0<qs81,n=0,1,--- , nmaz. For the setup of the procedure, 
see the comment in incomplete beta q fired; 

begin integer n; 
ifz<O0Vrx>1VqsS0Vq>1Vp SOV nmar < 0 then go to 
alarm; 


ifc=0Vxr=1 then for n:= Ostep1 until nmaz do I[n]:= z else 
begin 
ifx S 5 then Jsubz p fired(x, p, q, nmaz, d, I) else 
hegin 
Isubx q fixed(1—x, q, p, nmaz,d, 1); 
for n := Ostep 1 until nmaz do /[n] := 1 — I[n] 
end 
end 


end incomplete beta p fixed 

REFERENCE: WALTER Gavtscui, Recursive computation of 
special functions. U. of Michigan, Eng. Summer Conf., Numerical 
Analysis, 1963. 


CERTIFICATION OF ALGORITHM 222 
INCOMPLETE BETA FUNCTION RATIOS 
Gautschi, Comm. ACM 7 (March 1964), 143] 

WaLTeR Gattscu! (Recd 2 Jan. 1964) 

Purdue Univ., Lafayette, Ind. 

begin integer n; array I1, 2, [3(0: 10]; 

comment This program calls the procedures Incomplete beta q 
fixed and Incomplete beta p fixed to calculate test values of 
Tig(.5+n, 7), [4(5, 1+n), I(5, 14+) for n= 0(1)10 to 6 
significant digits. The following results were obtained on the 
CDC 1604-A computer, using the Oak Ridge ALGoL compiler: 


[Walter 
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n Iig(.5 + 2, 7) Ta(5, 4 +n) I3(5,1+ 7) 
0 . 99143646185 . 010239999997 . 32768000004 
1 .93951533330 .040959999972 .65536000000 
2 .83567307612 .096255999927 . 85196799999 
3 .69444760641 . 17367039987 . 94371839999 
4 . 54111709640 . 26656767980 . 98041856000 
5 . 39800862042 . 36689674211 . 99363061758 
6 . 27831789503 .46722580441 . 99803463679 
7 . 18624810627 . 56182177742 . 99941875711 
8 . 11995785836 64695815314 . 99983399319 
9 .074724512738 . 72074301208 - 99995395031 
10 . 045203802963 . 78272229360 . 99998753828 


All results are in agreement with those tabulated in [1]; 
Incomplete beta q fixed (.4, .5, 7, 10, 6, 71); 
Incomplete beta p fixed (.4, 5, 1, 10, 6, 12); 
Incomplete beta p fixed (.8, 5, 1, 10, 6, J3); 
for n :& 0 step 1 until 10 do 
write ( I1[n], [2{n}, [3{n]) 
end Driver incomplete beta function ratios 
In the original publication of the algorithm, the following cor- 
rection of a printer’s error is needed in the real procedure Isubz 
p and q small, The statement labelled ZO should read as follows: 
wi=e(kK-q+1)X 2X uf/(k+)); 
{1] Pearson, K. Tables of the Incomplete Beta-Function. Cam- 
bridge University Press, London, 1934. 


COLLECTED ALGORITHMS FROM CACM 


ALGORITHM 223 

PRIME TWINS 

M. Surmrat (Reed 7 June 1963; in final forra 2 Jan. 1964) 
University of Alberta, Calgary, Alberta, Canada 


procedure Prime Twins (t, Twinl, Twin2, Storage, Act): 
value Storage; integer t, Twinl, Twin2, Storage; 
procedure Act; 
comment This procedure will generate successive ‘‘prime 
twins,’’ i.e. pairs of primes T'winl, Twin2 which differ by 2. 
Storage is the maximum number of primes that can be stored. 
Act is any procedure for recording, examining, or utilizing each 
pair of twins as it is generated. ¢ is a serial number for the 
twins. P[Storage] f 2 is the last number examined; 
begin integer array P{l: Storage]; integer j, m, previous, 
current; 
comment P[j] is the jth prime; 
Pil] := 2; P[2]):= 3; j := 2; previous := 3; t:= 0; 
for current := 5 step 2 until P[j] x P[j] do 
begin m:=1; for m :=m +1 while P[m] xX P[m] S cur- 
rent do 
if current = (current + P(m]) X P[m] then go to NoPrime; 
comment If this point is reached, current is not divisible by 
any prime up to sqrt(current) and so is a prime. We now 
record the new prime, if storage permits, then check if it 
is the second of twins; 
if 7 < Storage then 
begin j :=j+1; P[j] := current 


end; 

if current = previous + 2 then 
begin ¢ := f-+1; Twinl := previous; Twin2 := current; 

Act (t, Twinl, Twin2) 

end; 

previous := current; 

NoPrime: 

end; 


end procedure Prime Twins 
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ALGORITHM 224 
KVALUATION OF DETERMINANT 
Leo J. ROTENBERG 
(Reed 7 Oct. 1963; in final form 20 Dec. 1963) 
Box 2400, 362 Memorial Dr., Cambridge, Mass. 


real procedure determinant (a, n); 
value n; real array a; integer 7; 
comment This procedure evaluates a determinant by triangulari- 
zation. The matrix supplied by the calling procedure is modified 
by this program. This procedure is.an extensive revision and 
correction of Algorithm 41; 
begin real product, factor, temp, div, piv, abpiv, maxpiv; 
integer ssign,1,J, 7, imax; 
ssign := 1; product := 1.0; 
for r := 1 step 1 until n—1 do © 
begin mazpiv := 0.0; 
for 1 := r step 1 until n do 
begin piv := alz, r]; 
abpiv := abs (piv); 
if abpiv > maxpiv then 


begin mazxpiv := abpiv; 
div := pty; 
imax := 4 
end 
end; 


if maxpiv ~ 0.0 then 
begin if imaz = i then go to reswme else 


hegin for j := r step 1 until n do 
begin temp := alimaz, J]; 
altmaz, j| := alr, j]s 
alr, j] := temp 
end; 
ssign := — ssign; 
go to resume 
end 
end; 
determinant := 0.0; 
go to return; 
resume: for 1 := r+1 step 1 until n do 


begin factor := alt, r)/div; 
for j := r+1 step 1 until n do 
alz, 7] := alt, 7] — factor X alr, 7] 
end 
end; 
for 7 := 1 step 1 until 2 do 
product := product X alt, 7]; 
comment Exponent overflow or underflow will most likely 
occur here if at all. For large or small determinants the user 
is cautioned to replace this with a call to a machine-language 
product routine which will handle extremely large or small 
real numbers; 
determinant := ssign X product; 
return: 
end 


~CACM 
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CERTIFICATION OF ALGORITHM 224 [F3] 
EVALUATION OF DETERMINANT 

[Leo J. Rotenberg, Comm. ACM 7 (Apr. 1964), 243] 
Vic HASSELBLAD AND JEFF RuLiFson (Recd. 17 July 1964) 
Computer Center, U. of Washington, Seattle, Wash. 

The ‘‘ivaluation of Determinant’’ program was tested on an 
ALGot 60 compiler for an IBM 709 (SHARE distribution * 3032). 
When the 10th line on page 244 was changed to read: 

begin if imax = r then go to resume else 


correct results were obtained. It was tested up through 4 Xx 4 
matrices. 
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ALGORITHM 225 

GAMMA FUNCTION WITH CONTROLLED 
ACCURACY 

S. J. Cyvin anp B. N. Cyvin (Reed. 25 Oct. 1963) 
Technical University of Norway, Trondheim, Norway 


real procedure GAMMA (m, x); value m, x; integer ™; 
real x; 

comment I'(z) is calculated with at least m significant figures 
(disregarding the machine’s roundoff). The range of x is reduced 
by recursion to 5 S x S 6, for which I'(z) is found (with m—2 
significant decimals) according to 


T ive) 
T(z) = / te-le-t dt + ( tele" at, 
0 T 


Simpson’s formula is applied to the former integral, which is 
divided into 2n parts. Here n, as well as T, are chosen auto- 
matically to give a result with the required accuracy. For x 
near zero or a negative integer, (x) is put equal to a large value, 
10°. The procedure is slower than other algorithms for T(z) 
[see Nos. 31, 34, 54, 80], but has the advantage of controlled 
accuracy; 
begin integer 7,n,f,T; real y,h,S; 
h:=1; yt=a2; 
A: if abs(y) < 10-60 then begin GAMMA := 1060; 
F end else 
ify > 6 then beginy := y—1; h:=hxXy; 
ify < 5then beginh :=h/y; y:= y+; 
begin real a; 
TP $= 205 
U:. if (7175 + 4KTT4 + 16X7TT3 + 48K7TT2 + 96XT + 
96) X exp(—T)>.25K10 7 (2—m) then begin T := 7'+5; 
go to U end; 
n i= 1 + entter(sqrt(sqrt(T T 5X10 7 (m—2)/30))); 
S:=0; f:=4; 
for 7 := 1 step 1 until 2X7n do 
begin 
a:= 6XixT/n; S:= 8 + fxat y—l)Xexp(—a); 
f := ift = 2Xn-1 then 1 else if f = 4 then 2 else 4 
end 
end; 
GAMMA := (SXT/(6Xn) + (5X7T754+3X7T7T44+12xTT3 
+ 36X72 + 72XT + 72)Xexp(—T))Xh; 


go to 


go to A end else 
go to A end else 


EB: 
end of GAMMA 


CERTIFICATION OF ALGORITHM 225 [814] 

JAMMA FUNCTION WITH CONTROLLED AC- 
CURACY [S. J. Cyvin and B. N. Cyvin, Comm. ACM 
7 (May 1964), 295] 

TT. A. Bray (Reed. 25 May 1964 and 18 Jun. 1964) 

Boeing Scientific Research Laboratories, Seattle, Wash. 
Algorithm 225 was coded in Fortran II and run on the IBM 


1620. No corrections were necessary and the following results were 
obtained for m = 2: 
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x GAMMA (m, x) x GAMMA (m, x) 

.O1 99 .44362100 3.50 3.32349920 

.05 19 .47214000 4.00 6 .00067550 

.10 9.51444650 4.50 11 .63224700 

.50 1.77253280 5.00 24 00270200 
1.00 1.00011250 5.50 52 .34511500 
1.50 .88626644 10.00 0 3628697410 6 
2.00 1.00011250 25.00 0.62043066,» 24 
2.50 1.32939960 50.00 0.6082643410 63 
3.00 2.00022510 


These results are correct to at least two significant digits. The 
following results and times were obtained for z = 0.5: 


m GAMMA (m, x) TIME {om 
2 1.77253280 58 
3 1.77254230 105 
4 1.77245370 200 
5 1.77244430 405 
6 1.77244020 885 


The correct result is 1.7724539. Note that the accuracy decreased 
as m increased and the result for m = 6 is incorrect in the sixth 
significant digit. 

This algorithm is extremely slow as compared to some others 
available. Algorithm 31 was used for the above set of arguments 
and gave seven-digit accuracy in 250 milliseconds per argument. 
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ALGORITHM 226 

NORMAL DISTRIBUTION FUNCTION 

S. J. Cyvin (Recd. 15 Oct. 1963) 

Technical University of Norway, Trondheim, Norway 


real procedure Fi(m,c); value m,z; integer m; real 2; 
comment ®&(7) = (1/V/2m) S06 exp(—4u?) du is found by com- 
puting [% exp(—4}u?) du with aid of Simpson’s formula. The 
latter integral is divided into 2n parts, where n automatically 
is adjusted to give a result with at least m significant decimals 
(disregarding the machine’s roundoff). The error function is 
obtainable as erf (x) =" 2(x/./2) — 1. The practical use of the 
present method is not restricted to small or large ranges of x. 
Probably the method has some advantages compares to Algo- 
rithms 123, 180, and 209; 
begin integer i,n,f; real b,S; 
b := abs(z); 
n:= 1+ entier(sqrt(sqrt(b T 5X10 T m/ 
(480 X sqrt(2<3.14159265))))); 
ifn <4thenn:=4; S:=1; f: 
for 7 := 1 step 1 until 2Xn do 
begin 
S:=S+fX exp(—(ixb/n) f 2/8); 
f:= ift = 2Xn—1 then 1 else if f=4 then 2 else 4 
end; 
Fi := 5+ sign(x)XSXb/(6XnX sqrt (2X3.14159265)) 
end Fi 


REMARKS ON: 

ALGORITHM 128 [S15] 

REAL ERROR FUNCTION, ERF(z) 
[Martin Crawford and Robert Techo Comm. ACM 5 
(Sept. 1962), 483] . 


ALGORITHM 180 [S15] 
ERROR FUNCTION—LARGE X 
[Henry C. Thacher Jr. Comm. ACM 6 (June 1963), 314] 


ALGORITHM 181 [815]. 
COMPLEMENTARY ERROR FUNCTION— . 
LARGE X 

[Henry C. Thacher Jr. Comm. ACM 6 (uns pee 315] 


ALGORITHM 209 [S15] 
GAUSS 
[D. Ibbetson. Comm. ACM 6 (Oct. 1963), 616] 


ALGORITHM 226 [815] 
NORMAL DISTRIBUTION FUNCTION 
[S. J. Cyvin. Comm. ACM 7 (May 1964), 295] 


ALGORITHM 272 [815] 
PROCEDURE FOR THE NORMAL DISTRIBUTION 
FUNCTIONS 

[M. D. MacLaren. Comm. ACM 8 (Dec. 1965), 789] 
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ALGORITHM 304 [815] 

NORMAL CURVE INTEGRAL 
(I. D. Hill and 8. A. Joyce. Comm. ACM 10 (June 
1967), 374] 


I. D. Hitt anp S. A. Joyce (Recd. 21 Nov. 1966) 

Medical Research Council, 

Statistical Research Unit, 115 Gower Street, 
W.C.1., England 


London 


These algorithms were tested on the ICT Atlas computer using 
the Atlas ALGOL compiler. The following amendments were made 
and results found: 


ALGORITHM 123 

(i) value x; was inserted. 

(ii) abs(T) < »:—-10 was changed to Y—T= Y 
both these amendments being as suggested in [1]. 

(iii) The labels 1 and 2 were changed to Ll and L2, the go to 
statements being similarly amended. 

(iv) The constant was lengthened to 1.12837916710. 

(v) The extra statement x := 0.707106781187 X x was made 
the first statement of the algorithm, so as to derive the 
normal integral instead of the error function. _ 

The results were accurate to 10 decimal places at all points 
tested except x = 1.0 where only 2 decimal accuracy was found, as 
noted in [2]. There seems to be no simple way of overcoming the 
difficulty [3], and any search for a method of doing so would 
hardly be worthwhile, as the algorithm is slower than Algorithm 
304 without being any more accurate. 


ALGORITHM 180 
(i) JT := —0.56418958/z/exp(v) was changed to 
T := —0.564189583548 X exp(—v)/zx. This is faster and also 
has the advantage, when »v is very large, of merely giving 0 
as the answer instead of causing overflow. 
(ii) The extra statement x := 0.707106781187 X x was made 
as in (v) of Algorithm 123. 
(iii) form := m-+1 = was changed to form := m-+2. m+1 
is a misprint, and gives incorrect answers. 
The greatest error observed was 2 in the 11th decimal place. 


ALGORITHM 181 
(i) Similar to (i) of Algorithm 180 (except for the minus sign). 
(ii) Similar to (ii) of Algorithm 180. 
(iii) m was declared as real instead of integer, as an alternative 
to the amendment suggested in [4]. 
The results were accurate to 9 significant figures for x < 8, 
but to only 8 significant figures for z = 10 and z = 20 


ALGORITHM 209 
No modification was made. The results were accurate to 7 decimal] 
places. 


ALGORITHM 226 
(i) 10 fT m/(480Xsqrt(2X3.14159265)) was changed to 
10 T m X 0.000831129750836. 
(ii) for 7 := 1 step 1 until 2 X n do was changed to 
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m:=2Xn; fort := 1 step 1 until m do. 
(ili) —(tXb/n) T 2/8 was changed to 
(iv) iff =2Xn-—1 waschanged tc if7 = m—1 
(v) b/(6XnXsqrt(2X%3.14159265)) was changed to 
b/(15.03897696478 Xn). 


Tests were made with m = 7 and m = 11 with the following 
results: 


en Rs 


Number of significant Number of decimal 
. jigures correct places correct 
m= 7 m= 11 m=i7 m = 11 
—0.5 7 11 7 11 
~—1.0 7 10 7 10 
—1.5 Zt 10 8 10 
—2.0 7 9 8 10 
—2.5 6 9 8 11 
—3.0 6 7 8 9 
—4.0 5 7 10 11 
—6.0 2 1 12 10 
—8.0 0 0 11 9 


Perhaps the comment with this algorithm should have referred 
to decimal places and not significant figures. To ask for 11 sig- 
nificant figures is stretching the machine’s ability to the limit, 
and where 10 significant figures are correct, this may be regarded 
as acceptable. 


ALGORITHM 272 
The constant .99999999 was lengthened to .9999999999. 

The accuracy was 8 decimal places at most of the points tested, 
but was only 5 decimal places at x = 0.8. 
ALGORITHM 304 
No modification was made. The errors in the 11th significant figure 
were: ; 


abs(x) x > 0 = upper x > 0 # upper 
0.5 1 1 
1.0 1 2 
1.5 215(5) 2 
2.0 259(0) 4 
3.0 0 0 
4.0 2 3 
6.0 6 0 
8.0 14 0 
10.0 23 0 

20.0 35 0 


* Due to the subtraction error mentioned in the comment section 
of the algorithm. Changing the constant 2.32 to 1.28 resulted in 
the figures shown in brackets. 


To test the claim that the algorithm works virtually to the 
accuracy of the machine, it was translated into double-length 
instructions of Mercury Autocode and run on the Atlas using the 
EXCHLF compiler (the constant being lengthened to 
0.398942280401432677939946). The results were compared with 
hand calculations using Table II of [5]. The errors in the 22nd 
significant figure were: 


—(ixXb/n) FT 2X 0.125. 
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a 


abs (x) x > 0 = upper x > 0 + upper 
1.0 2 3 
2.0 7 1 
4.0 2 0 
8.0 8 0 


Timings. Timings of these algorithms were made in terms of 
the Atlas “Instruction Count,” while evaluating the function 100 
times. The figures are not directly applicable to any other com- 
puter, but the relative times are likely to be much the same on 
other machines. 


Instruction Count ror 100 EvaLuatTIoNns 


Algorithm number 


123 | 180 | 181 | 209 226 272 | 3049 | 304° 


8 97 24 25 2+ 
1.0 65¢ 8 176 24 29 29 
9 273 25 35 35 


2.0 | 194 78 90 8 387 | 24 39 39 
2.5 | 252 54 68 10 O15 24 131 44 


3.0 42 51 9 628 20 97 50 
4.0 27 39 9 9004 | 25 67. 44 
6.0 15 30 6 14004 | 16 49 23 
8.0 9 28 7 21004 18 44 ll 
10.0 10 25 1 5 27004 | 16 38 11 
20.0 9 22 5 6500¢ | 16 32 il 
30.0 9 9 5 | 10900? | 16 11 11. 


» Readings refer to x > 0 # upper. 
° Time to produce incorrect answer. A count of 120 would fit a 
smooth curve with surrounding values. 

100 times Instruction Count for 1 evaluation. 


Opinion. There are advantages in having two algorithms 
available for normal curve tail areas. One should be very fast and 
reasonably accurate, the other very accurate and reasonably 
fast. We conclude that Algorithm 209 is the best for the first 
requirement, and Algorithm 304 for the second. 

Algorithms 180 and 181 are faster than Algorithm 304 and may 
be preferred for this reason, but the method used shows itself in 
Algorithm 181 to be not quite as accurate, and the introduction 
of this method solely for the circumstances in which Algorithm 
180 is applicable hardly seems worth while. 


Acknowledgment. Thanks are due to Miss I. Allen for her 
help with the double-length hand calculations. 
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ALGORITHM 227 

CHEBYSHEV POLYNOMIAL COEFFICIENTS 

S. J. Cyvin (Recd. 15 Oct. 1963) 

Technical University of Norway, Trondheim, Norway 


procedure I'cheb(n,A);  valuen; integern; integer array A; 

comment This procedure finds (by recursion) the coefficients 
of T,(z), rather than the value of the polynomial, which is the 
subject of Algorithms 10 and 36. The (n+2)+2 nonvanishing 
coefficients are stored in one-dimensional integer array A in 
the following way: 


Pp 
Top(t) = D> Alit+1] 2% (n even), 
£=0 
Topalz) = >> Alit+1}e*# (n odd); 
i=0 
begin integer 7,j7; integer array B[1:(n+2)+2]; Boolean 
EVEN; 
Afi] := Bil]):=1; EVEN :=n+2xX2=n; ifn > 1then 
for 1 := 2 step 1 until (n+2)+2 do 
for j := 7 step —1 until 1 do 
begin 
Alj] := if j=i7 then 2XB[j—1] else if j=1 then — A[t] 
else 2 X B[j—1] — Aly]; 
Bij] += if j=7 then 2 X Ali] else 2 X Als] — BU] 
end 7 loop; 
for 7 := 1 step 1 until (n+2)+2 do 
Ali] := if EVEN then A[z] else B[z] 
end T cheb 
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ALGORITHM 228 

Q-BESSEL FUNCTIONS 1[,(:) 

J. M.S. Sim6es Pererra (Recd. 21 Sept. | 63 and 6 Jan. 
64) 

Gulbenkian Scientific Computing Ctr, Lisboa, Portugal 


procedure qBesselbar (t,q,n,j,8); integer n,j; real t,q,s; 

comment This procedure computes values of any q-Bessel 

function J,(¢) for n integer (positive, negative or zero) by the 

use of the expansion [,(t) = > ?9 (ir *?*/((q)e(Q)n+e)) where 

(x = (1-9) (1—@) ++ (1-@"), (@o = 1 and (1/(q)-n) =0 (n=1, 
--). This series is convergent for ¢ € (—©, --~) if |q| > 1 

and for |¢| < lif |q|< 1. j+1 denotes the number of terms 

(at least 2) retained in the summation and s stands for the sum 

of these first terms. See L. Carlitz, The product of q-Bessel 

functions, Portugaliae Mathematica, vol. 21; 

begin integer k,m,p; realcju; m:i=abs(n); c:= 1; 

ifn = 0 then goto A; 

for p:= lstep1 until mdoc:= cX(1—¢qT p); 

if n < 0 then go to B; 

A:ur= (tfn)/e; si= 4; 

for k := 1 step 1 until j do : 

begin wu i= uxX(ét2)/(d—-qtkh)xXQd—-qt (mt+k))); s c= 

s+ uend; a 

go to C; 

Br ur=t fT (n+2Xm)/e; si= u; 

fork := m+ 1 step 1 until j do 

begin wu t= uxX(tt2)/(d-qTkxd—-@qT (mt+k))); s t= 

s + u end; : 
C: end 
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ALGORITHM 229 


ELEMENTARY FUNCTIONS BY CONTINUED 
FRACTIONS 

JAMES C. Moretock (Reed. 1 Oct. 63 and in final form 
24 Jan. 64) 


Computation Lab., Marshall Space Flight Ctr, NASA, 
Huntsville, Ala. 


procedure CONFRAC (x, n, parm, answer); 
integer parm, n; real zx, answer; 

comment This procedure utilizes a continued fraction which is 
equivalent to the diagonal of the Padé table for exp z, with 
error in the computed convergent less than x22"/(2 K 62 K (10)? 
X s+ & (4n — 2)%(4n + 2)). This fraction was developed by 
J. ©. Morelock, Note on Padé Table Approximation, Internal 
Note MIN-COMP-62-9, Marshall Space Flight Center, Hunts- 
ville, Alabama, 1962. For source reference see Nathaniel Macon, 
On the. computation of exponential and hyperbolic functions 
using continued fractions, J. ACM, 2(1955) , 262-266. The argu- 
ment, z, is assumed to be less than 7/4. For such x any desired 
level of accuracy is quickly computed for each funetion specified 
as folluws: 


parm := 1, answer: = sins parm := 5, answer := sinh x 
parm := 2, answer := cos x parm := 6, answer: = cosh x 
parm := 3, answer := tan 2 parm := 7, answer := tanh x 
parm := 4, answer :=erpx 


The body of this procedure has been tested using extended 
AuGou for the B-5000 Computer. It gave the following results: 


z=0.50 n=1 parm =1 = answer = 0.47938 801530 
a= 0.50 n=2 parm =1 answer = 0.47042 547125 
xz =050 n=3 parm =1 answer = 0.47942 553854 
+=0.50 n=4 parm =1 answer = 0.47942 553860 
x=050 n=1 parm = 2 ‘answer = 0.87760 305092 
«= 0.50 n=2 parm =2 answer = 0.87758 259869 
x= 0.50 n=3 parm =2 answer = 0.87758 256193 
x=0.50 n=4 parm =2 answer = 0.87758 256189 
z= 0.50 n=1 parm = 3 answer = 0.54624 697437 
«= 0.50 n=2 parm = 3 answer = 0.54630 239019 
«= 0.50 n=3 parm = 3 answer = 0.54630 248974 
x= 050 n=4 parm = 3 answer = 0.54630 248985 
a=050 n= 1 parm = 4 answer = 1.64864 864365 
2a=050 n=2 parm = 4 answer = 1.64872 139973 
a= 0.50 n= 3 parm = 4 .answer = 1.64872 127057 
x= 0.50 n=4 parm =4 answer = 1.64872 127070 
x= 0.50 n=1 parm =5 answer = 0.52104 563580 
x= 0.50 n=2 parm = 5 answer = 0.52109 539374 
a= 0.50 n=3 parm = 5 answer = 0.52109 530541 
z= 050 n=4 parm = 5 answer = 0.52109 530549 
x=0.50 n=1 parm =6 answer = 1.12760 301285 
z=0.50 n=2 parm =6 answer = 1.12762 600598 
a= 0.50 n= 3 parm = 6 answer = 1.12762 596516 
x=0.50 n=4 parm =6 answer = 1.12762 596521 
z=050 n=1 parm =7 answer = 0.46208 251473 
z=0.50 n =2 parm =7 answer = 0.46211 721881 
a= (0.560 2 =3 parm =7 answer = 0.46211 715720 
x = 0.50 n= 4 garm =7 answer = ().46211 715726 
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The value of n selects the continued fraction convergent; 
begin integer 7, ndigt; . 
real r, f; 
ri= if parm S 3 then — x f 2elsez fT 2; 
fr=4xX% n+ 2; 
for? :=nstep —luntilldof:=4xXi—2+4r/f; 
ndigt := if parm S 3 then parm + 1 else parm —3; 
answer >= if ndigt = 1 then (f+2)/(f—z) 
else if ndigt = 2 then 2 X x xX f/((f fT 2) — 7) 
else if ndigt = 3 then ((f f2)+7r)/((f Tf 2)—r) 
else if ndigt = 4 then 2 & x X f/((fT2)+r) 
else x; 
end 


CERTIFICATION OF ALGORITHM 229 [B1] 

ELEMENTARY FUNCTIONS BY CONTINUED 

FRACTIONS [James C. Morelock, Comm. ACM 7 (May 
1964), 296] 

T. A. Bray (Recd. 18 June 1964) 

Boeing Scientific Research Laboratories, Seattle, WA 98124 


KEY WORDS AND PHRASES: Padé 
table 
CR CATEGORIES: 5.19 


Algorithm 229 was coded in Fortran II and run on the IBM 
1620 computer for x = 0.50 and 0.75, for n = 1, 2, 3, 4, and for 
parm = 1, 2, 3, 4, 5, 6, 7. 

For x = 0.50 my values agree with the author’s up to +107". 

For « = 0.75 and n = 4, my values of sin z, cos x, tan x, and 
exp x agree with tabulated values to within +107 !. For the same 
x and 7 my values of sinh z, and cosh wz, and tanh x agree with 
tabulated values to within +107; no tables were available to 
check the 11th decimal. 


continued factions, 
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ALGORITHM 230 

MATRIX PERMUTATION 

J. Booturoyp (Recd 18 Nov. 1963) 

English Electric-Leo Computers, Kidsgrove, Stoke-on- 
Trent, England 


procedure matrixperm(a,b,j,k,s,d,n,p); value n;_ real a,b; 
integer array s,d;_ integer j,k,n,p; 
comment a procedure using Jensen’s device which exchanges 
rows or columns of a matrix to achieve a rearrangement specified 
by the permutation vectors s,d{l:n]. Elements of s specify the 
original source locations while elements of d specify the desired 
destination locations. Normally a and 6 will be called as sub- 
scripted variables of the same array. The parameters j,k nom- 
inate the subscripts of the dimension affected by the permuta- 
tion, p is the Jensen parameter. As an example of the use of this 
procedure, suppose r,c[1:n] to contain the row and column sub- 
scripts of the successive matrix pivots used in.a matrix inver- 
sion of an array a[l:n,l:n]; i.e. rfl], c{1] are the relative sub- 
scripts of the first pivot 7[2], c[2] those of the second pivot and 
so on. The two calls 
matrixperm (alj,pl, alk,p]), jik,r,c,n,p) 
and matrixperm (alp,J], alp,k], j,k,c,7,n,p) 
will perform the required rearrangement of rows and columns 
respectively ; 
begin integer array tag, loc[l:n];_ integer i,t; real w; 
comment set up initial vector tag number and address arrays; 
for 7 := 1 step 1 until n do tagli] := loc[i] := 7; 
comment start permutation; 
for 7 := 1 step 1 until n do 
begin ¢ := s{i];_ j := loc{t]; k := d[z]; 
if jk then begin for p := 1 step 1 until n do 
begin w := a; a:= 6; b:=wend; 
taglj] := tag[k]; tag{k] :== ¢; 
loclt] := locltag{j}]; locltag[j]] := 7 
end jk conditional 
end 7 loop 
end matrixperm 
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ALGORITHM 231 

MATRIX INVERSION 

J. BoorHroyp (Recd 18 Nov. 1963) 

Kinglish Electric-Leo Computers, Kidsgrove, Stoke-on- 
Trent, England 


procedure matrixinvert (a,n,eps,singular); value n,eps; ar- 
raya; integern; realeps; label singular; 

comment inverts a matrix in its own space using the Gauss- 
Jordan method with complete matrix pivoting. I.e., at each 
stage the pivot has the largest absolute value of any element in 
the remaining matrix. The coordinates of the successive matrix 
pivots used at each stage of the reduction are recorded in the 
successive element positions of the row and column index 
vectors r and c. These are later called upon by the procedure 
matrizperm which rearranges the rows and columns of the 


matrix. If the matrix is singular the procedure exits to an appro- 
priate label in the main program; 


begin integer 1,j,k,l,pivi,pivj,p; real pivot; integer array 
r,c[l:n]; 
comment set row and column index vectors; 


for i := 1 step 1 until n do 7{z] := c[z] := 7; 
comment find initial pivot; pivt := pivj := 1; 
for 1 := 1 step 1 until n do for j := 1 step 1 until n do 
if abs (a[z,j]) > abs (a[pivi,pivj]) then begin pivz 
pivj := j end; 
comment start reduction; 
for 7 := I step 1 until n do 
begin 1 := rfi]; r[¢] := rf[pivi]; 
elt] := elpivj]; clpivj] := 1; 
‘if eps > abs (a[r{t],c[¢]]) then 
begin comment here include an appropriate output pro- 
cedure to record 7 and the current values of r[l:n] and 
c{l:n]; go to singular end; 
for j := nstep —1 until7+1,i—1 step —1 until 1 do a[r[t],c[j]] 
:= alr(t],elg]l/alr{e],elt}]; alrlz),c{e]] := 1/afr[z],cfe]}; 
pivot := 0; 
for k := 1 step | until ¢—1, 7+1 step 1 until n do 
begin for j := n step —1 until i+1, 7--1 step —-1 until 1 do 
begin a/[r{k],clj]] := alr[k],cl7]] — alr{t],el]] X alr[k],ce]; 
ifk>i A j>t A abs (al[rlk],clj]]) > abs(pivot) then 
begin pivit := k; piv) := 7; 
pivot := al[r[k]j,c[j]] end conditional 
end jloop; 
a(r[k],cfz]} := —alr|z},c[é}] X afr[k],clz]] 
end kloop 
end loop and reduction; 
comment rearrange rows; matrixperm (alj,p],a[k,p],j,k,r,¢,n,p); 
comment rearrange columns; 
matrizperm (alp,j],a[p,k],j,k,c,r,n,p) 
end matrixinvert 


1= 45 


r[pivt] := 1; 0 := eft]; 


[Epvitor’s Note. On many compilers matrizinvert would run much 
faster if the subscripted variables r[z], c{z], r[k] were replaced by 
simple integer variables ri, ci, rk, respectively, inside the 7 loop.— 
G.E.F.1 
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REMARK ON ALGORITHM 231 [F1] 
MATRIX INVERSION 

[J. Boothroyd, Comm. ACM 6 (June 1964), 347] 
Marts Frrrine (Recd. 23 Nov. 1964) . 
Flygmotor Aeroengine Company, Trollhattan, Sweden 


The algorithm cannot accept the pivot element = 0 which re- 
duces the detection of singularities. We suggest the correction: 


ifk>itAjg>%/A/ abs(a[r[k], cly]]) > abs(pivot) then 
should be | 
ifk>7tAj>zt/ abs(a[r{k], c{j]]) 2 abs(pivot) then 
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ALGORITHM 232 

HEAPSORT 

J. W. J. Wirtuiams (Reed 1 Oct. 1963 and, ees 15 
Feb. 1964) 

Elliott. Bros. (London) Ltd., Boieharwood: Herts, Eng- 
land 


comment The following procedures are related to TREESORT 
{R. W. Floyd, Alg. 113, Comm. ACM & (Aug. 1962), 434, and 
A. F. Kaupe, Jr., Alg. 143 and 144, Comm. ACM 5 (Dec. 1962), 
604] but avoid tiie use of pointers and so preserve storage space. 
All the procedures operate on single word items, stored as 
elements 1 to n of the array A. The elements are normally so 
arranged that A{zJ|SA[j] for 2SjSn, i=j+2. Such an arrange- 
ment will be called a heap. A[1] is always the least element of 
the heap. 

The procedure SETHEAP arranges n elements as a heap, 
INHEAP adds a new element to an existing heap, OUTHEAP 
extracts the least element from a heap, and SWOPHEAP is 
effectively the result of INHEAP followed by OUTHEAP. In 
all cases the array A contains elements arranged as a heap on 
exit. 

SWOPHEAP is essentially the same as the tournament sort 
described by K. E. Iverson—A Programming Language, 1962, 
pp. 223-226—-which is a top to bottom method, but it uses an 
improved storage allocation and initialisation. INHEAP re- 
sembles TREESORT in being a bottom to top method. HEAP- 
SORT can thus be considered as a marriage of these two 
methods. 

The procedures may be used for replacement-selection sort- 
ing, for sorting the elements of an array, or for choosing the 
current minimum of any set of items to which new items are 
added from time to time. The procedures are the more useful 
because the active elements of the array are maintained densely 
packed, as elements A[1] to A[n]; 

procedure SWOPHEAP (A,n,in,out); 

value in,n; integer 7; realin,out; real array A; 

comment SWOPHEAP is given an array A, elements A[1] 
to A[n] forming a heap, n20. SWOPHEAP effectively adds 
the element in to the heap, extracts and assigns to out 
the value of the least member of the resulting set, and leaves 
the remaining elements in a heap of the original size. In 
this process elements 1 to (n+1) of the array A may be dis- 
turbed. The maximum number of repetitions of the cycle 
labeled scan is logon; 

begin integer i,j; real temp, temp 1; 
if inSA[l1] then out := in else 


begin 7 := 1; 
Aln+1] := in; comment this last statement is only 
necessary in case 7=n at some stage, or n=0; 
out := A[l]; 
scan: j := i4+t; 


if jSn then 
hegin temp := Aly]; 
temp 1 := A[j+1]; 
if temp 1<temp then 
begin temp := temp 1; 
jim jt} 


end; 
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if temp < in then 
begin A[i] := temp; 
ti= J; 
go to scan 
end 
end; 
Ali] := in 
end 
end SWOPHEAP; 
procedure INHEAP (A, n, in); 

value in; integern; realin; real array A; 

‘comment INH EAP is given an array A, elements A[1] to 
A[n] forming a heap and n=0. INHEAP adds the element in 
to the heap and adjusts n accordingly. The cycle labeled 
scan may be repeated logan times, but on average is repeated 
twice only; 

begin integer 7,7; 
ti=n:i=nt+l; 

scan: if 1>1 then 
begin j := 1+2; 

if in<A[j] then 
begin A[i] := A[j]; 
t= J; 
go to scan 
end 
end; 
A(t] := in 
‘end INHEAP; 
procedure OUTHEAP (A,n,out); 

integer 7; real out; real array A; 

comment. given array A, elements 1 ton of which form a heap, 
n21, OUTHEAP assigns to out the value of A[1], the least 
member of the heap, and rearranges the remaining menibers 
as elements 1 to n—1 of A. Also, n is adjusted accordingly; 

begin SWOPHEAP (A,n—1, A[n],out); 
ni=n—-l 

end OUTHEAP; 

procedure SETHEAP (A,n); 

value n; integer7; real array A; 

comment SETHEAP rearranges the elements A[1] to A[n] 
to form a heap; 

begin integer j; 

7:=1; 
L: INHEAP(A,j,Alj+1)); 
if j<n then goto L 
end SETHEAP 
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ALGORITHM 233 

SIMPSON’S RULE FOR MULTIPLE 
INTEGRATION 

FRANK OLYNYK* (Recd 24 Dec. 1963) 

Case Institute of Technology, Cleveland, Ohio 


*Partially sponsored by the National Science Foundation under Grant GP-642. 


real procedure Simps (X, 21, x2, delta, f); 
value 21, x2, delta; real X, x1, 22, delta, f; 

comment This procedure calculates a single integral by Simp- 
son’s rule in such a way that it can be called recursively for the 
evaluation of an iterated integral. x1 and x2 are the lower and 
upper limits, respectively, which may be any mathematically 
meaningful expressions. Hence in using Simps for multiple 
integration the region is not limited to rectangular boxes. The 
algorithm terminates when two successive evaluations pass the 
test involving delta. The formal parameter f stands for the 
expression to be integrated. 

As an example of the use of Simps, 


pl (1—2?)4 
| ax [ 
0 0 


would be evaluated by 
Simps (x, 0, 1, delta, Simps(y, 0, sqri(l — x12), delta2, g(x, y))). 
Simps has been written and run in AtGot 60 on the Univaec 
1107 at Case Institute. 
[Eprror’s Nots. Experience of W. McKeeman suggests the 
wisdom of choosing delia2 < delta.—G.E.F.]; 


g(x,y) dy 


begin 
Boolean turing; real 21, 22, 23, h, k; 
turing := false; 


go to box2 end; 
a1 := 42; x2:=h; 


if 21 = x2 then begin zl := 0; 
if <1 > x22 then begin hh := 21; 
turing := true end; 


X:=a@l; z2l:=f; X:= 22; 2:=21:= 21 +f; 
k i= x2 — 21; 
box: 


22:=0; h:= k/2; 

for X := 21 +hAstepkuntilz2do 22 := 22+ /f; 

zl := 21 +4 & 22; 

if h X abs((21 — 2 X 23)/(if z1 = 0 then 1.0 else z1)) < delta 
then go to box2 


else 23.:= 21; 
zl := zl — 2 X 22; 
k := hy 
go to box; 
box2: 
if turing then h := —h; 


Simps := h XK 21/3 
end Simps 
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REMARK ON ALGORITHM 233 [D1] 

SIMPSON’S RULE FOR MULTIPLE INTEGRATION 
[Frank Olynyk, Comm. ACM 7 (June 1964), 348] 

L. G. Proty (Reed. 6 Apr. 1970) 

Department of Mathematics, University of Southampton, 
U.K. 

KEY WORDS AND PHRASES: 


integration, Simpson’s rule 
CR CATEGORIES: 5.16 


numerical integration, multiple 


Algorithm 233 fails in the case 71 = 2x2 since h and, thus, the 
value of the function Simps are undefined. This situation can be 
avoided by replacing the line 

if x1 = x2 then begin zl := 0; go to boz2 end; 
by 

if <1 = 22 then begin Simps := 0.0; go to bozr3 end; 
and by replacing the last two lines of the procedure by 

Simps := h XK 21/3.0; 

box3: 

end Simps 
The algorithm can be marginally improved by replacing each 
integer constant by its equivalent decimal number. 
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POISSON-CHARLIER POLYNOMIALS [823] 

J. M.S. Simézus Pereira (Recd. 6 Jan. 1964) 
Gulbenkian Scientific Computing Center, Lisboa, Portugal 


real procedure PCpolynomial (x, n, a); 

. integer n; real x, a; 

comment PCpolynomial computes values of the Poisson- 
Charlier polynomial pn(x) defined by L. Carlitz, Characteriza- 
tion of certain sequences of orthogonal polynomials, Portugaliae 
Mathematica 20 (1961), 43-46: 


pat) = art(nty-a > (1) (") rat (7) | 


In this algorithm u stands for the successive terms of the summa- 
tion, s stands for the sum of these terms and all other symbols 
possess evident meanings. Clearly each term of the summation 
is obtained from the preceding one by the indicated multipli- 
cation; 


begin 
integer j; real u, 8, c; 
u:= (-I)Tn; 
Si= 4; 
e:= 1; 


begin u := —uX (n— 9) X (@ — 5)/@X G +1); 8 I= 
s + u end; 
PCpolynomial := sqrt(at n/c) X s 
end PCpolynomial 
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POISSON-CHARLIER POLYNOMIALS |J. M. S. 
Simées-Pereira. Comm. ACM 7 (July 1964), 420] 

P. A. Samet (Recd. 17 Aug. 1964) 

Computation Lab., The University, Southampton, Eng. 


PC polynomial was compiled correctly by the Pegasus-ALGoL 
compiler and ran without trouble. The procedure was tested for 
n = 0(1)4, values of a in the range 0.2 to 2.0, and z in the range 
0 to 1. The values produced. were spotchecked by hand. 

The procedure could be improved by 

(i) putting x, n, a in the value part. 

(ii) replacing u := (—1) Tn by 

u:= ifn = n+ 2X 2 then 1 else —1 

(ili) eliminating the separate evaluation of ! by including 
the evaluation of a*-(n!)—! in the main loop. This gives a simpler 
argument for sqrt in the final assignment statement. 

The revised algorithm then reads 
real procedure PCpolynomial (x, n, a); 

value x, n, a; real x, a; integer 7; 
begin integer j; real u, 8, c; 
si=u:i=ifn =n + 2X 2 then 1 else —1; 
c 1; 
for j := 0 step 1 until n — 1 do 
begin wu := —u X (n—j) X (@—j)/(a X G1); 


li 
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8 
c 
end; 
PCpolynomial := sqrt(c) X s 
end PCpolynomial 
This version gave the same results as the original but was ap- 
preciably faster. 


s+ 4; 
ce X a/(j+1) 


oe oe 


COLLECTED ALGORITHMS FROM CACM 


ALGORITHM 235 

RANDOM PERMUTATION [G6] 
RIcHARD DURSTENFELD (Recd. 2 Jan. 64) 
General Atomic, San Diego 12, Calif. 


procedure SHUFFLE (a, n, random); 
value n; integer n; real procedure random; integer 


array a; 
begin 
comment SHUFFLE applies a random permutation to the 
sequence a[z] where i = 1,2, ... , n. The procedure random is 


supposed to supply a random element from a large population 
of real numbers uniformly distributed over the open unit 
interval 0 < r < 1. The array a is declared to be integer but 
actually it suffices for its type to agree with that of the vari- 
able 6 (in the procedure body); 

integer 1, j; real b; 

for 1 := n step — 1 until 2 do 
begin j := entier (1 X random + 1); 

b:= aft}; aft}:= als]; aj] := 
end loop? 
end SHUFFLE 


Note. Numbers in brackets: following Algorithm titles indicate the subject 
eategory for the algorithm, based on the Modified SHARE Classification listing 
given in the March, 1964 issue of the Communications of the ACM. 


REMARK ON ALGORITHM 235 [G6] 

RANDOM PERMUTATION [Richard Durstenfeld, 
Comm. ACM 7 (July 1964), 420] 

M. C. Pixs (Recd. 11 Feb. 1965 and 5 Apr. 1965) 

Statistical Research Unit of the Medical Research Council, 
University College Hospital Medical School, London, 
England 


SHUFFLE applies a random permutation to the complete 
sequence a[z] where 7 = 1, 2, ---,n. SHUFFLE does this in such a 
way that after k calls of the real procedure random the elements 
alt] fori = n—k+1, n—k+2, ---, n are a random permutation of 
the original n elements a[z] wherez = 1,2,---,n taken k at a time. 
In many applications this will be all that is required and by coming 
out of the procedure at this point the remaining n — k — 1 calls 
of random and the subsequent transfers will be avoided; this will 
result in a considerable saving in time if k is much smaller than n. 
The necessary modifications are: 

(1) Amend the procedure heading by adding the variable k: 

procedure SHUFFLE (a, n, k, random); 

value n,k; integer n, k; 

(2) Amend the line 

for 7 := n step —1 until 2 do 
to read: 

k.:= n+1—k; 

for 7 := n step —1 until k do 
Note that at exit a[l:n] will still contain all the elements of the 
original a[i:n], and that if k=n that these modifications will 
make the procedure call random one more time than the original 
SHUFFLE. 
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ALGORITHM 236 

BESSEL FUNCTIONS OF THE FIRST KIND [S17) 
Water Gautscut (Recd. 10 Aug. 1963 and 10 Apr. 1964) 
Oak Ridge National Laboratory, Oak Ridge,-Tenn.* 


* Now at Purdue University, Lafayette, Inds 


real procedure t(y);_ value y;_ real y; 

comment This is an auxiliary procedure which evaluates the 
inverse function ¢ = t(y) of y = ¢Iné (¢21) to an accuracy of 
about 1%. For the interval 0 < y S 10 a fifth degree approxi- 
mating polynomial was obtained by truncating a series expan- 
sion in Chebyshev polynomials. For y > 10 the approximation 
ty) = (y/In(y/a)) (1+ (na—InIn(y/e))/(1+1n(y/a)))-!_ where 
In a = .775f is used; 


begin real p, z; 

if y <= 10 then 

begin _ 

= .000057941 X y — .00176148; p:=yX 

p:=y X p— .129018; p:=y X p+ .85777 
t:=yX p+1.0125 


p + .0208645; 


end 
else 
begin 2 ae 
= In (y) — .775;) p := (.775—In (z))/A+2); 
=1/(+p); t:=y X p/z 
end 
end ¢; 


procedure Japlusn (x, a, nmaz,d, J); value x, a, nmaz, d; 
integer nmaz,d;. realz;a; array J; . 

comment This procedure evaluates to d significant digits the 
Bessel functions Ja,n(x) for fixed a, x and for n = 0,1, -+- , nmaz. 
The results are stored in the array J. It is assumed 
thatO Sa<1,x2>0, and nmaz = 0. If any of these variables is 
not in the range specified, control is transferred to a nonlocal 
label called alarm. The procedure makes use of the real procedure 
t. In addition, it calls for a nonlocal real procedure gamma which 
evaluates I'(z) for 1 S$ z S 2. (See [2].) The method of computa- 
tion is a variant of the backward recurrence algorithm of J.C. 
P. Miller. (See [1].) The purported accuracy is obtained by a 
judicious selection of the initial value » of the recursion index, 
together with at least one repetition of the recursion with » re- 
placed by v + 5. Near a zero of one of the Bessel functions 
generated, the accuracy of that particular Bessel function may 
deteriorate to less than d significant digits. The algorithm is 
most efficient when z is small or moderately large; 

begin integer n, nu, m, limit; real epsilon, sum, dl,‘r; s, L, 
lambda; array Japprox, Rr{[O:nmaz]); 
ifa<O0Va21V2s0V nmaz < 0 then go to alarm; 
epsilon := .6 X 10f (—d); 
for n := 0 step 1 until nmaz do ee = 0; 


+ In an earlier version of this procedure the author used a = 1. 


The value In a = .775 was found empirically by H. C. Thacher, Jr. 


to yield somewhat better approximations. 
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sum := (x2/2)ta/gamma (1+a); 
dl := 2.8026 X d + 1.3863; 
if nmaz > 0 then r := nmazx X t(.5Xd1/nmaz) else r := 0; 
= 1.3501 X x X t(.73576Xd1/z); 
nu := 1+ entier (if rSs then s else r); 
10: m:=0; L:=1; limit := entier (nu/2); 
Tl: m:=m-+1; 
L:=LX (mta)/(m+)); 
- m < limit then go to L1; 
=2xXm; r:=s :=0; 
La: r = 1/(2x (a+tn)/2—r) ; 
comment Conceivably, but very unlikely, division by an 
exact zero, or overflow, may take place here. The user may 
wish to test the divisor for zero, and, if necessary, enlarge it 
slightly to avoid overflow, before this statement is carried out. 
As such a test depends on the particular machine used, it was 
not included here; 
if entier (n/2) ¥ n/2 then lambda : 
sg 
:= DX (n+2)/(n+2xXa); 
ai = DIX (n+a) 
end; 
3:= 7X (lambdat+s); ifn S nmaz then Rr[n—1] := 7; 
n:=n-—I1; ifn = 1 then go to L2; 
J{0] := sum/(1+s); 
for n := O. step 1 until nmaz — 1 do 
for n := 0 step 1 until nmax do 


= 0 else 


J[n+1] := Rr[n] X J[n]; 


if abs((J[n] — Japproz[n})/J[n]) > epsilon then 
begin» 
for m := 0 step 1 until nmaz do Japproz[m] := J[m]; 


nu:=nu+5; goto LO 

end 

end Japlusn; 

procedure Japlusn(z, a, nmaz,d,I); value x, a, nmaz, d; 
integer nmaz, d; real x, a; array I; 

comment This procedure evaluates to d significant digits the 
modified Bessel functions Za ,n(x) for fixed a, z, withO Sa <1, 
x > 0, and for n = 0,1, -:- , nmaz. The results are stored in the 
array I. For the setup of the procedure, and the method of com- 
putation used, see the comment in Japlusn; 

begin integer n, nu, m; real epsilon, sum, dl, r, 8, L, lambda; 
array lapproz, Rr[O:nmaz]; 
ifa<0Va2z1V2S0V nmaz < 0 then go to alarm; 
epsilon := .6 X 10T (—d); 
for n := 0 step 1 until nmaz do Lapproz[n] := 0; 
sum := exp(x) X (2/2) Ta/gamma(1+a); 
dl := 2.3026 X d + 1.3863; 
if nmax > 0 then r := nmaz X t(. 5Xd1/nmaz) else r := 0; 

s:i= if x < di then 1.3591 X x X #(.73576X (dl—z)/z) else 
1.3591 Xz; 


nu := 1+ entier (if rSs then s else 7); 
IN: n:=0; L:= 
Ti: ni=n+1; | 

L:= DL X (n+2Xa)/(n+1); 

ifn < nu then go to L1; 

ri=s:=0; . 
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L2: r := 1/(2X (at+n)/z+1r); 
L:= LX (n+1)/(n+2xa); 
lambda := 2X (n+a) X L; 
s:=rX (lambda+s); if n S nmaz then Rr[n—1] := 7; 
n:=n—1; ifn 2 1 then go to 12; 
Z{O] := sum/(-+s); 
for n := 0 step 1 until nmaz — 1 do I[n-+1] := Rr[n] X I[n]J; 
for n := 0 step 1 until nmaz do 
if abs((I[n]—Iapproz{n])/I[n]) > epsilon then 


begin 
for m := 0 step 1 until nmaz do lapproz[m] := I[m]; 
nu:= nu-+ 5; goto LO 

end 


end laplusn; 


procedure Jaminusn(z, a, nmaz, d, J); 
integer nmaz, d; real z, a; array J; 
comment This procedure evaluates to d significant digits the 
Bessel functions J,_n(x) for fixed a, x, withO <a <1,z2> 0, 
and for n = 0,1, --+ , nmaz. The results are stored in the array 
J. The procedure makes use of the real procedure ¢, and the 
procedure Japlusn. In addition, it calls for a nonlocal real pro- 
cedure gamma which evaluates I'(z) forl & z S 2. (See [2].) The 
accuracy may deteriorate to less than d significant digits if a is 
close to 0 or 1; 

begin integer n; array J1(0:1]; 
if a = 0 then go to alarm; 
Japlusn(x, a, 1, d, J1); 
J(0] := J10); 
Jl] := 2X aX J[0)/x — J1[1]; 
for n := 1step 1 until nmaz — 1 do 

J(n+1] := 2 (a—n) X J[n]/z — J[n—-1] 
end Jaminusn; 


value z, a, nmaz, d; 


il 


procedure laminusn(x, a, nmaz,d,I); value x, a, nmaz, d; 
integer nmaz, d; real x, a; array I; 

comment This procedure evaluates to d significant digits the 
modified Bessel functions Ia_,(z) for fixed a, x, withO <a <1, 
xz > 0, and for n = 0,1, --- , nmaz. The results are stored in the 
array I. The procedure makes use of the real procedure #, and 
the procedure Japlusn. In addition, it calls for a nonlocal real 
procedure gamma which evaluates I'(z) for 1 S z S 2. (See [2].) 
The accuracy may deteriorate to less than d significant digits if 
a is close to 0 or 1; 

begin integer n; array /1[0:1]; 
if a = 0 then go to alarm; 
Taplusn(z, a, 1, d, 1); 


T{0] := £1(0); 
Wj] := 2X aX If0)/* + 11]; 
for n := 1 step 1 until nmaz — 1 do 


I[n+1] := 2X @—n) X I[n]/x + I[n—-1} 
end laminusn; 


procedure Complex Japlusn(x, y, a, nmaz, d, u, v); 
nmax, d; 
integer nmaz, d; real x, y, a; array u, 0; 

comment This procedure evaluates to d significant digits the 
Bessel functions Jain(Z) = Un + itn for fixed real a, fixed complex 
z= x + iy, and for n = 0, 1,--:: , nmaz. The real parts wm, 
U1, ‘**', Unmaz Of the results are stored in the array u, the imagi- 
nary parts vo, V1, °** , Unmaz in the array v. It is assumed that 
0S a <1, nmaz 2 0, and that z is not on the negative real axis 
x S0,y = 0. Otherwise, control is transferred to the nonlocal 
label alarm upon entry of the procedure. The procedure makes 
use of the real procedure ¢. In addition, it calls for a nonlocal 
real procedure gamma which evaluates ['(z) for 1 S z S 2. (See 
[2].) The method of computation is a complex extension of the 
method used in the procedure Japlusn. The algorithm is most 
efficient when | z | is small or moderately large; 

begin integer n, nu, m; real epsilon, y1, 102, 710, phi, c, cl, c2, 


value 2, y, a, 
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suml1, sum2, dl, r, s, lambdal, lambda2, L, rl, 12, s1, 82; 
uapproz, vapprox, Rr1, Rr2[0:nmaz]; 
ifa<O0VaZ2z1V («s0Ay=0) V nmax < 0 then go to alarm; 
epsilon := .5 X 10T (—d); 
for n := 0 step 1 until nmaz do uapproz[n] := vapproz{n] := 0; 
yl := abs(y); 702 := 272+ yT2; 10 := sqrt(r02); 
phi := ifz = 0 then 1.5707963268 elseifz > 0 then arctan(y1/r) 
else 3.1415926536 + arcian(y1/z); 
comment The two constants 7/2 and w in the preceding state- 
ment are to be supplied with the full accuracy desired in the 
final results; 
c := exp(yl) X (r0/2)ta/gamma (1+a); 
suml := c X cos(aXphi—x); sum2 :=c¢ X sin(aXphi—x); 
dl := 2.3026 X d + 1.3863; 
if nmax > O then r := nmax X t(.5Xdl/nmax) else r := 0; 
s := if yl < dl then 1.3591 X r0 X #(.73576X (d1l—yl)/70) else 
1.3591 X 10; 
nu := 1+ entier (if rSs then s else 1); 
10: n:=0; LD:=1; cl :=1; c2:=0; 
Ti: n:=n+1; 
L:=L*X (n+2Xa)/(n+1); 
e:= —cl; cl := ¢2; c2:=c; 
if n < nu then go to 1; 
rl := 72 := sl := 82 := 0; 
£2: ¢ := (2X (atn)—2Xrl+ylXr2) T2 + (@X&r2+y1Xrl) FT 2; 
rl := (2X (atn)xX2—7102X11)/e; 
r2 := (2X (at+n) Xyl+7r02X1r2)/e; 
L:= LX (n+i)/(n+2Xa); ¢ := 2 
lambdal := c X cl; lambda2 := c X 2; 
c:= el; cl := —c2; c2:= 6; 
s:= rl X (lambdal+sl) — r2 X (lambda2-+s2); 
s2 := rl X (lambda2+s2) + r2 X (lambdal+sl); 
sl := 8; 
if n S nmaz then begin Rri{n—1] := 71; Rr2[n—1] := 12 end; 
n:i=n—1; 
if n = 1 then go to L2; 
c := (i+s1) T2 + s272; 
u[0] := (sum1X (1+s81)+sum2x s2)/c; 
v [0] := (sum2~x (1-+s81) —sum1Xs2)/c; 
for n := 0 step 1 until nmar — 1 do 
begin 
u{n+1] := Rri[n] X ul[n] — Rr2[n]  o[n]; 
v[n+1] := Rrli[n] * o[n] + Rr2[n] *K uln] 


array 


xX (nta) X L; 


end; 
if y < 0 then for n := 0 step 1 until nmaz dov[n] := — v[n}; 
for n := 0 step 1 until nmaz do 


if sqrt(((ul[n]—uapproxz[n]}) 7 2+ (v[n]—vapprozx|[n}) fT 2) 
/(u[n] T 2-+v[n] T 2)) > epsilon 


then 
begin 
for m := 0 step 1 until nmazx do 
begin uapproz|[m] := ulm]; vapprox[{m] := v[m] end; 
nu :=nu-+5; goto LO 
end 


end Complex Japlusn 
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BESSEL FUNCTIONS OF THE FIRST KIND [Walter 
Gautschi, Comm. ACM 7 (Aug. 1964), 479] 

WALTER GAUTSCHI (Recd. 24 Aug. 1964 and 2 Nov. 1964) 

Purdue University, Lafayette, Ind. 


All procedures were tested on the CDC 1604-A computer, using 
the Oak Ridge ALGou compiler. 

1. The procedure Japlusn was submitted to the following tests: 

(a) Values of Jn(2) and Jn412(10) were produced for n = 0(1)10, 
calling for an accuracy of d = 6 significant digits. The values ob- 
tained for Jn(2) agreed with those of Table 9.4 in [1] to 10 signifi- 
cant digits (with occasional discrepancies of one unit in the tenth 
figure). The results for Jn412(10) were compared against those of 
Jnvi2(10) = 2.523132521 X j,(10) obtained from Table 10.5 in [1]. 


The maximum discrepancy was found to be five units in the tenth 
figure, occurring for n = 38. 

(b) ‘To observe the performance of the procedure near a zero 
of a Bessel function, we generated Jn(x), n = O0(1)10, for 
x = 2.40482556-—-the 8D value of the first zero jo,, of Jo—calling 
for d = 10 significant digits. The results are shown in the table 
below. 


In(Jo1) 
3.404818490219—3 
6 .006883695510.—4 
9 . 2165787385 10—5 
1.25172710821:0—5 
1.52536561821:0—6 


In(on) n 

—1.193625277510—9 6 

5.1914749680,.—1 7 

4 .317548073810—1 8 

1.9899990578 1o—1 9 

6 .474666637110—2 0 
1.638924327619—2 


arwnwmnroe e® 
— 


The entry for n = 1 agrees to 9 figures with that of —Jo’(jo,1) given 
in Table 9.5 of reference [1]. 

(c) We drove the procedure to calculate Jz4,-1(x) to 6 signifi- 
cant digits, for « = 4(4)20, v = 0(.1)1.9. The results agreed with 
those tabulated in [2]. 

2. The procedure Japlusn was called to generate test values to 
6 significant figures of I,(20), Inji2(10), Intia(.1), for n = 0(1)10. 
The first two sets of values were compared with those in [3] and 
in Table 10.10 of [1], respectively, and found to be in error by at. 
most 5 units in the tenth figure. The value for fij4(.1) agreed to 10 
figures with that given in [5]. 

3. Further checks were made on the procedures Japlusn, 
Taplusn, as well as the procedures Jaminusn, Iaminusn, by having 
them ‘‘verify’’ the relation 


feare(2r) = fayi(x) + 22 fan (2) fasnr2 (2) 


forz =1, a = .2(.2).8, where f,(x) stands for either J,(x) or I,(x) 
(ef. [4], p. 100, formula (21)). That is, we printed the relative errors 
incurred when the infinite series is truncated after the (N+1)- 
st term, N = 0(5)20. Selected results (rounded to four digits) are 
shown in the table below. 


> N 0 5 10 15 20 
2 1.16510—2 2.51910—4 —3.56810—5 1.04310--5 —4.23410—6 
8 —7.94510—2 4.96810—5 —3.45910—8 6.51710--7 —1.92310—7 
4 —8 .09110—2 1.24510—4 —1.45610—5 3.71410--6 —1.361i10—6 
6 —1.02310—1 7.59010—5 —7.04110—-6 1.55310--6 —5.11610—7 


The first two lines refer to f = J, the last two lines to f = I. The 
driver program follows. 


begin integer n; real a, sumJ, sumlI, sJ, sI, errorJ, errorl; 
array J1, I1[0:3], J2, 12[0:22], J3, £3[0:20]; 
for a := .2 step .2 until .9 do 
begin 
if 2X a< 1 then 
begin 
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Japlusn (2.0, 2 X a, 2,6,/1); Iaplusn (2.0, 2 X a, 2,6, 11); 
sumJ := J1[2]; sumI := [12] 

end 

else 

begin 
Japlusn (2.0, 2 X a—1, 3, 6, J1); 
Taplusn (2.0, 2 X a—1, 3, 6, Il); 
sumJ := J1[3]; sumI := 11[3] 

end; 


Japlusn (1.0, a, 22, 6, J2); Jaminusn (1.0, a, 20, 6, J3); 
Iaplusn (1.0, a, 22, 6, 12); Iaminusn (1.0, a, 20, 6, 13); 
sJ := sI := 0; 
for n := 0 step 1 until 20 do 
begin 
sJ := sJ + J3[n] X J2[n4+2|; sJ := sl + ln] X [2[n+2]; 
if entier (n/5) = n/5 then 


begin 
error] := (J2(1] 724+ 2 XK sJ—sumJ)/sumJ ; 
error] := ({2{1]72 + 2 X sl—sumI)/suml; 


outreal (1, a); 
outinteger (1, n); 
outreal (1, errorJ/); 
outreal (1, error!) 


outstring (1, ‘a=’); 
outstring (1, ‘N=’); 
outstring (1, ‘errord =’); 
outstring (1, ‘errorl=’); 
end 
end 
end; 
go to skip; 
alarm: outstring (1, ‘parameters not in range’); 
skip: end 


4. The procedure Complex Japlusn underwent the. following 
tests: 

(a) Values of Jn(re**) were produced forn = 0,1, ¢ = (r—2) 
< 30°, r = 1(1)6, calling for an accuracy of 6 significant digits. 
Comparison with [6] showed agreement to 9-10 significant figures. 

(b) We asked the procedure to ‘‘verify’’ the identity (cf. [4], 
p. 99, formula (2)) 


“ rd -— aa + n) 
(2/2)* Jo(z) = x, an a eae 


(a + 2n)Jaton(2), 


by printing the moduli of the relative errors incurred when trun- 
cating the infinite series at n = 0(1)5. We let a and z run through 
values a = .2(.2).8, z= 2exp (i¢), ¢6 = —150° (30°) 150°, respec- 
tively. Selected results (rounded to three figures) are displayed 
in the table below. 


ss ; 
ee ja\n 0 1 2 3 4 5 
SS se Ge, 
—120 2 1.1710—1 §.5li0—3 1.3l:0—4 1.8510—-6 1.7210—8 2.0210—10 
—30 4 3.16;0—1 2 .0210—2 5.6lio—4 8.7010—6 8.6410—8 5.2710—10 
60 6 2.6010—1 1.65:10-2  4.6710-4 7.4l10-6 7.5l10—8 3.9310—10 
150 8 4.9510—1 4.0010—2 1.2910-3 2.2310-5 2.4l10~-7 1.7510—9 


The same pattern persists throughout the range of the variables. 

The driver program follows. 

begin integer m, n; 
q, $l, s2, p, error; 


real a, phi, c, 8, x, y, suml, sum2, 
array u, v[0:10]; 


for a := .2 step .2 until .9 do 
for m := —5 step 1 until 5 do 
begin 

phi := .52359877560 KX m; 


c := cos(aXphi); s := sin(aXpht); 

a := 2X cos(pht); y := 2 X sin(pht); 

Complex Japlusn (x, y, 0, 0, 6, u,v); 

suml :=c X u[0] — s X v0]; sum2 :=c X oO) + s X ul]; 
Complex Japlusn (x, y, a, 10, 6, u, v); 

q := gamma (1+4); 


sl :=q X uO]; s2:= q X v0]; p := a/a; 
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:= 0; go to skip; 
L: error := sgrt (((suml—sl) 12+ (sum2—s2) f 2)/(sumi T 2 alarm: outstring (1, ‘parameters not in range’); 
+ sum2T 2)); skip: end 


outstring (1, ‘a=’); outreal (1, a); 


: ‘ 3 REFERENCES: 
‘ prone AVN € a 
outstring (dl, ben ); outinteger (1, 30Xm) ; 1. ABRAMOoWITz, M., AND Srecun, I. A. (Eps.) Handbook of Mathematical Func- 
outstring (1, ‘n=’); owutinteger (1, n); tions. NBS Appl. Math. Ser. 55, U.S. Govt. Printing Off., Washington, D.C., 
outstring (1, ‘error=’); outreal (1, error); 1964. 
n:i=n+1; 2. Arrev, J. R. Beasel functions of nearly equal order and argument. Philos. Mag. 
d 


(7) 19 (1935), 230-235. 


i < 
if n = 5 then 3. BAAS. Bessel functions, part II, Functions of positive integer order. Mathematical 


begin Tables, vol. X, Cambridge U. Press, London, 1952. 
p i= —p & ((nta-1)/n) T 2; q:= (at+2Xn) X D; 4. Erpiuyi, A. (Ep.) Higher Transcendental Functions, vol. IZ. McGraw-Hill, New 
a, ‘ oe ; ‘ : York, 1953. ; 
= 7 : +a X ul2xnj; 82 := 82+ q X e[2Xn]; 5. National Burgavu oF Sranparps. Tables of Bessel functions of fractional order, 
goto vol. II. ,Columbia U. Press, New York, 1949. 
end 6. ——. Table of the Bessel Functions Jo(z) and Ji(z) for Complex Arguments. Co- 
end; lumbia U. Press, New York, 1943. 
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REMARK ON ALGORITHM 236 


Bessel Functions of the First Kind [S17] 
LW. Gautschi, Comm. ACM 7,8 (Aug. 1964), 479-480 ] 


Ove Skovgaard [Recd 6 Nov. 1973 and 3 Feb. 1975 ] 
Institute of Hydrodynamics and Hydraulic Engineering, Technical University of 
Denmark, DK-2800 Lyngby, Denmark 


The procedures in Algorithm 236 were coded in PL/I and run on the IBM 
370/165. The following error was discovered for a = 0, nmazx large, and x small, 
e.g. nmax = 50 and x = 0.5. In the last if statement in three of the procedures, 
Japlusn, Laplusn, and Complex Japlusn, division by zero took place. Not all com- 
pilers and computers would pose problems for the above values of the parameters; 
whether or not they do depends on the permissible magnitude of the floating-point 
numbers for the cornpiler and computer used. For the IBM 370/165 the smallest 
positive floating-point number which the computer can hold is approximately 
5.40 * 1077 (see [10, p. 163 ]). 

The following corrections should be made in the procedure Japlusn. 

The last if statement should be replaced by 


if abs(J[n] — Japproz[n]) > epsilon X abs(J[n]) then 

comment Conceivably, but very unlikely, underflow, i.e. the exponent of the 
floating-point number exceeds its lower bound, may take place here. In that case 
the machine representation of ‘floating-point zero’? must be produced if the 
program is to work properly; 


The same comment should be inserted after the statement 
Jin +1] := Rr[n] x J[n]; 


The same corrections should be made in the procedures Japlusn and Complex 
Japlusn at the appropriate places. 

The corrections of the defective if clauses proposed above are most elegant, but 
not the most efficient for all compilers and computers. The following general correc- 
tions in the procedure Japlusn have only one call instead of two calls of the abs- 
function and are therefore more efficient for some compilers. 

Before the last if statement two new lines should be inserted: 


ifJ[n] ¥ 0 then 
begin 


and before the last end statement one new line should be inserted: 


end 
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The two proposed comment statements are still necessary. 

The numerical results are identical for the two methods. 

The same efficient (in some cases) corrections can be made in the procedures 
Iaplusn and Complez Japlusn at the appropriate places. 

According to [5], all the material relevant for the construction of Algorithm 236 
is included in [4, especially Section 5]. This reference is used in the following 
comments, since reference [1] in Algorithm 236 is not easily available. 

The last for statement (of which the delinquent if statement is a part) is included 
for checking purposes only, in order to verify that the required accuracy has indeed 
been attained. According to [5], Gautschi says, ‘I believe, however, that my initial 
choice of » is conservative enough to guarantee this accuracy. For all practical 
reason, therefore, the whole for statement in question could be deleted.” This has 
not been checked by the present author.: 

Because a simplified PL/I version of Algorithm 443 [2, 3] had already been 
implemented in the local university computer library, the call to the real procedure 
¢ was replaced by an application of Algorithm 443 (version B). The solution of 
wexp(w) = y, y > 0 (furnished by. Algorithm 443) corresponds to w(y) = In(t(y)) 
in terms of the procedure t, so that t(y) = exp(w(y)) or t(y) = y/w(y). Algorithm 
443 is less efficient than procedure ¢. The former is more accurate, although this 
accuracy is not necessary here. 

In order to improve the documentation and thereby facilitate modifications 
and/or translations of the procedures Japlusn, Iaplusn, and Complex Japlusn, the 
mathematical constants corresponding to the four decimal constants in the three 
procedures are given here: 2.3026 is In 10, 1.3863 is In 4, 1.3591 is e/2, .73576 
is 2/e. 

The procedure Japlusn was coded using double precision floating-point calcula- 
tions. For implementation on the IBM 370/165 (chopping with 14 hexadecimal 
digits) this gives approximately 15 significant decimal digits. The procedure was 
used to calculate the Bessel function of the first kind for integer orders Jn(z), i.e. 
a = 0. The procedure was programmed with d = 15 (the values of Jn(x) were 
wanted with at least 15 significant digits). The values were checked using the tables 
in [6, 7, 8] and Table III in [9]. It was discovered that the values often had an 
error of 1 to 2 units in the fifteenth digit, where there was no zero of one of the 
Bessel functions to deteriorate the accuracy to less than 15 digits. Tests were run 
to determine whether the results were dependent on the selection of the initial »; 
it must be remembered that the estimate of v is very conservative; see [4, pp. 
50-51]. Systematic tests revealed that it was impossible to obtain the wanted 
accuracy with any v. To simplify testing, when a = 0, all even \ = 2 were used 
(according to [4, p. 49, line 1]), rather than the recursively generated even d 
(according to [4, p. 48, last 11 lines]). With this simplification the procedure 
evaluated J,(x) to 15 significant digits. Near a zero of one of the generated Bessel 
functions, the accuracy of that particular function still deteriorated to less than 15 
significant digits. This deterioration was generally of the same magnitude as 
occurred when \ was generated recursively. 

If the procedures Japlusn, Iaplusn, and Complex Japlusn are contemplated for 
use in the calculation of Bessel functions of integer order only, then they might be 
rewritten directly employing the explicit values of \, rather than generating them 
by an upward and downward recursion. This will make the procedures more effi- 
cient and slightly more accurate. In this connection it is relevant to refer to two 
more recent algorithms, due to Sookne [11-14], dealing with Bessel functions of 
integer order. Procedure Besiri in [12] was translated to PL/I, and tests disclosed 
that the execution time for procedures Japlusn and Iaplusn is of the order twice the 
execution time for procedure Besir1. Therefore Sookne’s procedures, and not the 
procedures in Algorithms 21 and 236 (see [1] and the editorial comment in [15 ]), 
should be used for the calculation of Bessel functions of integer order. 
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GREATEST COMMON DIVISOR [A1] 

J. E. L. Peck (Reed. 16 Dec. 1963) 

University of Alberta, Calgary, Alberta, Canada 


integer procedure Euclidean (a) dimension : (n) linear coeffi- 
cients : (); value a; integer array a, x; integer n; 

comment This procedure finds the greatest common divisor of 
the n nonnegative elements of the vector a, and produces values 
for xz; in the expression (@1 , dz, +++ , Gn) = Git, + dete + -°: 
+ Gntn; 

begin integer array M[1:n, 1:n]; 
integer 7, j, min, maz, imin, imaz, q, t; 
comment We set up M as an identity matrix; 

INITIALISE: 
for 7 := 1 step 1 until n do 

for j := 1 step 1 until n do M{i, j] := 0; 
for 7 := 1 step 1 until n do M[i, 7] := 1; maz := 0; 
comment We search for the least nonzero integer in the array 
a. Note that this step need not be repeated at every iteration 
(see statement labelled DIVIDES) ; 
MINIMUM: 
for 7 := 1 step 1 until n do 
begin ¢ := a[z]; 
if ¢ ~0 A (maz=0Vt<maz) then 
begin max := t; imax := 7 end 
end of minimum search. If the use of the identifier maz is 
confusing, observe the two statements following the label 
MAXIMUM, where the confusion is resolved; 
if max = 0 then go to ERROR; comment ERROR is a global 
label; 

MAXIMUM: imin := imax; min := maz; 
comment We search for the greatest element of a; 
max := a{l]; imax := 1; 
for 7 := 2 step 1 until n do if a[z] > maz then 

begin max := ali]; imax := 7 end of maximum search; 
if maz ~ min then 

REDUCTION : 
begin comment Note that the identity a; = >of mia; 

holds at each stage of the reduction; 
q := max + min; alimaz] := mar := max — q XK min; © 
for j := 1 step 1 until n do 

M{imaz, j] := Mlimaz, 7] — ¢ X M{imin, 7); 

DIVIDES: go to if maz = 0 then MINIMUM else MAXIMUM 
end of the reduction. Note that if max # 0 then maz now con- 
tains the new nonzero minimum. 

If max = min then we are ready with the results; 

for j := 1 step 1 until n do 2[j] := M[imin, J]; 

Euclidean := min 

end of procedure Euclidean 


REFERENCE 


1, BuankinsHip, W. A. A new version of the Euclidean al- 
gorithm. Amer. Math. Mon. 70 (1963), 742-745. 
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CERTIFICATION OF ALGORITHM 237 [A1] 

GREATEST COMMON DIVISOR (J. E. L. Peck, 
Comm. ACM 7 (Aug. 1964), 481] 

T. A. Bray (Recd. 8 Sept. 1964) 

Boeing Scientific Research Laboratories, Seattle, 
Washington 
This procedure was translated into the Fortran IV language 


and tested on the Univac 1107. No corrections were required and 
the procedure gave correct results for all cases tested. 
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CONJUGATE GRADIENT METHOD [F4] 

C. M. Reeves (Recd. 18 Nov. 1963) 

Electronic Computing Lab., Univ. of Leeds, England 


procedure conjugate gradients (x, r, n, matmult); 
value n; real array x, 7; integer 1; procedure maimuli; 
comment The method of conjugate gradients [cf: BECKMAN, 
F.S. Mathematical Methods for Digital Computers. Ch. 4, Ralston, 
A., and Wilf, H. S., (Eps.), Wiley 1960.] is applied to solve the 
equations Ax = b where A is a general nonsingular matrix of 
order n, and x and 6 are vectors. At entry x contains an initial 
approximation to the solution, and r contains b, the vector of 
constants. Both z and r have bounds [1:”]. Up to n+1 iterations 
are carried out and at exit the solution is in x and the corre- 
sponding residuals r = 6 — Az are in r. 
The procedure maimult has the following heading, with semi- 
colons which must now be omitted: 
procedure matmuli (transpose, dat, res) 
Boolean transpose real array dat, res 
comment The datum vector dat is premultiplied by the 
matrix B and the result formed in res where, denoting the 
transpose of A by At, 


B = if transpose then At else A 


The body of matmult will depend upon whether A is stored on 
magnetic tape, and whether all or only its nonzero elements 
are stored. The products should be accumulated in double 
precision, if possible. ; 

begin integer iiérations; real alpha, beta, Ai r sq; 
real array p, temp [1:n]; 
real procedure doi (u, v); 
real array w, 0; 
comment dot is the scalar product of the vectors wu and 9; 


begin integer 7; real sum; sum := 0; 
for i := 1 step 1 until n do sum := sum + ult] X fz]; 
dot := sum 

end of dot; 


procedure combine (f) plus: (c) times: (g) to form: (h); 
value c; 
real c; real array jf, g,h; 
comment f + cg is formed inh; 
begin integer 7; 
for 1 := 1 step 1 until n do A[z] := f[z] + ¢ X gfz] 
end of combine; 
Start: 
for iterations := 0 step 1 until n do 
begin if zéerations = 0 
then begin maimult (false, x) in : (temp); 
combine (r, —1, temp) in : (r); 
maimult (true, r) in : (p); 
Atr sq := dot (p, p); 
end of forming r = 6 — Az, p = Air, and Atr sq 
else begin maimult (true, r) giving Aér in : (temp); 
beta := dot (temp, temp)/At r sq; 
combine (temp, beta, p) in : (p); 
Atrsq := beta X Atr sq 


end; 


if At r sq = 0 then go to finish; 
matmult (false, p) giving Ap in : (temp); 
alpha := dot (temp, temp); 
if alpha = 0 then ge to finish; 
alpha := dot (r, temp)/alpha; 
combine (x, alpha, p) in : (x); 
combine (r, —alpha, temp) in : (r) 
end of iterative loop; 
finish : . 
end of conjugate gradients; 
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FREE FIELD READ [I5] 

W. M. McKeeman (Recd. 12 Dec. 63 and 1 “May 1964) 
Computation Center, Stanford University, Stanford, Calif. 


procedure inreal (channel, destination); 
integer channel; real destination; 

begin comment Each invocation of inreal will read one (number) 
[Revised Report --- ALGOT, 60, section 2.5.1] from the input 
medium designated by the parameter channel and convert it 
into the internal machine representation appropriate for real 
numbers. Successive data values within the ‘data string are 
separated by the blank character u. Integer values from the 
input medium are converted into values of type'real. A nonlocal 
procedure error is invoked whenever a non. (number) is en- 
countered in the input string. The action of error is left un- 
defined; 
real szg, fp, d; 
integer esig, ep, ip, ch; 


value. channel; 


integer procedure CHAR; 

begin comment The value of CHAR is the integer repre- 
senting the next character from the input string. insymbol 
is defined in the “Report on Input-Output Procedure for ALGoL 
60,” ALGOL Bull. No. 16 (May 1964), 9-138; Comm. ACM, to ap- 
pear. Characters occurring in the second parameter of in- 
symbol are mapped onto the integers corresponding to their 
position, left-to-right, within the string. Other basic symbols 
map onto the integer 0. 

The present procedure inreal differs from the inreal of 
the referenced Report on Input-Output Procedures for 
ALGoL 60 in the following ways: 

(a) The report does not specify what values may be pre- 
sented in its inreal, only that whatever is presented will be 
assigned to the second parameter of inreal. I demand that a 
(number) be presented. 

(b) No separator of values on the foreign medium is speci- 
fied. I demand an AuGou string blank.; 
real c 
ineuinbol (channel, ‘0123456789. —-+10u’ ,¢)3 
if c S O then error; comment an illegal character; 
CHAR :=c—1 

end CHAR; 
integer procedure unsigned integer; 
begin comment (unsigned integer) ::= (digit) | (unsigned 
integer) (digit); 
integer u; 
u = 0; 
K: w:=10Xut+ch; 
ch := CHAR; 
if ch < 10 then go to K; 
unsigned integer := u 
end unsigned integer; 
sig:= 1.0; ep :=0; fo := 
L: ch := CHAR; 
if ch = 14 then go to L; comment suppress. initial blanks; 
comment (number) ::= (unsigned number) | + (unsigned 
number) | —(unsigned number); 
if ch = 12 then ch := CHAR 
else if ch = 11 then 
begin comment 12 = “+” and ill = “—”; 
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stg := —1.0; 
ch := CHAR 
end; 


comment (unsigned number) ::= (decima) number) | (ex- 
ponent part) | (decimal number)exponent part); 
if ch S 10 then 
begin comment (decimal number) ::= (unsigned integer) | 
(decimal fraction) | (unsigned integer)(decimal fraction); 
if ch < 10 then ip := unsigned integer else ip := 0; 
if ch = 10 then 
begin comment (decimal fraction) ::= .(unsigned integer); 
ch := CHAR; 
if oes = 10 then error; 
fp: = G3 d := 0.1; 
M: fp := fp + ch X d; 
d:=dx 0.1; 
a table ot reciprocal powers of ten is preferable 
d X 0.1; 


comment a digit must follow the 


comment 
to the statement d := 
ch := CHAR; 
if ch < 10 then go to M 
end — 
end else if ch = 
if ch = 13 then 
begin comment (exponent part) ::= 10(integer): 
ch := CHAR; esig := 1; 
comment (integer) ::= (unsigned 
integer) | —(unsigned integer); 


13 then ip := | else error; 


integer) | +(unsigned 


if ch = 12 then ch := CHAR 
else if ch = 11 then 
begin comment negative exponent; 
esig := —1; 
= CHAR 
end; 
if ch < 10 then ep := unsigned integer X esig else error 
end; 


if ch ~ 14 then error; comment the required ‘‘u’’ separator; 
destination := sig X (ip+fp) X 10.0 fT ep 
end inreal 
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COORDINATES ON AN ELLIPSOID [Z] 

Econ Dorrer (Recd. 8 Jan. 1964 and, rev., 19 May 1964) 

Inst. f. Photogrammetrie, Techn. Hochschule, Munich, 
Germany 


procedure GEODH 1 (L, B, AZ, 8, EPS, lim, A, F, FAIL); 
value 8, EPS, lim, A, F; real L, B, AZ, S, EPS, A, F; 
integer lim; label FAIL; 
comment GEODH | solves the problem of transferring of geo- 
graphical coordinates on an arbitrary ellipsoid of rotation. A is 
the radius of the equator, F is the flattening of the meridian 
ellipse. Before executing GEODH 1, L and B are longitude and 
latitude of a point P; on the ellipsoid. AZ is the azimuth at Py, , 
measured from north, of the geodesic to another point P; , and 
Sis the distance from P; to P, , measured in the same unit as A. 
After execution of GEODH 1, L and B represent the longitude 
and latitude of P, , and AZ is the final azimuth of the geodesic 
at P.. Here L, B, AZ, and EPS are measured in radians. Arbi- 
trarily long distances S can be used, even more than the circum- 
ference. However, the geodesic must not cross the poles or come 
near to them. The problem has been solved by reiterated use of 
the Runge-Kutta method to solve the system of the three first- 
order differential equations of the geodesic on a rotation ellip- 
soid. EPS is the convergence parameter, e.g. a small number 
indicating the desired accuracy, normally 10-° or 10-*. lim is the 
upper limit on iterations, it depends on /7PS, and should not be 
chosen greater than 11 or 12. If lim is reached, computations 
stop, and the FAIL exit is used: 
begin 
real EP2, Lo, Bo, AZo, LL, BL, AZL, So, SL, H, DL, DB, DAZ, 
KL, KB, KAZ, BQ, AZQ, W, H1, T, SINBQ; 
integer i, n, J, 23 
array D{1:4]; D[i] := D[4]:= 1; D[2] := D[B] := 2; 
EP2 := FX (2—F); Lo:= L: Bo:= B; AZo := AZ; 
n:=1; z2:= 0; 
ITERATION: if z = lim then go to FAIL; 
So :=0; LL := Lo; BL := Bo; AZL := AZo; 
for 7 := 1 step 1 until 7 do 
begin 
SL := SX i/n; H := (SL — So)/A; 
DL := DB := DAZ::= KL := KB := KAZ := 0; 
for 7 := 1 step 1 until 4 do 
begin 
T := Diy); 
BQ := BL+ DB/T; AZQ:= AZL+DAZ/T; SINBQ := 
sin(BQ); 
W :=1— EP2 X SINBQ X SINBQ; H1:= HX sqrt(W); 
DL := H1 X sin(AZQ)/cos(BQ); 
DB := H1 X W X cos(AZQ)/(1 — EP2); 
DAZ := DL X SINBQ; 
KL := KL+ DLXT; KB:=KB+DBXT; KAZ := 
KAZ + DAZ X T 
end 7; 
So := SE; LL := LL + KL/6; BL := BL + KB/6; 
AZL := AZL + KAZ/6 
end 7; 
DL := LL —L; DB := BL— B; DAZ := AZL — AZ; 
L:= LL; B:= BL; AZ := AZL; 
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if abs(DAZ) < EPS/sin(S/A) (A (abs(DL) < EPS/cos(B) V 
abs(DB) < EPS) then go to END; 

z2:=1l+2; n:=2Xn; 

go to ITERATION; 

END: 


end GEODH 1 
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ARCTANGENT [B1] 
K. W. Mitts (Recd. 21 Nov. 1963) 
Yomputing Centre, University of Adelaide, So. Australia 


real procedure arg(x, y) exit: (error); value z,y; real x, y; 
label error; 

comment This procedure calculates the argument of a complex 
number « + zy, using a method which is substantially that of 
KE. G. Kogbetliantz, IBM J. Research Develop., Jan. 1958, pp. 
43-53. The result lies in the interval [—7, 7] and the exit error 
is provided for the case when x = y = 0. The procedure is es- 
sentially an ALGou program for the calculation of the arctan- 
gent. arctan(y) is obtained most conveniently by calling the pro- 
cedure with x = 1; 

begin 
array ct, csc2(2:5], in{1:4]; integer k; real w, v, pi, 1, 2; 
pt := 3.1415926536; if x = 0 then 
begin 

if y = 0 then go to error; 

Ll:arg := pi/2 X sign(y); go to exit 
end; 
w:i= y/z; v:= abs(w); 
if v > 1.3408 then go to LI; 
if v < 2.130—22 then r := w else 


begin 
ct[2] := tn[4] := 2.7474774195; 
ct[3] := in[3] := 1.1917535926; 
ct[4] := tn[2] := .57735026919; 
ct[5] := tn[1] := .17632698071; 


csc2[2} := 8.548632169; 
esc2[3] := 2.420276626; 
csc2{4] := 1.333333333; 
csc2[5] := 1.031091204; 
if v < in[1] then 
begin 
k:=1; 2 := .16363636364 x v 


for k := 2 step 1 until 4 do if v < in[k] then go to L3; 
k := 5; 
L383: 2 := .16363636364  (cé[k]—csc2[k]/(v-+et[k])) 
end; 
ri= (pt X (k—-1)/9 + 2/(z2 X 2 + .216649136 — .00270998425/ 
(2 & 2 + .0511194591))) & szgn(w) 
«nd; 
arg := ifz > Othen,r else 
ify = Othen 7 + pi else 
r+ pt X sign(y); 
extt: 
end arg 
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PERMUTATIONS OF A SET WITH REPETITIONS 
[G6] 

T. W. Sac (Reed. 10 Feb. 1964 and 19 June 1964) 

Math. Dept., Manchester U., Manchester, England 


procedure PERMUTATION (X, K, j, process); 
array X; integer array K; integer j; procedure process; 
comment PERMUTATION generates all the distinct permuta- 
tions of an array of numbers consisting of K[1] numbers equal to 
X{1], A[2] numbers equal to X(2], --- , K[j] numbers equal to 
X|j]. The K[z]’s must be positive integers. Each permutation ig 
stored in the array Y and processed according to the user’s wish 
by the procedure process before the next permutation is gen- 
erated. 
{The procedure is more efficient if the sequence K[i| is mono- 
tone decreasing.—Ref.}; 
begin 
realz; integer M,N,1; array B[l:K[j]]; 
procedure permutation (x, M, Nj, B, process); 
realx; integer M,N,j; array B; procedure process; 
begin 
real A; integer 2, KK, N1, N2, 71; 
integer array J[1:N+1]; 
array Y{li:N+M]; N2:=N+M; 
if M = 0 then go tol; 
for 2 := N + 1 step 1 until N2 do Y[i| := x; 
1: for z:= 1 step 1 until N do J[i] := 7; 
JIN+I):= N24+1; jli=j—1; KK:=N; 
2: fort := 1 step 1 until KK do Y[J[{i]] := Biz]; 
if j1 < 1 then begin process(Y); go to 3 end; 
A t= X[jl-1]; Nl:= K[jl—1]; 
permutation (A, N1, N2, 71, Y, process); 
3: for 7 := 1 step 1 until N do 
begin 
YVJ(tll:= 2; J{t]:= J[i] + 1; 
if /(¢| -- J[¢+1] + 1 S O then go to 4 else go to 5; 
4: KK := 17; goto 2; 


end of permutation; 
if j = 1 then begin z := X[1]; M:=0; go te 1 end; 
ei= ATj-1]; M i= K[j-1]; 
lL: for 7 := 1 step 1 until K[j] do Bit] := X[j]; 
permutation (x, M, K{j], j, B, process); 
end of PERMUTATION 
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LOGARITHM OF A COMPLEX NUMBER [B3] 

REWRITE OF ALGORITHM 48 [Comm. ACM 4 (Apr. 
1961), 179; 5 (Jun. 1962), 347; 5 (Jul. 1962), 391; 7 (Aug. 
1964), 485] . 

Davip 8. Cotiens [Recd. 24 Jan. 1964 and 1 Jun. 1964] 

Computer Laboratory, The University, Liverpool, 3, 
England 


This procedure was tested using the DEUCE ALGOL Compiler 
and a small sample of the test data and results are given below. 
procedure LOGC (a, b, c, d, FAIL); value a, b, FAIL; real 

a, b, c, d; label FAIL; 
comment This procedure computes the number ¢ -+ di which is 

equal to the principal value of the natural logarithm of a + bi, 

i.e. such that —r < d S +7. A nonlocal label must; be supplied 

as a parameter of the procedure, to be used as an exit when the 

real part of the result becomes — o. Where required in the body 
of the procedure the numerical values for 7, r/2, and the log- 
arithm of the square root of 8 are provided; 

ifa = 0/) }b = 0 then go to FAIL 

else 

begin 

real e, f; 
e:=05xXa; f:=05X db; 
if abs(e) < 0.5 A abs(f) < 0.5 then 
begin 
abs(2Xa) + abs(2Xb); 
8X a/eXa+8X b/e X b; 
0.5 X (In(c)+ln(d)) —1.08972077084 


Q, 
wu 


= abs(0.5Xe) + abs(0.5xf); 
d:= 0.5 X e/e Xe + 0.5 X fle X f; 
= 0.5 X (In(c) +ln(d)) + 1.03972077084 


d := if a ~ 0 A abs(e) 2 abs(f) then arctan(b/a) + 
(if sign(a)~#—1 then 0 else if sign(b)%-—-1 then 
3.14159265359 else —3.14159265359) else -- arctan(a/b) 
+ 1.57079632679  sign(b) 


end LOGC 
TEST OF LOGC 

a b c d 

—2 —2 +1.039721 —2.356194 
—2 +1 +0.804719 +2.677945 
—1 —1 +0.346573 —2.356194 
—1 +0 +0.000000 +3.141593 
+0 —2 +0.693147 —1.570796 
+0 —1 +0.000000 —1.570796 
+0 +1 +0.000000 +1.570796 
+0 +2 +0.698147 +1.570796 
+1 —l +0.346573 —0.785398 
+1 +0 +0.000000 +0.000000 
+2 —2 +1.039721 — 0.785398 
+2 +1 +0.804719 +0.463647 
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CERTIFICATION OF ALGORITHMI 2438 [B3] 

LOGARITHM OF A COMPLEX NUMBER [David S. 
Collens Comm. ACM 7(Nov. 1964), 660] 

J. BootHroyp (Recd. 18 Jan. 1965) 

Computing Centre, U. of Tasmania, Hobart, Tasmania 


With the label parameter FAIL removed from the value list to 
accommodate a restriction of Elliott 503 ALGOL, the algorithm was 
successfully run on an Elliott 503, using the data test cases pub- 
lished with the algorithm. The constants in the algorithm were 
rounded to nine significant decimal digits, and this probably ex- 
plains the two differences between the results obtained and those 
published, namely: 

a b c d 
—1 —1 0.346574 


2 1 0.463648 
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ALGORITHM 244. 

FRESNEL INTEGRALS [820] 

Hetmut Lotsca* (Recd. 27 May 64 and 11 Jun. 64) 

W. W. Hansen Laboratories, Stanford U., Stanford, Calif. 
AND 

Matcoum GrRayt 

Computation Center, Stanford U., Stanford, Calif. 


(* now at Northrup Space Laboratories, Hawthorne, Calif.) 
({ now at The Boeing Company, Seattle, Wash.) 


procedure FRESNEL. (w, eps, C, S); value w, eps; real w, 
eps, C, 8; 
comment ‘This procedure computes the Fresnel sine and cosine 


integrals C(w) = fo ‘cos [(r/2)2] dt and S(w) = fo sin [(r/2)#] 
dt. It is a modification of Algorithm 213 (Comm. ACM,6 (Oct. 
1963),617) such that the accuracy, expressed by eps, is improved. 
eps can arbitrarily be chosen up to eps = 10 — 6 for a computer 
with sufficient word length as, for example, the Burroughs 
B5000 which has 11-12 significant digits. Referring to the formu- 
las of Algorithm 213: if | w | < +/(26.20/7) the series expansions 
C(w) and S(w) are terminated when the absolute value of the 
relative change in two successive terms is Seps. If |w| 2 
/ (26.20/7) the series Q(x) and P(x) are terminated when the 
absolute value of the.terms is Seps/2. However, this truncation 
point is not necessarily valid for the range +/(26.20/7) S | w | 
< V(28.50/r) when eps = 10 — 6, since the asymptotic series 
must be terminated before arriving at the minimum. In this 
range the ignored terms of the series expansions are < 310 — 6, 
and for larger arguments < 10 — 6. This accuracy may be im- 
proved if desired: the switch-over point from the regular to 
the asymptotic series expansions has to be displaced to larger 
arguments; 

begin 
real x, 22, term; integer n; 
if abs(w) S 10 — 12 then 

begin C := S :=.0; go to aend end 

else x := w X w/0.636619772368; 


w2:= —x Xa; ifx = 13.10 then go to asympt; 
begin 
real frs, frst; 
frs := 2/3; n:=:5; term := 2 X 22/6; 
frst := frs + term/7; 
loops: if abs((frs—frst)/frs) S eps then go to send; 
frs := frst; term.:= term X 22/(nXn—n); 


frst := frs + term/(2Xn+1); 
n:i= n+ 2; go to loops; 
send: S := frsi X w 
end; 
begin 
real fre, fret; 
fre := 1; n:= 4; 
fret := 1 + term/5; 
loope: if abs((fre—fret)/fre) S eps then go to cend; 


term := 22/2; 


fre := frei; term := term X 22/(nXn—n); 
fret := fre + term/(2Xn+1); 
n :=n-+ 2; go to loopc; 
cend: C := fret X w 
end; 


go to aend; 
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asympt: 
begin 
real sl, s2, half, temp; integer 7; 
x2 := 4% 22; term := 3/22; sl := 1+ term; n:= 8; 
for 71 := 1 step 1 until 6 do 
begin 
ni=n+A4; 


term := term X (n—7) X (n—5)/22; 

sl := sl + term; 

if abs(term) S eps/2 then go to next 
end 7; 


next: term := s2:= 0.5/4; n := 4; 
for 7 := 1 step 1 until 6 do 
begin 

n:=n+4; 


term := term X (n—5) X (n—3)/x2; 
s2 := s2 + term; 
if abs(term) S eps/2 then go to final 
end 7; 
final: half := if w < 0 then —0.5 else 0.5; 
term := cos(x); temp := sin(x); 2x2 := 3.14159265359 x w; 
C := half + (tempXsl—termX s2)/x2; 
S := half — (termXs1+temp X s2)/x2 
end; 
aend: 
end FRESNEL 
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TREESORT 3 [M1] 

Rosert W. Fioyp (Recd. 22 June 1964 and 17 Aug. 1964) 
Computer Associates, Inc., Wakefield, Mass. 


procedure TREESORT 3 (M, n); 
value n; array M; integer n; 
comment TREESORT 3 is a major revision of TREESORT 
[R. W. Floyd, Alg. 118, Comm. ACM 6 (Aug. 1962), 434] sug- 
gested by HEAPSORT [J. W. J. Williams, Alg. 232, Comm. 
ACM 7 (June 1964), 347] from which it differs in being an in-place 
sort. It is shorter and probably faster, requiring fewer compari- 
sons and only one division. It sorts the array M[1 in], requiring 
no more than 2 X (27 p—2) X (p—1), or approximately 2 x 
nm X (loge(n)—I) comparisons and half as many exchanges in 
the worst case to sort n = 2fp — 1 items. The algorithm is 
most easily followed if M is thought of as a tree, with M[j+2] 
the father of M[j] forl <j Sn; 
begin 
procedure exchange (x,y); 
begin real ¢; 
end exchange; 
procedure siftup (i,n); value 7, n; integer i, n; 
comment M{i] is moved upward in the subtree of M[1:n] of 
which it is the root; 
begin real copy; integer j; 
copy := M{1]; 
loop:j3 := 2X 14; 
if j < nthen 
begin if j < n then 
begin if M[j+1] > M[j] then j := 7 + 1 end; 
if M[j] > copy then 
begin M[i] := M[j]; 7 := 7; 
end; 
M{t} := copy 
end siftup; 
integer 7; 
for 1 := n+2 step —1 until 2 do siftup (i,n); 
for 1 := n step —1 until 2 do 
begin siftup (1,1); 
comment M[j+2] 2 M{j|forl <j S17; 
exchange (M[1], M[z]); 
comment M[i:n] is fully sorted; 
end 


end TREESORT 3 


real 2,y; 
t:= 2; Xi= Y; y:=t 


go to loop end 


CERTIFICATION OF ALGORITHM 245 [M1] 
TREESORT 3 [Robert W. Floyd, Comm. ACM 7 (Dec. 
1964), 701] 
Puitie S. Aprams (Recd. 14 Jan. 1965) 
Computation Center, Stanford University, Stanford, 
California 
The procedure TREESORT 3 was translated into B5000 Ex- 
tended ALGou and tested on the Burroughs B5500. Tests were run 
on arrays of length 50 to 1000 in steps of 50. For each array size, 50 
random arrays were generated, sorted, timed and checked for 
sequencing. No corrections were required and the procedure gave 
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correct results for all cases tested. 

exchange is unnecessary as a separate procedure, since it is 
used at only one place in TREESORT 38. Sorts were found to run 
significantly faster when the body of exchange was inserted in 
the appropriate place, than when run with the algorithm as 
published. 
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TREESORT 8 [Robert W. Floyd, Comm. ACM 7 (Dec. 
1964), 701]: PROOF OF ALGORITHMS—A NEW 
KIND OF CERTIFICATION 

Ratpu L. Lonpon* (Reed. 27 Feb. 1969 and 8 Jan. 1970) 

Computer Sciences Department and Mathematics Re- 
search Center, University of Wisconsin, Madison, WI 
53706 


* This work was supported by NSF Grant GP-7069 and the 
Mathematics Research Center, US Army under Contract 
Number DA-31-124-A RO-D-462. 


ABSTRACT: The certification of an algorithm can take the form 
of a proof that the algorithm is correct. As an illustrative but 
practical example, Algorithm 245, TREESORT 3 for sorting an 
array, is proved correct. 


KEY WORDS AND PHRASES: proof of algorithms, debugging, 
certification, metatheory, sorting, in-place sorting 
CR CATEGORIES: 4.42, 4.49, 5.24,°5.31 


Certification of algorithms by proof. Since suitable techniques 
now exist for proving the correctness of many algorithms [for 
example, 3-7], it is possible and appropriate to certify algorithms 
with a proof of correctness. This certification would be in addi- 
tion to, or in many cases instead of, the usual certification. Certi- 
fication by testing still is useful because it is easier and because it 
also provides, for example, timing data. Nevertheless the existence 
of a proof should be welcome additional certification of an algo- 
rithm. The proof shows that an algorithm is debuggged by show- 
ing conclusively that no bugs exist. 

It does not matter whether all users of an algorithm will wish 
to, or be able to, verify a sometimes lengthy proof. One is not 
required to accept a proof before using the algorithm any more 
than one is expected to rerun the certification tests. In both 
cases one could depend, in part at least, upon the author and the 
referee. 

As an example of a certification by proof, the algorithm 
TREESORT 8 {2] is proved to perform properly its claimed task 
of sorting an array M[1:n] into ascending order. This algorithm 
has been previously certified [1], but in that certification, for 
example, no arrays of odd length were tested. Since TREESORT 3 
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is a fast practical algorithm for in-place sorting and one with 
sufficient complexity so that its correctness is not immediately 
apparent, its use as the example is more than an abstract exercise. 
It is an example of considerable practical importance. 


Outline of TREESORT 3 and method of proof. The algorithm is 
most easily followed if the array is viewed as a binary tree. 
M[(k+2] is the parent of M[k], 2 < k < n. In other words the 
children of M[j] are M[2j] and M[2j+1] provided one or both 
of the children exist. 

The first part of the algorithm permutes the M array so that 
for a segment of the array, each parent is larger than both of the 
children (one child if the second does not exist). Each call of the 
auxiliary procedure siftup enlarges the segment by causing one 
more parent to dominate its children. The second part of the 
algorithm uses siftup to make the parents larger over the whole 
array, exchanges M[1] with the last element and repeats on an 
array one element shorter. The above statements are motivation 
and not part of the formal proof. 

That TREESORT 3 is correct is proved in three parts. First 
the procedure siflup is shown to perform as it is formally defined 
below. Then the body of TREESORT 8, which uses siftup in two 
ways, is shown to sort the array into ascending order. (The proof 
of the procedure exchange is omitted.) The proofs are by a method 
described in (3, 4, 7]: assertions concerning the progress of the 
computation are made between lines of code, and the proof con- 
sists of demonstrating that each assertion is true each time con- 
trol reaches that assertion, under the assumption that the previ- 
ously encountered assertions are true. Finally termination of the 
algorithm is shown separately. 

The lines of the original algorithm have been numbered and the 
assertions, in the form of program comments, are numbered cor- 
respondingly. The numbers are used only to refer to code and to 
assertions and have no other significance. One extra begin-end 
pair has been inserted into the body of TREESORT 3 in order 
that the control points of two assertions (3.1 and 4.1) could be dis- 
tinguished. In siftup the assertions 10.1 and 10.2 express the cor- 
rect result; in the body of TREESORT 3 the assertions 9.3 and 
9.4 do hkewise. : 


Definition of siftup and notation. We now define formally the 
procedure siftup(i,n), where n is a formal parameter and not the 
length of the array M, Let A(s) denote the set of inequalities 
M|k+2| > MI[k] for 2s < k < n. (If s> n+2, then A(s) is a vacu- 
ous statement.) If A(¢+1) holds before the ecall of siftup(z,n) 
andifl<i<n< array size, then after siftup(t,n): 

(1) A(z) holds; 

(2) the segment of the array M[i] through M[n] is permuted; 
and 

(3) the segment outside M[z] through M[n] is unaltered. 

In arder to prove these properties of siftup, some notation is 
requirdd. The formal parameter 7 will be changed inside siftup. 
Since 7 is called by value, that change will be invisible outside 
siftup. Nevertheless it is necessary to use the initial value of 7 
as well as the current value of 7 in the proof of siftup. Let 7) denote 
the value of ¢ upon entry to siftup. 

Similarly let M> denote the M array upon entry to siftup. 
The notation “M = p(M)) with M := copy’’ means “‘if M[i] := 
copy were done, M is some permutation of M> as described in (2) 
and (3) of the definition of siftup.”?> “M = p(Mo)’’ means the 
same without the reference to M[i] := copy being done. 


Code and assertions for siftup. 
0 procedure siftup(i,n); valuei,n; integer i, 7; 


1 begin real copy; integer j; 
comment 
Li: 1<%=t< n < array size 
1.2: A(éo+1) 


1.3: M = p(Mo); 
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2 copy :=-M{[i]; 
3 loop:j := 2X 4; 
comment 
8.1l:4< 7 
3.2: 26 = 7 
3.0: 4 = 19 or? > 2to 
3.4: M = p(Mo) with M[i] := copy 
3.5: A(io) or (¢ = % and A(to+1)) 
3.6: M[i+2] > copy ort = 1% 
3.7: Mji+2] > M[t] ort = %; 
4 if 7 < n then 
5 begin if 7 < n then 
6a begin if M[j+1] > M[j] then 


6b ji=j+lend; 
comment 
61:7 =f +2 


6.2: 2<jsn 

6.3: 74 = % ort > 2to 

6.4: M = p(Mo) with M[t] := copy 

6.5: A(d) or (¢ = t and A (io-+1)) 

6.6: M[t+2] > copy ort = % 

6.7: Mli+2] > M{i] ori = io 

6.8: (28 < n and M{[j] = max(M[27], M[2i+1])) or 
(2i = n and M[j] = M[n]) 

6.9: M{c] > M{[j] or 7 = 1%; 


7 if M[j] > copy then 
8a begin M[i] := M[j]; 
comment 


8.1: 4 = t ort > 2No 
8.2: 2<7<n 
8.3: M[j+2] = M[t] = Mj] > copy 
8.4: M{i+2] > M[j] or 7 = 16 
8.5: M = p(Mo) with M[j] := copy 
8.6: A (to); 
8b Fe aw 
comment 
8.7: 4 > 2to 
88:47 =7 Sn 
8.9: M[t+2] > copy 
8.10: M[i+2] > Mz] 
8.11: M = p(Mo) with M[t] := copy 


8.12: A (to); 
8c go to loop end 
9 end; 

comment 


9.1: M[j] < copy if reached from 7 or 
2i = j > nif reached from 4; 
10 M{i] := copy; 
comment 
10.1: M = p(Mo) 
10.2: A (i); 
ll end siftup; 


Verification of the assertions of siftup. Reasons for the truth of 
each assertion follow: 
1.1-1.2: Assumptions for using siftup. 
1.3: pis the identity permutation. 
3.1-3.7: If reached from 2, 
3.1: 1.1. 
3.2: 3. 
3.38, 3.5-3.7: 7 = % by 1.1. 
3.4: 1.3 and 2. 
If reached from 8, respectively, 8.8, 3, 8.7, 8.11, 8.12, 
8.9 and 8.10. 
6.1: At3.27 = 27 and by 6b, j might be 27 + 1.7 = j+2 in either 
case. 
6.2: After 4,7 <n. j is altered from 3.1 to 6.2 only at 6b. Before 
6b, j < nby 5. Hence j < nat 6.2.2¢ < jby6.1. 
6.3-6.7: 3.3-3.7, respectively. 


3.5 also requires 1.2. 
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6.8: 


6.9: 


8.1: 
8,2: 
8.3: 
8.4: 
8.5: 


8.6: 


8.7: 


8.8: 
8.9: 


8.10: 


If 4 is true and 5 is false, 7 = 23 = n (using 3.2) so the 
second clause of 6.8 holds. If 4 is true and 5 is true, then 
at 6a, 20 = 7 < n (using 3.2) so M[j+1] = M[2i+1] is 
defined. Now at 6.8, 7 = 2¢ or 7 = 2:+1. In either case, 
by 6a and 6b, the first clause of 6.8 holds. 

By 6.5 7 # % gives A(t). 2% < 22 < j < n by 6.3 and 6.2, 
Hence A (%) and 6.1 give M[i] = M[j+2] > Mj]. 

6.3. 

6.2. 

t= j+2 by6.1, M[t] = M[j] by 8a and M[j] > copy by 7. 

6.7 and 6.9. 

6.4 requires that M[z] be replaced by copy. Since M[i] = 
M{j] by 8a, M[j] may equally well be replaced with copy. 
8.1 and 8.2 give % < 7 < nso that the change to M at 8a 
is in the segment M[] through M[n]. 

By 8a and if 6.8 (first clause) holds, M[i] > M[2c] and M[z] > 
M([2i+1]. By 8a and if 6.8 (second clause) holds, M[z] = 
M{j] = M[n] = M[2¢] and M[2:+1] does not exist for this 
call of siftup. A(¢e+1) holds at 6.5 since A(te) implies 
A(to+1). If ¢ = %, A(éo+1) and the relations above on 
M[t] give A (ie). If ¢ # i, then 8a, 8.4, A(t) at 6.5 and 
the relations above on M{i] give A (zo) at 8.6. 

8b, 8.1 and 8.2. 

8b and 8.2. 

8b and 8.3. 

At 8.6, 2% <j < n by 8.1 and 8.2. Hence by 8.6, M[j+2] > 
M{j]. Use 8b on M[j+2] > MJ]. 


8.11: 8b and 8.5. 
8.12: 8.6. 


9.1: 


10.1-10.2: 


9.1 is reached only if 7 is false or if 4 is false. 2¢ = 7 by 3.2. 
If reached from 7, 

10.1: 6.4 and 10. (6.2 and 6.3 give % < 7 < n ensuring 
the change to M at 10 is in the segment M[z9] 
through M[n].) 

10.2: By 10, 9.1, 6.2 and 6.8, M[z] = copy > M[j] > 
M[2¢] and, if M[2i+1] exists, M[j] > M{[2:+1]. If 
4 = 7%, 10.2 follows as in 8.6. If 7 > a, 6.6 and 
10 give M[t+2] > copy = M[t]. A(t) at 6.5 now 
gives A(t) at 10.2. 

If reached from 4, 

10.1: 3.4 and 10. (8.1 and 3.3 give % < i < n.) 

10.2: 23 > mn means no relations in A(z) of the 
form M[i] >---. If 7 = %, 3.5 gives 10.2. If 
t ¥ %, 3.6 and 10 give M{t+2] > copy = M{z]. 
A (4) at 3.5 now gives 10.2. 


Code and assertions for the body of TREESORT 38. 


integer 7; 
comment 
0.1: A(n+2+1); 
for 7 := n+2 step —1 until 2 do 
begin 
comment 
2.1: A(@+1) 
2.2: Assumptions of siftup satisfied; 
sifiup (i,n); 
comment 
3.1: A(z); 
end; 
comment 
4.1: M[p] < M[pt+l])forn+1<p<n-1 
4.2: A(2), ie. M[k+2] > Mik] for4< k <n; 
for 1 := n step —1 until 2 do 
begin 
comment 
6.1: M[p] < M[pt+l1]) fori +1l<p<cn-1 
6.2: M[k+2] > M{k] for4< k Si 
6.3: MUi+1] > M[r]forl<r<si 
6.4: Assumptions of siftup satisfied; 
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7 siftup (1,1); 
comment 
7.1: M{p] < M[p+1] for 7 + 
7.2: M[k+2] > M[k] for 2 < 
7.8: M{1] > Mir] for2<r<i 
7.4: M{i+1] > M[1]}; 
8 exchange (M[1], M[z]); 
comment 
8.1: MY] > Mir]forl<r<7i-1 
8.2: M[p] < M[p+1] fori < p<n-1 
8.3: Mik+2] > Mik]for4< k <i -— 1; 
9 end; 
comment 
9.1: M[p] < M[p+1] for2< p<n-—1 
9.2: M[2] > M{1] 
9.3: M[p] < M[p+1] for 1 < p< n-— l,ie. M is fully 
ordered 
9.4: M is a permutation of Mo; 


Verification of the assertions for the body of TREESORT 8. 
Reasons for the truth of each assertion follow: 


0.1: Vacuous statement since 2(n+2-+1) > n. 
2.1: If reached from 0.1, by 1 substitute 7 = n+2in0.1. 
If reached from 3.1, by 1 substitute 7 = 7 + 1 in 3.1 to ac- 
count for the change in 7 from 3.1 to 2.1. 
2.2: 2.1, the bound on 7 implied by 1 and the array size being n. 
3.1: 2.1 and the definition of siftup(, n). 
4.1: Vacuous statement. 
4.2: If n > 4, 3 is executed; hence 3.1 with 7 = 2. If n < 3, 
vacuous statement. 
6.1-6.3: If reached from 4.1, 
6.1-6.2: By 5 substitute 7 = n in 4.1 and 4.2. 
6.3: Vacuous statement for? = n. 
If reached from 8.1, by 5 substitute z = ¢ + 1 in 8.2, 
8.3 and 8.1, respectively. 
6.4: 5 and 6.2, i.e. A(2) for the subarray M[1:7]. 
7.1: 6.1 and (8) of siftup. 
7.2: 6.2 and (1) of siftup. 
7.3: 7.2 noting that M[1] = M[k+2] if k = 2 and using the transi- 
tivity of >. 
7.4: Vacuous for 7 = n. Otherwise 6.3 for the appropriate r since 
by (2) of siftup, M[1] at 7.3 is one of the M[r],1 <r <1, 
at 6.3. 
8.1: 7.3 with the changes caused by 8 (only M[1] and M{z] are 
altered by 8). 
8.2: By 8 substitute M[z] for M[1] in 7.4; then 7.1 also holds for 
p = 1. 
8.3: 7.2 excluding only the one or two relations M{1] > --- , and 
the one relation -++ > M[z1]. 
9.1-9.3: If n > 2, 8 is executed; 


9.1: 8.2 with 7 = 2, 
9.2: 8.1 with 7 = 2. 
9.3: 9.1 and 9.2. 
If n < 1, 9.1-9.3 are vacuous statements. 
9.4: The only operations done to M are siftup and exchange all of 
which leave M as a permutation of Mo. 


Proof of termination of TREESORT 8. Provided siftup and ex- 
change terminate, it is clear that TREESORT 3 terminates. Note 
that each parameter of siftup is called by value so that 7 is not 
changed in the body of the for loops. 

The procedure exchange certainly terminates. In siftup the only 
possibility for an unending loop is from 3 to 8b and back to 3. 
Note that all changes to 7 (only at 8b) and toj (only at 3 and 6b) 
occur in this loop and that on each cycle of this loop both 7 and 7 
are changed. By the test at 4, it is sufficient to show that 7 strictly 
increases in value. 7 > 1 means 27 > 7. At 8b, 7 = 7 < 2¢ while at 
3,7 = 2, ie. j(at 3) = 24 > 7 = j(at 8b). Hence each setting to j 
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at 3 strictly increases the value of 7. The only other setting to j 
(at 6b), if made, similarly increases the value of j. 


1. 


2. 
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ALGORITHM 246 

GRAYCODE [Z] 

J. Booturoyp* (Recd. 18 Nov. 1963) 

English Electric-Leo Computers, Kidsgrove, Stoke-on- 
Trent, England 


* Now at University of Tasmania, Hobart, Tasmania, Aust. 


procedure graycode (a) dimension: (n) parity: (s); value n,s; 
Boolean array a; integer n; Boolean s; 

comment elements of the Boolean array a[1:n] may together be 
considered as representing a logical vector value in the Gray 
cyclic binary-code. [See e.g. Phister, M., Jr., Logical Design of 
Digital Computers, Wiley, New York, 1958. pp. 232, 399.] This 
procedure changes one element of the array to form the next 
code value in ascending sequence if the parity parameter s 
= true or in descending sequence if s = false. The procedure 
may also be applied to the classic ‘‘rings-o-seven’’ puzzle [see 
K. E. Iverson, A Programming Language, p. 63, Ex. 1.5]; 
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begin integer i,j; g:=n+1; 
for 1 := n step —1 until 1 do if a[i] then begin s := — s; 
j:= tend; 
-if s then a[l] := - alllelseif j < then a[j+]] := — al[j+1] 
else a[n] := - al[n] 


end graycode 


CERTIFICATION OF ALGORITHM 246 [Z] 
GRAYCODE [J. Boothroyd, Comm. ACM 7 (Dec. 1964), 
701] 
WiuuiaM D..ALLEN (Recd. 8 Feb. 1965 and 23 Feb. 1965) 
Computing Ctr., U. of Kentucky, Lexington, Ky. 
graycode was coded in Fortran IV and tested on the IBM 7040. 
graycode code was generated from 0 to 10,000 in both ascending 


and descending sequence. The procedure required no corrections 
and gave correct results for all cases tested. 


ACM Transactions on Mathematical Software, Vol. 1, No. 3, September 1975. 


REMARK ON ALGORITHM 246 
Graycode [Z ] 


[J. Boothroyd, Comm. ACM 7, 12 (Dec. 1964), 701 ] 


Jayadev Misra [Reed 13 May 1974 and 28 April 1975 ] 
Department of Computer Sciences, University of Texas at ‘Austin, Austin, TX 


78712 


The following modifications to Algorithm 246 will generate Gray code for any J, 
with each code word being generated in a bounded amount of time. Let A be a 
vector of zeros and ones of length N which will be the successive code words. New 
code words are successively generated by reversing a single bit in A each time. 
Routine OUTPUT, to be supplied by the user, is called on generation of every new 


code word. 


Initially A contains all zeros. At every odd-numbered step, A[N ] is reversed. 
At every even-numbered step, ALJ — 1] is reversed, where A[J ] is the rightmost 
one-bit in A. (In case J = 1, the algorithm terminates.) The positions of all the 
one-bits are stored in an increasing order in a stack S, from bottom to top. This 
helps in quickly locating J, the rightmost one-bit. 


REFERENCES 
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COLLECTED 


ALGORITHM 247 

RADICAL-INVERSIE QUASI-RANDOM POINT 

SEQUENCE [G5] 

J. H. Harton anp G. B. Smrra (Reed. 24 Jan. 1964 and 
21 July 1964) 

Brookhaven National Laboratory, Upton, N. Y., and 
University of Colorado, Boulder, Colo. 


procedure QRPSH (K, N, P,Q, R, E); 
integer K, N; real array P,Q; integer array R; real E; 
comment This procedure computes a sequence of N quasi- 
random points lying in the K-dimensional unit hypercube 
given by0 <a: <1, ¢ =1,2,---,K. The ith component of 
the mth point is stored in Q[m,z]. The sequence is initiated by a 
“zero-th point”’ stored in P, and each component sequence is 
iteratively generated with parameter R[7]. E is a positive error- 
parameter. K, N, E, and the P[{i] and R[z] fori = 1, 2, --- , K, 
are to be given. 
The sequence is discussed by J. H. Halton in Num. Math. 2 
(1960), 84-90. If any integer n is written in radix-R notation as 


nonino .0 = no + mR + nek? + +--+ + nak”, 


n= Nm eee 


and reflected in the radical point, we obtain the R-inverse func- 
tion of n, lying between 0 and 1, 


[sn = Nok} + nyR 
+ nok + eae + Nm), 


The problem solved by this algorithm is that of giving a com- 
pact procedure for the addition of R-, in any radix R, to a frac- 
tion, with downward “carry’’. 

Tf Plt] = #1 (s), as will almost always be the case in practice, 
with s a known integer, then Q[m,i] = ¢atij(s-+m). For quasi- 
randomness (uniform limiting density), the integers R[z] must 
be mutually prime. — 

For exact numbers, E would be infinitesimal positive. In prac- 
tice, round-off errors would then cause the ‘“‘carry”’ to be in- 
correctly placed, in two circumstances. Suppose that the stored 
number representing gr(n) is actually ¢r(n) + A. (a) If [A | 
= R-"~!, we see that the results of the algorithm become un- 
predictable. It is necessary to stop before this event occurs. It 
may be delayed by working in multiple-length arithmetic. (b) 
Ifin = R™*! — 1, so that dr(n) = 1 ~ R-™'!, andA < 0, the com- 
puted successor of the stored value can be seen to be about R-™, 
instead of R-™? = ¢p(n+1). This error can be avoided, without 
disturbing the rest of the computation, by adopting a value of 
E greater than any | A | which may occur, but smaller than the 
least (nR})~! (which is smaller than the least R-™~!) to be en- 
countered. 

Small errors in the P[z] will not affect the sequence. Any set 
of P[i] in the computer may be considered as a set of @zi:1(s:), 
for generally large and unequal integers s; , with small round-off 
errors. The arguments used in J. H. Halton’s paper to establish 
the uniformity of the sequence of points 


[ori(n), OR2(N), ‘t' ORR (n)], 
can be applied identically to the more general sequence 


n=1,2,++,N. 


or(n) =0. Noning 


n=1,2,-::,N 


lor (sitn), ori(Se+n), "¢+ ) ORK (sx+n)], 
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Thus, theoretically, any ‘‘zero-th point’? P will do. However, 
the difficulty described in (a) above limits us to the use of P[z] 
corresponding to relatively small integers s; .; 
begin integer 7, m; real r, f, 9g, h; 
for 7 := 1 step 1 until K do 
begin r := 1.0/R[t); 
for m := 1 step 1 until N do 
begin if m > 1 then f := 1.0 — Q[m—1,7] else 
f := 1.0-—P{z]; 
g:= 10; h := 7; 
repeat: if f —h < E then 
begin g :=h; h:=hX 7; 
end 


end 
end QRPSH 


go to repeal end; 


COLLECTED ALGORITHMS FROM CACM 


ALGORITHM 248 

NETFLOW [H] 

Wi.tuiamM A. Briees (Recd. 18 Jan. 1964 and 17 Aug. 
1964) . 

Marathon Oil Company, Findlay, Ohio 


procedure NETFLOW (nodes, arcs, I, J, cost, hi, lo, flow, pt, 

INFEAS); 
value nodes, arcs; integer nodes, arcs; 
integer array I, J, cost, hi, lo, flow, pi; label INFEAS; 

comment This procedure determines the least-cost flow pat- 
tern over an upper and lower bound capacitated flow network. 

Each directed network are a is deaned by nodes I[{a] and 
J [a], has upper and lower flow bounds hi[a] and lo[a], and cost 
per unit of flow cost[a]. Costs and flow bounds. may be any 
positive or negative integers. An upper flow bound must be 
greater than or equal to its corresponding lower flow bound for 
a feasible solution to exist. There may be any number of parallel 
ares connecting any two nodes. 

A multi-source, multi-demand, capacitated transportation or 
transshipment problem may be stated as a network flow problem 
as follows: 

Append to the network (1) bounded ares from the demand 
node(s) to a “super sink,’’ (2) bounded ares from a “‘super 
source’? to the supply node(s), (3) an are directed from the 
“super sink” to the “super source”’ with zero lower bound, a 
large positive upper bound, and a negatively large cost. 

NETFLOW will maximize flow through the low-cost arc from 
“supper sink’? to ‘‘super source’’—subject to the capacity 
constraints of the network—fulfilling all demands optimally. 

The procedure returns vectors flow and pit. Flow[a] is the 
computed optimal flow over network are a. Pi[n] is a number 
--the dual variable—which represents the relative value of 
injecting one unit of flow into the network at node n. NETFLOW 
may be entered with any values in vectors flow and pi (such as 
those from a previous or a guessed solution) feasible or not. If 
the initial contents of flow do not conserve flow at any node, 
the solution values will also not conserve flow at that node, by 
the same amount. This fact can be frequently used to advantage 
in transportation problem definition. The closer initial values 
of flow and pi are to solution values, the shorter the computa- 
tion. 

Procedure NETFLOW is a mechanization of the out-of-kilter 
network flow algorithm described by D. R. Fuukerson in J. 
Soc. Indust. Appl. Math. 9 (1961), 18-27, and elsewhere. Many 
thanks are due the referee for noting some erroneous comments 
and for suggesting ways to increase the efficiency and utility of 
the procedure; 

begin integer a, aok, c, cok, del, e, eps, inf, lab, n, ni, nj, sre, snk; 
integer array na, nb[1: nodes]; 
integer procedure min (a, y); value xz, y; integer z, 4; 

begin if z < y then min := z else min := y end min; 

comment check feasibility of formulation; 
fora := 1step 1 until arcsdoif lo[a]> hila] thengoto]NFEAS; 
inf := 99999999; comment set inf to max available integer, 
aok := 0; 
comment find an out-of-kilter arc; 
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Seek: for a:= 1 step 1 until arcs do 
begin c := cost [a] + p7 [I{a]} — pi [V[a]]; 
if flow [a] < lo fa] V (c<O0A flow[a]<hi{a]) then 


begin src := J [a]; snk :=TI [al]; e:=+1; goto LABL 
end; 
if flow [a] > hi [a] V (c>0/Aflowla)>lo[a]) then 
begin src := I[a]; snk := Jla]; e := —1; goto LABL 
end; 
end; 
comment no remaining out-of-kilter arcs; 
go to FINT; 


comment attempt to bring found out-of-kilter arc into kilter; 


LABL: if a = aok (A na[src] ~ 0 then go to SKIP; 
aok := a; 
for n := 1 step 1 until nodes do na[n] := nb[n] := 0; 
na[src] := abs (snk) X e; nb[src] := abs (aok) X e; 
SKIP: cok := ¢; 
LOOP: lab := 0; 
for a := 1 step 1 until arcs do 


begin if (na{Z[a]]=O0/Anal/[a]]|=0) V 
(na|I[a]]~0/Anal/[a]]#0) then go to XC; 
c¢ := costla] + pi[I{a]] — pi|J[a)]; 
if na[Z[a]] = 0 then go to XA; 


if fowla] 2 hifa] V (flowla]Zzlola]lAc>0) then go to XC; 
na|J [al] := Ila]; nbiJla]} := a; goto XB; 

XA: if flowla] S lola] V (fowla]Shila]|Ac<0) then go to XC; 
nalIla]] := —Jla]; nb[Zfa]} := —a; 


XB: lab := +1; 
comment node labeled, test for breakthru; 
if na[snk] #~ 0 then go to INCR; 
XC: end; 
comment no breakthru; 
if lab) ~ 0 then go to LOOP; 
comment determine change to pz vector; 
del := inj; 
for a := 1 step 1 until arcs do 
begin if (na[Z[a]]=O0/Anal/[a]]=0) V 
(na[I[a]|~0/Ana[/[a]]#0) then go to XD; 
—¢ := costal + pi[Z[a]] — pi[J[a]); 
if na[J[a]] = 0 A flowla] < hifa] then del := min (del, c); 
‘if nalJ[a]] ~ 0 A flow[a] > lofa] then del := min (del, —c); 


XD: end; 
if del = inf /\ (flow[aok]=hi[aok]\/flowlaok]=lolaok]) then 
del := abs (cok); 


if del = inf then go to INFEAS; 
flow pattern; 
comment change pi vector by computed del; 
for n := 1 step 1 until nodes do if na{n] = 0 then pi[n] := 
pi|n}] + del; 
comment find another out-of-kilter arc; 
go to SEEK; 
comment breakthru, compute incremental flow; 
INCR: eps := inf; 
nt = sre; 
BACK: nj := abs (na[ni]); a := abs (nb[ni}); 
c := cost{a] — abs (pi[nt]—pi[nj]) X sign (nb[ni]); 
if nb[nt] < 0 then go to XE; 
ifc > 0 A flowla] < lola] then eps := min (eps, lola]—flow|a]) ; 


comment exit, no feasible 
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ifc $0 A flow[a] < hila] then eps := min (eps, hila]—flow[a]) ; 
go to XF; 
XE: ife <0 A flow[a)] > hila] then eps := min (eps, flow[a] 
—hila)); 
ifc 2 0 A flow[a] > lola] then eps := min (eps, flow[a]—lo[a]); 
XF: ni:= nj; if nt # srcthen go to BACK; 
comment change flow vector by computed eps; 
BACK2: nj := abs (na[ni]); @ := abs (nb[ni]); 
flow[a] := flowla] + eps X sign (nb[nz]); 
ni:= nj; if ni ~ sre then go to BACK2; 
comment find another out-of-kilter arc; 
aok := 0; go to SEEK; 
FINI: end NETFLOW with a feasible, optimal flow pattern 


REMARK ON ALGORITHM 248 [H] 

NETFLOW [William A. Briggs, Comm. ACM 8 (Feb. 
1965), 103] 

J. H. Henprrson, R. M. Knapp, anp M. E. VoLBERDING 
(Recd. 7 Apr. 1966) 

Northern Natural Gas Company, Omaha, Neb. 


KEY WORDS AND PHRASES: capacitated network, linear 
Programming, minimum-cost flow, network flow, out-of-kilter 
CR CATEGORIES: 5.32, 5.41 


Algorithm 248 was transcribed into Burroughs Extended ALGoL 
for the Burroughs B5500. After modification it has been used suc- 
cessfully. Before modification it was found to give erroneous 
values of pz for transportation problems and nonoptimal solutions 
for networks representing multitime level trans-shipment prob- 
lems. This was caused by the method utilized within the procedure 
for exiting with the best solution. The difficulty was circumvented 
by inserting a statement just before label SKJP reading: 


if nb [src] = arcs then go to FINI; 


This statement enables the user to exit the procedure without a 
pass through the pz incrementation block and a final pass through 
the out-of-kilter arc-finding block, saving a significant amount of 
time on sizeable problems. With the arcs arranged so that the are 
directed from the ‘‘super sink”’ to the “super source”’ is the last 
one in the arc array, it must be the last arc remaining out-of- 
kilter. Therefore, by the time the search block discovers it as an 
out-of-kilter arc, an optimal solution has already been found. 

[Algorithm 336 [Comm. ACM 11 (Sept. 1968), 631-632] is an 
improved version of Algorithm 248, which by its very construction 
bypasses this error.—J.G.H.] 


REMARK ON ALGORITHM 248 [H] 

NETFLOW [William A. Briggs, Comm. ACM 8 (Feb. 
1965), 103] 

T. A. Bray anp C. WitzcGaLu 
(Reed. 2 Oct. 1967. and 20 May 1968) 

Boeing Scientific Research Laboratories, Seattle, WA 
98124 


KEY WORDS AND PHRASES: capacitated network, linear 
programming, minimum-cost flow, network flow, out-of-kilter 
CR CATEGORIES: 5.82, 5.41 


We found that 
1. in the statement 
c := costa] — abs(pi[ni]—pi[nj]) K sign(nb[nz)); 
on page 103, column 2, line 3 from below, 
the ‘‘abs’’ should be deleted. 
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2. in the statement 
LABL: if a = aok fA na(src] ~ 0 then go to SKIP; 
on page 103, column 2, line 13 from above, 
the value of na[src] may be undefined. 


The algorithm worked satisfactorily after the corresponding 
changes had been made. We acknowledge a correspondence with 
R. M. Van Slyke and R. D. Sanderson of the University of Cali- 
fornia, Berkeley, on the subject. 

Algorithm 336 [Comm. ACM 11 (Sept. 1968), 631-632] is an im- 
proved version of Algorithm 248 incorporating these changes. 
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ALGORITHM 249 

OUTREAL N [I5)] 

Nixiaus E. Wirts (Recd. 28 Aug. 1964 and 2 Nov. 1964) 
Computer Science Div., Stanford U., Stanford, Calif. 


procedure outreal n (ch,x,n); 
value ch, x, n; real x; integer ch, n; 
comment outreal n outputs to channel ch the real number zx as 
a sequence of characters with n significant decimal digits in the 
form + d.d +--+ dj +d--d, where d stands for a digit. Like the 
procedures outboolean, outstring, ininteger (cf. Report on Input- 
Output Procedures for ALGOL 60, [Comm. ACM 7, (Oct. 1964), 
628-629]) and inreal [Alg. 239, Comm.ACM 7 (Aug.1964), 481] it 
constitutes an example of the use of the primitive procedure 
pair insymbol-outsymbol defined in the referenced Report; 
begin integer 7, j,k, s; real f; integer array a[l:n]; 
procedure outchar(x) value xz; integer 7; 
outsymbol — (ch,‘0123456789-++ —.10’, +1); 
s:i=k:=0; f:=31; 
outchar (if x = 0 then 10 else 11); x := abs(z); 
if x = 0 then begin outchar(0); go to L4 end; 
if < 2 1 then 
begin L1: f:=fX10; s:=s+1; ifs 2fthen goto; 
f2= FX OL} si= es 1 
end 
else 
begin L2: f:=fX01; s:=s—1; 
if x < f then go to L2 
end; 
x:= 2/f; comment now 1 S x < 10; 
for 7 := 1 step 1 until n — 1do 
begin «7 := entier(z); alj] := 74; x := (x—-t) X 10 end; 
aln|] := x; 
for 7 := n — 1 step —1 until 1 do 
begin if a[j+1]) < 10 then go to L6; alj+1] := 0; 
alj] := alg] + 1 
end; 
if a[1] = 10 then begin a[1] := 1; s:=s-+1end; 
6: outchar(a{l]); outchar (12); 
for 7 := 2 step 1 until n do outchar(alj]); 
comment now process the scale factor s; 
if s = 0 then go to LA; 
outchar (13); 
outchar (if s 2 0 then 10 else 11); s := abs(s); 
j:= 10; 
L3: ifs 2jthen beging :=jX 10; k:=k+1; gotoL8end; 
Lb: if k > 0 then 
begin j := j + 10; ¢:= 8 + Jj; outchar (i); s:=s —iX J; 
k:=k-—1; goto L5 
end; 
outchar (s); 
TA: 


end 
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ALGORITHM 250 

INVERSE PERMUTATION [G6] 

B. H. Boonstra (Reed. 12 Oct. 1964) 

Nationaal Kasregisters, NCR Holland, Amsterdam. 


procedure inversepermutation (P) of natural numbers up to: (n); 
value n; integer n; integer array P; 

comment given a permutation P(d) of the numbers 7 = 1(1)n, 
the inverse permutation is computed in situ. The process is 
based on the lemma that any permutation can be written as a 
product of mutually exclusive cycles. Procedure inversepermuta- 
tion has been tested for several permutations including n = 1; 

begin integer 71, 3, k, first; 
switch sss := tag, cycle, next, endcycle, finish; 

tag: fori := | step 1 until n do P{i] :== — Piz]; 
comment now P[i] contains a negative number if original and 

a positive number if inverse; 


first := 1; 
cycle: k := first; «4 := —P{k]; 
next: j := —Pli]; Plt] := k; 
if 7 = first then go to endcycle; 
hk := 4; t := 7; go to next; 


endcycle: if first = n then go to finish; 

first := first + 1; 

if P[first] < 0 then go to cycle else go to endcycle; 
finish: end inversenermutation 


REMARK ON ALGORITHM 250 [G6] 
INVERSE PERMUTATION 
[B. H. Boonstra, Comm. ACM 8 (Feb. 1965), 104] 
C. W. Meptock (Recd. 12 Apr. 1965 and 14 July 1965) 
IBM Corp., Programming Systems, Poughkeepsie, N.Y. 


Several simplifications may be made to the subject algorithm 
to permit more efficient operation. 

1. On many compilers; the procedure would be more efficient 
if the outer loop were written as a for loop. 

2. The initialization of the vector P to negative values may be 
omitted by reversing the interpretation of positive and negative 
values. As revised, P[t] contains a negative number if it contains 
the inverse value and 7 is less than the current walue of the pa- 
rameter n. P[i] contains a positive value in all other cases. This 
allows the for loop labeled tag to be eliminated. 

3. The variable first may be eliminated by declaring the pa- 
rameter n as a value parameter, and utilizing it as the controlled 
variable. of the outer loop. 

The author wishes to thank the referee for valuable suggestions. 

The revised algorithm then reads: 


procedure inversepermutation (P) of natural numbers up to: (n); 
value n; integer n; integer array P; 

comment Given a permutation P(t) of the numbers 2 = L(1)n, 
the inverse permutation is computed in situ; 

begin integer 1, j, k; 


for n := n step —1 until 1 do 
begin 7 := P[n]; 
if: < 0 then Pi[n| := —7 


else if 1 ~ n then 
begin k := n; 


loop: j := Pili]; Plt] := —4h; 
if j = n then P[n] :=7 
else 
begin k := 1;7 := 7; go to loop 
end 
end 
end 


end inversepermutation 
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ALGORITHM 251 

FUNCTION MINIMISATION [B4] 

M. We ts (Recd. 18 July 1964 and 5 Oct. 1964) 
Electronic Computing Lab., U. of Leeds, England 


procedure FLEPOMIN (n, a, f, est, eps, funct, conv, limit, h, 
loadh); 

value n, est, eps, loadh, limit; 
real f, est, eps; integer n, limit; 
array x, h; procedure funct; 

comment function minimisation by the method of Fletcher 
and Powell [Comput. J. 6, 163-168 (1963)]. On entry 2z[1:n] is an 
estimate of the position of the minimum, est an estimate of the 
minimum value, eps a tolerance used in terminating the proce- 
dure when the first derivative of f nearly vanishes, and loadh 
indicates whether or not an approximation to the ‘inverse of the 
matrix of second derivatives of f is available. If! loadh is true 
the procedure supplies the unit matrix as this estimate, other- 
wise it is assumed that the upper triangle of a symmetric posi- 
tive definite matrix is stored by rows in A{L:y x (n+1)/2]. 
The statement funct (n, x, f, g) assigns to f the function value 
and to g[l:n] the gradient vector. 

A successful exit from FLEPOMIN, with conv | Erae; occurs 
if two successive values of f are equal, or if a new value of f 
is larger than the previous value (due to rounding errors), or 
if after n or more iterations the lengths of the ‘vectors s and 
sigma are less than eps. If the number of iterations exceeds 
limit, then an exit occurs with cony false. In either case, the 
final function value, estimated position of the minimum and 
inverse matrix of second dorvativee: are in f, x and h; 

begin 
real oldf, sg, ghg; 
integer 1, J, k, count; 
array g, 8, gamma, sigma [1:n]; 
real procedure dot (a, 5); 
array a, b; 
comment inner product of a and b [In this procedure ana 
in up dot greater accuracy would be obtained by accumulat- 
ing the inner products in double precision;—Ref.]; 
begin integer 7; real s; s := 0; 
for 7 := 1 step 1 until n do s := 
dot := 

end of dot; 
real procedure wp dot (a, b, 1); 

value 7; 

array a,b; integer 7; 

comment multiply b by the ith row of the symmetric 

matrix a, whose upper triangle is stored by rows; 

begin integer j,k; reals; k:= 7; s := 0; 

for j := 1 step 1 until 7 — 1 do 
begin s := s + alk] X b[j]; k: 
to diagonal. Now go along row; 
for j := 7 step 1 until n do s := s + afk+j—7] X d{7]; 
up dot := 
end of up dot; 
set initial h: 
if loadh then 
begin k := 1; 

for 7 := 1 step 1 until n do 

begin Al{k] := 


Boolean conv, loadh; 


s + alt] X bf]; 


=k+n —J end steps 
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- for j := 1 step 1 until n — 7 do h[k+]] := 
k:=k+n-—-7t4+1 
end 
end formation of unit matrix inh; 
start of minimisation: 
conv := true; 
funct (n, x, f, 9); 
for count := 1, count + 1 while oldf > f do 
begin oldf := f; 
for 7 ‘= 1 step 1 until n do 
begin sigma[i] := z[t]; gamma[t] := g[t]; 
s[t] := —up doth, g, t) 
end preservation of x, g and formation of s; 
search along s: 
begin real ya, yb, va, vb, vc, h, k, w, 2, t, 88; 
b :=f; vb := dot(g, s); ss := dot(s, s); 
if vb = 0 then go to skip; 
k := 2 X& ‘(est-f)/vb; 
scale: h:=ifk > Oandk 7 2X ss < 1 thenk else 1/sqrt(ss); 


k :=0; 
extrapolate: ya := yb; va := vb; 
for i := 1 step 1 until n do 2f[i] := 27] +h X s[t]; 


funet(n, x, f, 9); 
yb := f; vb := dot, s); 
if vb < 0 and yb < ya then 
beginh :=k:=h-+k; go to extrapolate end; 
t:= 0; 
interpolate: z:= 8X (ya—yb)/h + va + vb; 
w := sqrt(z Tt 2—vaXvb); 
k :=h X (vb-+w—z)/(vb—vat+2Xw); 
for 7 := 1 step 1 until n do z[i] := 
funct(n, 2, f, 9); 
if f > ya or f > yb then 
begin vc := dot(g, s); 
if ve < 0 then 
begin ya := f; va := 0c; ¢: 
else 
begin yb := f; vb := vc; t: 
go to interpolate 
end; 
skip: end of search along s; 
for 7 := 1 step 1 until n do 
begin sigmal[i] := x{i] — sigma{z]; 
gammali] := glt] — gamma] 
end; 
sg := dot(sigma, yamma); 
if count 2 n then 
begin if sqrt(dot(s, s)) < eps and sqri(dot(sigma, stgma)) < 
eps then go to finish 
end test for vanishing derivative; 
for 1 := 1 step 1 until n do s[{i] := 
ghg := dot(s, gamma); 
k := 1; 
for i := 1 step 1 until n do for j := i step 1 until n do 
begin h[k] := h[{k] + sigmal[i] X sigma[jl/sg — st] X sljl/ghg; 
=k+1 
end updating of h; 
if count > limit then go to exit; 
end of loop controlled by count; 
exit: conv := false; 
finish: end of FLEPOMIN 


x{t] + (t—k) X st]; 


h := k end 


ll 


0; A:=h—kend; 


up dot(h, gamma, 7); 


go to finish; 
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CERTIFICATION OF ALGORITHM 251 [E4] 
FUNCTION MINIMISATION [M. Wells, Comm. ACM 
& (Mar. 1965), 169] 

R. Fietcuer (Recd. 9 Aug. 1965 and 24 Mar. 1966) 
Electronic Computing Lab., U. of Leeds, England 


Two points need 
FLEPOMIN. 
(i) When the method has converged, either or both of the vec- 
tors s and g can become ‘zero, hence also the scalars sg and ghg, 
causing division by zero when updating the matrix h. 
(u) The part of the procedure connected with the linear search 
along s does not make usé of the fact that the identifier h (y in the 
Appendix to the source paper Fletcher and Powell [1]) tends to 1 
as the process converges. This knowledge must be included to 
achieve the rapid convergence obtained by Fletcher and Powell. 
However, the particular choice of 7 given there can also be in- 
sufficient when its optimum value would be much greater than 1 
(as happens for example in the minimization of f(x) = [H(x—1)}? 
where | is the vector (1,1, ---, 1) and H is a segment of the Hil- 
bert matrix, from an initial approximation x = (0, 0, ---, 0)). 

An alternative approach is to estimate 1 by using its value at 
the previous iteration, increasing or decreasing its value by 
some constant factor when appropriate (I have arbitrarily used 
4), This approach removes the need for the estimate est of the 
minimum value of f(z). 

The appropriate changes to be made are thus: 

(i) omit est as a formal parameter, 

(ii) include amongst the real identifiers at the head of the 
procedure body the following: 

step, tla, fa, fb, ga, gb, w, z, lambda 
(iii) replace the statements from the label 
start of minimisation 

to the end of the program by the following: 


correcting concerning the procedure 


start of minimisation: 
conv := true; step := 1; 
funet(n,2z,f,g); 
for count := 1, count +1 while oldf > f do 
begin 
for 7 := 1 step 1 until n do 
begin sigma[t] := 2x[t]; gammal[t] := g[t]; 
s[t] := —up dot(h,g,2) 
end preservation of z,g and 
formation of s; 
search along s: 
fo := f; gb := dot (g,s); 
if gb > 0 then go to exit; 
oldf := f; tla := step; 
comment a change of ita X s is made in x and the function 
is examined. z/a is determined from its value at the previous 
iteration (step) and-is increased or decreased by 4 where 
necessary. It should tend to 1 at the minimum; 
extrapolate: fa := fb; ga := gb; 
for 7 := 1 step 1 until n do 2[t] := z[¢] +7ta X s[z]; 
funcet (n,x,f,9); 
fo := f; gb := dotg,s); 
if gb <0 A fb < fa then 
begin ita := 4 X ila; step 
end; 
interpolate: z:= 3X (fa—fb)/tta + ga + gb; 
w := sqrt (27 2—gaxXgb); 
lambda := ita X (gb-+w—z)/(gb—gat+2xw); 
for 7 := 1 step 1 until n do x[t] := 2[t] — lambda X s[t]; 
funct (n,z,f,g); 
iff > faVf> fb then 
begin step := step/4; 
if fo < fa then 


:= 4X slep; go to extrapolate 
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begin for 7 := 1 step 1 until n do 2[t] := x[t] + lambda X 


sli]; f := fb 
end else 
begin gb := dot(g,s); 


if gb < 0 A count > n A step <10-6 then go to exit; 


fo := f; tta:= ita — lambda; 
go to interpolate 
end; 


skip: end of search along s; 
for 7 := 1 step 1 until n do 


begin sigma [i] := x [¢] — sigma [1], 
gamma|i] := g{t] — gamma|z] 
end; 


sg := dot(sigma,gamma) ; 

if count > n then 

begin if sqrt (dot(s,s)) < eps /\ sqrt(dot(sigma,sigma)) <eps 
then go to finish 

end; 

for 7 := 1 step 1 until n do s[¢] := up dot (h,gamma,i); 


ghg := dot(s, gamma); 
k :=1; 
if sg = 0 V ghg = 0 then go to test; 


for i := 1 step 1 until n do for j := 7 step 1 until n do 
begin h[k] := h[k] + sigmali] X sigmalj]/sg — s[t] X slj]/ghg; 
k:=k+1 
end updating of h; 
test: if count > limit then go to exit; 


end of loop controlled by count; go to finish; 
exit:conv := false; 
finish: 


end of FLEPOMIN 


With these changes the procedure was run successfully on a 
KDF 9 computer on the first of the test functions used by Fletcher 
and Powell, and the appropriate rate of convergence was achieved. 
(The corresponding values in [1, Table 1, col. 4] being 24.200, 
3.507, 2.466, 1.228, 0.043, 0.008, 4 x 10-5). It could well be, however, 
that these changes may still not prove satisfactory on some 
functions. In such cases it will most likely be the search for the 
linear minimum along s which will be at fault, and not the method 
of generating s. It should not be necessary to evaluate the func- 
tion and gradient more than 5 or 6 times per iteration in order to 
estimate the minimum along s, except possibly at the first few 
iterations. 

T am indebted to William N. Nawatani of Dynalectron Corpora- 
tion, Calif., for pointing out the discrepancies in the rates of con- 
vergence, and to the referee for his calculations and comments 
with regard to the Hilbert Matrix function. 
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REMARK ON ALGORITHM 251 [E4] 

FUNCTION MINIMIZATION [M. Wells, Comm. 
ACM 8 (Mar. 1965), 169] 

P. A. Hamitron anp J. Booturoyp (Recd. 17 Dec. 1968) 

University of Tasmania, Hobart, Tasmania, Australia 


KEY WORDS AND PHRASES: function minimization 
CR CATEGORIES: 5.19 ; 


The changes proposed by Fletcher in his ‘‘Certification of Al- 
gorithm 251,’’ Comm. ACM 9 (Sept. 1966), 686, cohtain one mistake 
and one unprotected possible source of error. On page 687, line 2, 
the assignment statement f := fb should be replaced by the proce- 
dure statement funct(n,z,f,g) in order to reset: the gradients in 
g{l : n}. 

In theory, the conditions on z,ga,gb valid for interpolation im- 
ply 2 T 2—gaXgb>0. The statement w := sgri(z Tf 2—gaXgb) should 
therefore be safe. In practice, round-off errors may give rise to 
small negative values of the argument, resulting in an error condi- 
tion which may be avoided with: 


zt 2—gaXqgb; 
wi:i= if w<0 then 0 else sqri(w); 


WwW: 


Numerous tests of this procedure indicate; that two other 
changes are beneficial in reducing the number :of function calls 
required to yield a minimum to some prescribed accuracy. These 
concern the method of calculating the minimum of the interpolat- 
ing cubic and a modification to the extrapolation strategy. 

In the notation of Fletcher’s identifiers, the position of the 
minimum along s over the interval (@=0, b=ita) is a+r where r is 
the root of a quadratic equation given by: 


r = wlaX (ga+z+w)/(gatgb+2Xz) (1) 


and, for ga+z>0, it may be shown that r is the root of larger mag- 
nitude; otherwise, it is the root of smaller magnitude. The distance 
of the minimum from 6 is lambda=ita—r and Davidon[1] seems to 
have originated the proposal that lambda should be evaluated by: 


lambda := itaX (gb+w—z)/(gb—ga+2Xw) 


in order to avoid cancellation. In this respect it is only partly 
successful, and our experience shows that to avoid cancellation 
completely lambda should be calculated in the more orthodox man- 
ner: 


lambda := itaX(1— (if gat+z>0 then (ga+z+w)/(ga+gb+2Xz) 
else ga/(ga+z—w))); 


Once the minimum along s has been bounded, the use of cubic 
interpolation is rewardingly accurate and it is natural to inquire 
whether cubic extrapolation can provide a better farther bound 
than is afforded by an arbitrary search. It may be shown that, pro- 
vided zt 2—gaXgb>0 and r>0 where r is given by eq. (1), cubic 
extrapolation will yield the position of the predicted minimum 
along s asa-+r, using a value for ita given by the step length of the 
previous iteration. To bound the minimum we take the interval 
(a,a+2Xr) if the above conditions are satisfied; otherwise, we 
adopt Fletcher’s strategy of using the interval of the previous 
iteration scaled by a factor of 4. 

REFERENCE: 

1. Davipon, W. C. Variable metric method for minimization. 
ANL-5990. US Atomic Energy Commission Res. Develop. 
Rep., 1959. 
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Remark on Algorithm 251 [E4| 
Function Minimization [M. Wells, Comm. ACM 8 
(Mar. 1965), 169.] 


F. R. House [Recd. 25 Aug. 1970 and 1 Dec. 1970] 
Department of Pharmacology, Guy’s Hospital Medical 
School, London, S.E.1. England 


The above procedure, as modified by Fletcher [1], and Hamilton 
and Boothroyd [2], may appear to fail if the process converges after 
fewer than n iterations. In particular, if the starting point coincides 
with the minimum, failure is certain. The trouble arises from the 
statement 


if gb >0 then go to exit; 
which appears two lines after the label search along s. 
The following modifications are proposed. 


(i) After the first call of funct insert the statement 
if sqrt(dot(g, g)) < eps then 
begin 
for i := 1 step 1 until 7 do x{/] := x[/] +1; funct (n, x, f, g) 
end; 
(ii) Replace the statement 
if gb > O then go to exit; 
by 
if gb = O then go to skip; 
if gb > O then go to exit; 
The apparently perverse move away from the minimum implied 
by modification (i) ensures that A is updated at least once. 


(iii) The text from 

if count > n then 

to 

end; 

should occur after the label fest. The relevant portion of the pro- 
gram reads 


test: if count > n then 
begin 

if sqrt(dot(s, s)) < eps /\ sgrt(dot(sigma, sigma)) < eps 

then go to finish 

end; 
if count > limit then go to exit; 

Experience with the algorithm has shown that when the process 
converges from a poor starting point on a nonquadratic surface 
the final estimate of A is inclined to be somewhat erratic. 

This modification causes A to be updated once more using the 
very latest information, and will often effect a substantial improve- 
ment in accuracy. The estimated position of the minimum is, of 
course, unaffected. 

References: 

1. Fletcher, R. Comm. ACM 9 (Sept. 1966), 686-687. 

2. Hamilton, P. A., and Boothroyd, J. Comm. ACM 12 (Sept. 
1969), 512-513. 
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ALGORITHM 252 [Z] 

VECTOR COUPLING OR CLEBSCH-GORDAN 

COEFFICIENTS 

J. H. Gunn 
(Reed. 17 Aug. 1964, 13 Nov. 1964 and 21 Dec. 1964) 

Nordisk Institut for Teoretisk Atomfysik, Copenhagen, 
Denmark 

real procedure VCC(/1, J2, J, M1, M2, M, factorial); 

value J/1, /2, J, M1, M2, M; 

integer J1, J2, J, M1, M2, M; array factorial; 

comment VCC calculates the vector coupling or Clebsch-Gor- 
dan coefficients defined by the following formula 

(fi my Joma | jr jaj m) 

27+ DC + j2— DMA — je + DMA t+ je +9)! 

(At p+7+1)! 
X [Chr ma) (jr — a) (ja + m2) ja — ma) 7 +m) 1G — m)!] 


x Ds (—)?/lela + fe — j — 2) (ji — m, — 2)! 


= 8(m, + meym| 


(jo + m2 — 2G — potmt+2igG—j — M2 + 2)]] 


where jl = J1/2, 72 = J2/2,7 = J/2, ml = M1/2, m2 = M2/2, 
m = M/2, [Reference formula 3.6.11, p. 45 of Epmonps, Alan 
R. Angular momentum in quantum mechanics. In Investiga- 
tions in Physics, 4, Princeton U. Press, 1957.]. The parameters 
of the procedure, J1, J2, J, M1, M2 and M, are interpreted as 
being twice their physical value, so that actual parameters may 
be integers. Thus to call the procedure to calculate (} 0 30|$ 
2 90) the call would be VCC(1, 1, 0, 0, 0, 0, factorial). The pro- 
cedure checks that the triangle conditions for the existence of a 
coefficient are satisfied and that j1 + j2 + 7 is integral. If the 
conditions are not satisfied the value of the procedure is zero. 
The parameter factorial is an array containing the factorials 
from 0 up to jl + 72 + 7 + 1. Since in actual calculations the 
procedure VCC will be called many times it is more economical 
to have the factorials in a global array rather than compute 
them on every entry to the procedure; 
begin integer z, zmin, zmax; real cc; 
if M1 + M2 # M V/ abs(M1) > abs(/1) V abs(M2) > abs(J2) \/ 
abs(M) > abs) VJ > JL + J2V J < abs(J1—J2) \V J 
+J2+ 5) #2xX (Ji+/24+/) +2) then VCC := 0 else 
begin zmin := 0; 
if J ~ J2+ M1 < O then zmin := —J + J/2 — Mi; 
ifJ —J1 — M2+ zmin < Othen zmin -= —J + J1 + M2; 
zmar: = Ji + J2 — J; 
if J2 + M2 — zmax < 0 then zmaz := J2 + M2; 
if /1 — Ml — zmax < 0 then zmaz := J1 — M1; 


cc := Q; 
for z := zmin step 2 until zmaz do 
cc ce + (if z=4X (z+4) then | else —1)/(factorial[z+2 


factorial| (J1+J2—J—z) +2] 
factorial[(J1—M1—z) +2] 
factorial[{(J2+ M2—z) +2] 
factorial] (J —J2+M1+2z) +2] 
factorial[{(J —J1—M2+2z) +2}); 


xxxxx! 


vce 


xXx XXX 


end 
end VCC 
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:= sqri(J+l1) X factorial[(J1+J2—J) +2] 


factorial[(J1—J2+/) +2] 

factorial[(—J1+/2+J)+2] x factorial[(J1+M1) +2] 
factorial{((J1—M1)+2] X factorial[(J2+M2) +2] 
factorial[(J2—M2)+2] xX factorial[(J+M) +2] 
factorial[(J —M) +2]/factorial[(J1+J/2+J+2) +2]) 

cc 
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ALGORITHM 253 [F2] 

EIGENVALUES OF A REAL SYMMETRIC MATRIX 
BY THE QR METHOD 

P. A. BusincEr* 
(Reed. 17 Aug. 1964, 3 Nov. 1964 and 8 Dee. 1964) 


University of Texas, Austin, Texas 

* This work was supported in part by the National Science Foundation through 
grant NSF GP-217 and the Army Research Office through grant DA-ARO(D) 31-124- 
G888. Thanks are due the referee for suggesting several improvements. 


procedure symmetricQRh 1 (n,g); valuen; integer 7; 
array 9; 

comment uses Householder’s method and the QR algorithm to 
find all n eigenvalues of the real symmetric matrix whose lower 
triangular part is given in the array g[l:n, 1:n]. The computed 
eigenvalues are stored as the diagonal elements g[z, 7]. The 
original contents of the lower triangular part of g are lost during 
the computation whereas the strictly upper triangular part of g 


is left untouched. 


REFERENCES: 
Francis, J. G. F. The QR transformation—Part 2. Comput. J. 4 (1961), 3382-345. 
Orveca, J. M., ano Karspr, Il. F. The LLT and QR methods for symmetric tri- 
diagonal matrices. Comput. J. 6 (1963), 99-101. 
Paruerr, B. The development and use of methods of LR type. New York U., 
1963. 
Wiixkinson, J. H. 
(1962), 354-361. 


Touseholder’s method for symmetric matrices. Numer. Math. 4, 


Test Resutts: 

A version of this procedure acceptable to the Oak Ridge ALGoL 
compiler was tested on a CDC 1604 computer (relative machine 
precision 1.539-11). For a number of testmatrices of order up to 
64 the dominant cigenvalue was found to at least 8 digits and 
it was always among the most accurate values computed. In 
some cases the accuracy of the nondominant eigenvalues varied 
greatly, in one case the least accurate value had only 4 good 
digits. 


EXAMPLE: 
For the 5X5 symmetric matrix whose lower triangular part is 


5 
4 6 
3.0 7 
24 6 8 
13 579 
this prodecure computed the cigenvalues  22.406875305, 
7.5137241530, 4.8489501197, —1.0965951813, 1.8270455994; 
begin 
real procedure sum (1, m,n,a); value m,n; 
integer i, m,n; real a; 
begin reals; s := 0; 
forz:= m_ stepluntilndos := s+a; sum:=s 
end sum; 
real procedure maz (a,b); valuea, b; real a, b; 


max := ifa > b thena else b; 

procedure Householder tridiagonalization 1 (n, g, a, bg, norm); 
value n; integer n; array g,a,6q; real norm, 
comment nonlocal real procedure sum, maz; 
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comment reduces the given real symmetric n by n matrix g 
to tridiagonal form using n—2 elementary orthogonal trans- 
formations (I—2ww’) = (U-gamma uu’). Only the lower tri- 
angular part of g need be given. The diagonal elements and 
the squares of the subdiagonal elements of the reduced matrix 
are stored in a[l:n] and bg{l:n—1] respectively. norm is set 
equal to the infinity norm of the reduced matrix. The columns 
of the strictly lower triangular part of g are replaced by the 
nonzero portions of the vectors u; 

begin integer 7, j,k; real t, absb, alpha, beta, gamma, sigma; 
array p[2:n]; 
norm := absb := 0; 
for k := 1 step 1 until n—2 do 
begin afk] := g[k, &]; 

sigma := bq[k] := sum(i, k+1, n, g[z, k] f 2); 
t := absb+abs(alk]); absb := sgrt(stgma); 
norm := mazx(norm, t+absb); 
if segma ~ 0 then 
begin alpha := g{k+1, k]; 
beta := if alpha < 0 then absb else—absb; 
gamma := 1/(sigma—alphaXbeta); gl{k+1, k] := alpha— 
beta; 
for 7 := k+1 step 1 until n do 
pli] := gammaX (sum(j, k+1, 1, gle, J)Xgl7, AI) + 
sum(j, i+1, n, gj, ]Xg19, KI); 
t:= 0.5XgammaXsum(i, k+1, n, g{t, k)Xplt]); 
for 7 := k+1 step 1 until n do p[z] := plt]—tXglt, 4]; 
for 7 := k+1 step 1 until n do 
for j := k+1 step 1 until 7 do 
gli, J] := glt, jJ]—glt, klxply]—plt] xl, &] 


end 
end k; 
a[n—1] := g[n—1, n—1]; ba[n—1] := gin, n—1JT 2; 
a[n] := gln, nj; t := abs(g[n, n—1)); 
norm := max(norm, absb+abs(a[n—1])+1); 
norm := max(norm, t+abs(a[n])) 


end Householder tridiagonalization 1; 

integer 71, k, m, m1; real norm, epsq, lambda, mu, sql, sq2, u, 
pq, gamma, t; array allin], bg[0:n—1]; 

Householder tridiagonalization 1(n, g, a, bg, norm); 

epsg := 2.2519-22X norm} 2; comment The tolerance used in 
the QR iteration depends on the square of the relative ma- 
chine precision. Here 2.25,9-22 is used which is appropriate 
for a machine with a 36-bit mantissa; 


mu:= 0; m:= n; 
inspect: if m=0 then go to return else 2 := k := ml := m—1, 
bg(0] := 0; 
if bg[{k] S epsq then 
begin g[m, m] := a[m]; mu := 0; m:=k; 
go to inspect 
end; 


for 7 := i—1 while bg|¢] > epsq do k := 7; 
if k = ml then 
begin comment treat 2 X 2 block separately; 
mu := alml]xalmJ—bq[m1]; sql := alml]+a[m]; 
sq2 := sgri((a[m1]—a[m]) fT 2+4xXb¢[m1)); 
lambda := 0.5X (if sql 2 0 then sq1+sq2 else sql1—sq2); 
glm1, m1] := lambda; g[m, m] := mu/lambda; 
mu:=0; m:=m-—2; go to inspect 
end; 


COLLECTED ALGORITHMS (cont.) 


lambda := if abs(a[m|—mu) < 0.5Xabs(a[m]) then a[m]+0.5X 
sgrt(bq|m1]) else 0.0; 

mu := alm; sql := sq2:= u := 0; 

for z := & step 1 until ml do 

begin comment = shortcut single QR iteration; 


gamma := ali|—lambda—u; 
pq := if sqi¥1 then gammaf 2/(1—sql) else (1—sq2)X 
bg{e—1)]; 


t:= pqtba|i]; bgli—1] := sqlXt; sq2 := sql; 
sql := bg[tl/é; w:= sq1X (gamma+al[i+1]—lambda); 


alt] := gammatutlambda 
end 7; 
gamma := alm|—lambda— 1; 
bg[m1] := sqixX(if sqi:¥1 then gamma | 2/(1—sqi)_ else 


(1—sq2) Xbq[m1]) ; 
alm] := gamma+lambda; go to inspect; 
return: end symmetric QR 1 


CERTIFICATION OF. ALGORITHM 253 [F2] 

KIGENVALUES OF A REAL SYMMETRIC MATRIX 
BY THE QR METHOD [P. A. Businger, Comm. 
ACM 8 (April 1965), 217] 

Joun H. Weuscu (Recd. 3 June 1965, 1 Aug. 1966 and 
1 Mar. 1967) 

Stanford Linear Accelerator Center, Stanford, California 


The procedure symmetrig QR 1 was transcribed into ALGOL for 
the Burroughs B5500 (39-bit. mantissa) and tested with no syntax 
or logic changes (except to change the tolerance from 2.25;9-22 
to 3.3510-24). The eigenvalues of the matrix in the example given 
in the procedure declaration were found to 15 units in the 11th 
significant place and in the order given. 

Two defects of this algorithm have been found (personal com- 
munication from Prof. W.. Kahan); one concerning the conver- 
gence, the other concerning the numerical stability. 

The procedure symmetric QR 1 was slow to converge on matrices 
of large order with the form 

0 1 
1 O 1 
1 0 1 


1 
L 0 


The trouble is caused by a poor choice of the shift, lambda, 
for accelerating convergence. The fault was corrected as described 
in the Certification of Algorithm 254. 

The second defect is not as easy to detect cr correct. On matrices 
of large order with pairs of eigenvalues of opposite sign, members 
of the pairs were found to varying accuracy. Another indication 
of an instability was a distinct jump in the computed values of 
the eigenvalues of the matrix 


x | 
1 L 
1 -2z 1 
1 -1 
ut gz = 1075, as x was given the values 107%, 1074, --- , 107". 


It, appears that the square-root-free QA Algorithm described 
by Ortega and Kaiser (“The LL™ and QR methods for symmetric 
tridiagonal matrices,’ Comput. J. 6 (1963), 99-101) is numerically 
unstable; therefore Algorithm 253 should be avoided. [Rutis- 
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hauser (Letter to the Editor, Comput. J. 6 (1968), 133) suggested a 
modification which is also mentioned by Wilkinson (The Algebraic 
Eigenvalue Problem, Clarendon Press, Oxford, 1965, p. 567). How- 
ever, even with this modification the Algorithm is numerically 
unstable as was pointed out in a private communication from 
Wilkinson to Kahan (1966)—ReErF.] 
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ALGORITHM 2354 [F2] 
KIGENVALUES AND EIGENVECTORS OF A REAL 
SYMMETRIC MATRIX BY THE QR METHOD 
P. A. Busincrr* 
(Reed. 17 Aug. 1964, 17 Nov. 1964 and 8 Dee. 1964) 
University of Texas, Austin, Texas 


*This work was supported in part by the National Science Foundation through 
grant NSF GP-217 and the Army Research Office through grant DA-ARO(D) 31-124- 
G388. Thanks are due the referee for suggesting several improvements. 
procedure symmetric QR 2 (n, g, x); 

array g, Z; 
comment uses Householder’s method and the QR algorithm to 

find all n eigenvalues and eigenvectors of the real symmetric 

matrix whose lower triangular part is given in the array g. The 
computed eigenvalues are stored as the diagonal elements 
giz, 7] and the eigenvectors as the corresponding columns of the 

array x. The original contents of the lower triangular part of g 

are lost during the computation whereas the strictly upper 

triangular part of g is left untouched. 


value n; integer 7; 


REFERENCES: 
Francis, J.G.F. The QR transformation—Part 2. Comput. J. 4 (1961), 382-345. 
Paruett, B. The development and use of methods of LR type. New York U., 
1963. 
Witxkinson, J. H. Householder's method for symmetric matrices. Numer. Math. 
4 (1962), 354-361. 


Test RESULTS: : 

A version of this procedure acceptable to the Oak Ridge ALGoL 
compiler was tested on a CDC 1604 computer (relative machine 
precision 1.51;-11). For a number of testmatrices of order up to 
64 the dominant eigenvalue was found to at least 9 digits. Eigen- 
values much smaller in magnitude than the dominant eigenvalue 
have fewer accurate digits. In some cases the components of the 
eigenvectors were slightly less accurate than the eigenvalues. 


HEIXAMPLE: 
For the 5 X 5 symmetric matrix whose lower triangular part is 


this procedure computed the eigenvalues A,=22.406875306, 
Ne =7.5187241547, A3=4.8489501203, Ay= —1.0965951820, 
As= 1.38270455995, and the corresponding eigenvectors 
x, = (0.24587793851, 0.30239603954, 0.45321452335, 
_  0,57717715229;, 0.55638458400) , 
ze = (0.55096195546, 0.70944033954, —0.34017913315, 
— 0.083410953290, —0.26543567685), 
ag = (0.54717279573, —0.31256992008, 0.61811207635, 
—0.11560659356, —0.45549374666) , 
x, = (—0.46935807220, 0.54221219466, 0.54445240360, 
—0.42586566248, --0.088988503134), 
zs = (—0.34101304185, 0.11643462042, 0.019590672072, 
0.68204303436, -—0.63607121400) ; 
begin 
real procedure sum (i, m,n, @); 
integer 1,™,7; real a; 


value m, n; 
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begin reals; s := 0; 
for 2 := mstep 1 until n dos := s+a; sum :=s 
end sum, 
real procedure max (a, 6); valuea,b; reala, b; 
max := ifa > b thena else b; 


procedure Householder tridiayonalization 2 (n, g, a, 6, x, norm); 
value n; integern; array g,@,b,2; real norm; 
comment nonlocal real procedure sum, max; 
comment reduces the given real symmetric n by matrix g 
to tridiagonal form using n—2 elementary orthogonal trans- 
formations —2ww’) = ([—gamma wu’). Only the lower 
triangular part of g need be given. The computed diagonal 
and subdiagonal elements of the reduced matrix are stored in 
a(l:n] and 6[1:n—1] respectively. The transformations on the 
right are also applied to the n by n matrix xz. The columns of 
the strictly lower triangular part of g are replaced by the 
nonzero portion of the vectors u. norm is set equal to the in- 
finity norm of the reduced matrix; 
begin integer i,j,k; real t, sigma, alpha, beta, gamma, absb; 
array p[2:n]; 
norm := absb := 0; 
for k := 1 step 1 until n—2 do 
begin alk] := glk, k]; 
sigma := sum(t, k+1, n, g[t, k] T 2); 
t := absb+abs(a[k]); absb := sgqrt(sigma); 
norm := max(norm, t+absb); alpha := gl|k+1, k}; 
b[k] := beta := if alpha < 0 then absb else —absb; 
if sigma ~ 0 then 
begin gamma := 1/(sigma—alphaXbeta) ; 
glk+1, k] := alpha—beta, 
for ¢ := k+1 step 1 until n do 
pit] := gammaX(sum(j, k+1, 7, glt, J]Xgl9, Kl) 
+sum(j,i+1, n, gly, t1Xgl7, &])); 
t:= 0.5XgammaxXsum(i, k-+1, n, glt, k]Xplt]); 
for 2 := k+1 step 1 until n do p[7] := pl{z]—txgli, 4]; 
for 7 := k+1 step 1 until n do 
for j := k+1 step 1 until 7 do 
glt, J] := glt, Jl—glt, klxplyl—ple) Xgl, £1; 
for 2 := 2 step 1 until n do 
pit] := gammaxXsum(j, kK+1, n, xt, j1Xgls, *)); 
for 7 := 2 step 1 until n do 
for j := k+1 step 1 until n do 
alt, j) := alt, J]—ple]Xgly, &] 
end 
end k; 
al[n—1] :=g[n—1,n—1]; 
t := abs(b[n—1)); 
norm := max(norm, absb+-abs(a[n—1])+42); 
norm := max(norm, t+abs(a[n])) 
end Householder tridiagonalization 2; 
integer 7, 7, k, m, m1; real t, norm, eps, sine, cosine, lambda, 
mu, a0, a1, 60, beta, x0, x1; 
array a{l:n], b[0:n], c[(O:n—1], cs, sn[lin—1]; 
for 7 := 1 step 1 until n do 
begin comment set x equal to the identity matrix; 


al[n] :=g[n.n]; b[n—1] := g[n, n—1]; 


alt, 7] := 1; 
for 7 := 7+1 step 1 until 2 do z{7, 7] := z[j, 7] := 0 
end 7; 


COLLECTED ALGORITHMS (cont.) 


Householder tridiagonalization 2 (n, g, a, 6, x, norm); 

eps := normX1.5-11; comment the tolerance used in the 
Qk iteration is set equal to the product of the infinity norm 
of the reduced matrix and the relative machine precision 
(here assumed to be 1.519-11 which is appropriate for a machine 
with a 36-bit mantissa) ; 


b[0] := mu := 0; m:=n; 
inspect: if m=0 then go to return else i := k := ml := m—1; 
if abs(b[k]) S eps then 
begin 
gim, m| := alm]; mu := 0; m:=k; go to inspect 
end; 


for t := 1-1 while abs(b[i]) > eps do k := t; 

lambda := if abs(alm]~mu) < 0.5Xabs(a[m]) \V m1=k then 
a{m]+0.5Xb[m1] else 0.0; 

mu i= alm]; alk] := alk]—-lambda; beta := b[k]; 

for J := k step 1 until ml do 

begin comment transformation on the left; 


a0 := aly]; al := afj+1|—lambda; b0 := b[j]; 

i := sqgrt(a0 t 2+beta fT 2); 

cosine := cs[j] := a0/t; sine := sn[j] := bela/t; 

alj] := cosineXa0+sineXbeta; alj+1] :== —sinexbO+ 


cosine Xal; 
bl7| := cosineXb0+sineXal; beta := b[j+1]; 
bij+1] := cosineXbeta; clj] := sineXbeia 
end 7; 
bik—1] := c{k—1] := 0; 
for 7 := k step 1 until ml do 
begin comment transformation on the right; 
sine := snlj]; cosine := cs[j]; 
a0 := alg]; 60 := big]; - 
blj7—1] := b[j-1]Xcosine+e[j—1]Xsine; 
alj] := a0Xcosine+b0Xsine+lambda; 
bij] := — a0 X sine +b0X cosine; alj-+1] := alj+1]xXcosine; 
for 2 := 1 step 1 until n do 
begin 20 := z[z, 7]; xl := x{z, j+1]; 


zt, j] := x0Xcosinet+r1Xsine; zi, 7-+1] := —z0%sine+ 
x1 Xcostne 
end 7 
end j; 


afm] := alm]+lambda; go to inspect; 
return: end’ symmetric QR 2 


CERTIFICATION OF ALGORITHM 254 [F2] 

EIGENVALUES AND EIGENVECTORS OF A REAL 
SYMMETRIC MATRIX BY THE QR METHOD 
[P. A. Businger, Comm. ACM 8 (April 1965), 218] 

Joun H. Wexscu (Recd. 3 June 1965, 1 Aug. 1966 and 
1 Mar. 1967) 

Stanford Linear Accelerator Center, Stanford, California 


The procedure symmetric QR 2 was transcribed into AuGou for 
the Burroughs B5500 (39-bit mantissa) and tested with no syntax 
or logic changes (except to change the tolerance from 1.519 — 11 
to 1.8819 — 12). The eigenvalues of the matrix given in the initial 
comment of the procedure declaration were found to 8 units in the 
11th significant place and in the order given. The components of 
the eigenvectors found by the procedure differed from those given 
by at most 7 units in the 10th significant place and that occurred 
in the smallest component of Xz. The computed vectors X3 and 
A4 were the negative of thase given. 

It was found (personal communication from Prof. W. Kahan, 
University of Toronto) that symmetric QR 2 was slow to converge 
on matrices of large order with the form 
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1 0 


The trouble observed seems to be caused by a poor choice of the 
shift, lambda, for accelerating convergence. The following change 
corrects this fault and did not change the results of these tests 
except that the eigenvalues are found in a different order. Replace 
the 8 lines following the line labeled inspect by: 


if abs(b[k]) S eps then 

begin g[m, m] := alm]; m := k; go to inspect end; 
for i := 7 — 1 while abs(®[t]) > eps do k := 1; 
comment find eigenvalues of lower 2 X 2; 
bO := b[m1] T 2; al := sgrt((alml1j—alm])t2+4Xb0); 
t:= alml1] X alm] — 60; ad := alml] + alm]; 
lambda := 0.5 X (if a020. then a0-+al else a0—al); 
i := t/lambda; comment compute the shift; 
if abs(t—mu) < 0.5 X abs(t) then mu := lambda := ¢ 
else if abs(lambda—mu) < 0.5 X abs(lambda) then mu := lambda 
else begin mu := t; lambda := 0 end; 
alk] := alk] — lambda; beta := b{k]; 


The modified procedure (called QF? 2 below) was compared with 
the procedures given by J. H. Wilkinson [Numer. Math. 4 (1962), 
354-376] of the Householder tridiagonalization, Sturm sequence 
bisection, and inverse iteration algorithms. Evaluation of the 
Sturm sequence caused exponent underflows and overflows, so the 
procedures were modified (referred to as HSJ below) by scaling 
and overflow detection. 

To measure the effectiveness of the procedures, two quantities, 
E, and E., were evaluated for each of eleven matrices used as 
test data. These quantities are suggested by Prof. W. Kahan (in 
“Inclusion Theorems for Clusters of Eigenvalues of Hermitian 
Matrices,” University of Toronto, Feb. 1967) and are defined as 
follows. Let A be a Nermitian matrix, A a diagonal matrix of its 
approximate eigenvalues and V a matrix whose columns are ap- 
proximate cigenvectors ordered to correspond with A. Define 
W = V*V —TIandk = AV — VA, then 


E, = || W |l2and By = | R |l2/| A |e, 
where |] X |]: = maximum eigenvalue of X*X. Then it is shown 
that the maximum absolute error in an eigenvalue is less than or 
equal to 


; if By <1. 
V1i- ki 


The computation of W and R was done with double-precision inner 
products. 

The results of the tests are summarized as follows: 

(a) Both QR 2 and HSI found the dominant eigenvalues to 
better relative accuracy, but the same or worse absolute accuracy 
than the other eigenvalues. 

(b) QR 2 was on the average 1.8 times faster than HSI (QR 2 
required 2.5 seconds on a Ililbert segment of order 15). 

(c) QR 2 always found orthogonal eigenvectors (£, ~ 107"); 

(d) in most cases E; ~ 107" for HSI also, but several times 
HSI found two eigenvectors almost parallel (#, ~ 1.0). 

(e) FE. ~ 10-" for both QR 2 and HSI with neither being con- 
sistently better than the other. 


COLLECTED ALGORITHMS (cont.) - 


Conclusions. The orthonormalized eigenvectors, speed, and 
comparable accuracy would recommend symmetric QR 2 over the 
Wilkinson procedures for finding all of the eigenvalues and eigen- 
vectors of a real symmetric matrix. The latter procedures are good 
for finding selected eigenvalues and eigenvectors. 
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ALGORITHM 255 

COMPUTATION OF FOURIER COEFFICIENTS [C6] 
Linpa TrrseELo (Recd. 18 Nov. 1964 and 25 Nov. 1964) 
Stanford Computation Ctr., Stanford U., Calif. 


procedure FOURIER(F, eps, subdivmax, m, cosine, sine, cint, 
sint) ; 
value eps, subdivmaz, m, cosine, sine; 
Boolean cosine, sine; 
real procedure fF; 
comment FOURIER computes the Fourier coefficients cint = 
So F(x)cos(mmx) dx (if cosine is true) and/or sint = fo F(x) 
sin (mmx) dx (if sine is true), where m > 0. The method is that 
of Filon (for a brief exposition see [1] and for Filon’s original work 
see [2] or [3]). Computation is terminated when the number of 
times the interval [0,1] has been halved (n) has exceeded sub- 
divmaz (10 is suggested), or when n > 5 and two successive ap- 
proximations of the integral agree to within eps (1077 is sug- 
gested) times the value of the last approximation. In the former 
case, cint or sint is assigned the value of the last approximation. 
The condition n > 5 is imposed because of substantial cancella- 
tions which may take place during the early stages of sub- 
dividing; 
begin real sumcos, sumsine,. oddcos, oddsine, pi, a, b, g, t, h, p, k, 
c0, cl, s0, 81, intl, int2, previntl, prevint2, tnl, 18, temp; 
integer n, 7; Boolean bool; 
bool := false; pi := 3.14159265359; k:= mx pi: 
sumcos := (F(1.0) X cos(k)+F(0)) X .5; 
sumsine := F(1.0) X sin(k) X .5; 
10: n:=1; h:= 0.5; t:= .5Xk; tml := 1; 
L1: cO0 := cos(2.0Xt); cl := cos(t); 
s0 := sin(2.0Xt); sl := sin(t); 
%:= t73; p:=clX si; 
a:= ((f2—sl fT 2X2.0+itXp)/0; 
b := (2.0X (tX (cl T 2+1.0)—-2.0Xp))/23; 
g := 4.0 X (—tXel + s1)/88; 
if bool then go to L2; 
if sine then 
begin 
oddsine := F(h) X s1; 
for 7 := 2 step | until inl do 
begin temp := cl X c0 — sl X 30; 
sl := sl X c+ cl X s0; 


real eps, cint, sint; 
integer subdivmaz, m; 


cl := temp; 
oddsine := F((2Xt—1)Xh)Xs1 + oddsine 
end; 


if n = 1 then 
begin n := 2; h:= 25; t:= 25Xk; tnl := 2; 
prevint2 := (aX (F(0)—F (1.0) Xcos(k))+ 
bXsumsine+gXoddsine) X .5; 


sumsine := sumsine + oddsine; go to Ll 
end 
else 
begin int2 := h X (aX (F(0)—F(1.0) Xcos(k))-4- 
bXsumsine+g xX oddsine) ; 


if abs (prevint2—int2) <epsXini2/A\n>5 then 
begin sint := int2; bool := true; go to LO end 
else 
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begin n :=n+1; 
if n > subdivmaz then 
begin bool := true; 
sint := int2; go to LO 
end; 
sumsine := sumsine + oddsine; 
t:= 5X t; tnl := 2X tn; 
prevint2 := int2; go to Ll 
end 
end 
end of sine computations; 
L2: if cosine then 
begin 
oddcos := F(h) X cl; 
for 2 := 2, step 1 until inl do 
begin temp := cl X c0 — sl X 80; 
sl := sl XK c0O+ cl X s0; 
cl := temp; 
oddcos := F((2Xt—1)Xh) X cl + oddcos 
end; 
ifn = 1 then 
begin n := 2; h:= .25; t:= .25 Xk; tnl := 2; 
previntl := (aXF (1.0) Xsin(k) +b X sumcos+g X oddcos) 


h:= 5Xh; 


xX .5; 
sumcos := sumcos + oddcos; bool := true; ge to L1 
end 
else 
begin intl := h X (aXF(1.0) Xsin(k) +b Xsumeos+gX 
oddcos) ; 


if abs(previnti—intl) < eps X intl A n> 5 then 
begin cint := intl; go to exit end 
else 
begin n :=n-+1; 
if n > subdivmax then begin cint := intl; 
go to exit end; 


sumcos := sumcos + oddcos; h := 6 Xh; 
t:= 6X t; ml := 2X tnl; 
previntl := intl; goto Ll 
end 


end 
end of cosine computations; 
exit: end FOURIER 


REFERENCES: 
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CERTIFICATION OF ALGORITHM 259 [C6] 
COMPUTATION OF FOURIER COEFFICIENTS 
[Linda Teijelo, Comm. ACM 8 (May 1965), 279] 
GILLIAN Hatu* and VALERIE A. Rayt (Reed. 31 Mar. 
1969 and 1 July 1969) 
National Physical Laboratory, Teddington, Middlesex, 
England 
* M.R.C. team, Division of Computer Science (formerly of Di- 
vision of Numerical and Applied Mathematics). 
t Division of Numerical and Applied Mathematics. 


KEY WORDS AND PHRASES: 
coefficients, Filon’s method 
CR CATEGORIES: 5.16 


numerical integration, Fourier 


The algorithm was translated using the KDF9 Kidsgrove 
ALGOL compiler, and needed the following correction. 


The tests for convergence on lines 51 and 83 should read re- 
spectively: 
if abs(prevint2—int2) < eps X abs(ini2) A n > 5 then 
if abs(previntl—intl) < eps X abs(intl) A n > 5 then 


With this alteration, the program was tested ‘successfully on a 
series of functions /'(x) using a range of values of m and eps for 
each function. The parameter subdivmax was set at the recom- 
mended value, 10. For F(z) = x?, for which the method is exact, 
results were obtained correct to machine accuracy, i.e. 103 deci- 
mal places. 

Remarks. (i) It would be better to declare:the identifier tn1 
as type integer, i.e. to replace lines 20 and 21 of the text by: 


cO, cl, sO, sl, entl, int2, previntl, prevint2, 88, temp; 
integer n, 7, tnl; Boolean bool; 


(ii) There is no indication, after execution of the algorithm, 
whether the computation was terminated because of apparent 
convergence or because the number of times, n, that the interval 
was halved became greater than subdivmax. The following modifi- 
cation provides such an indication; it has the effect that cosine 
and sine will retain their entry values except in the case where 
cosine or sine has the value true on entry and n becomes greater 
than subdivmaz in the course of computation. In this case the value 
on exit will be false. 

Line 3 becomes: 
value eps, subdivmax, m; real eps, cint, stint; 

Line 57 becomes: 
sint := int2; sine := false; go to LO 
Line 88 becomes: 
false; 


cosine := go to exit end; 


(iii) To avoid the repeated evaluation of Fr), F(1.0) the 
following modification is suggested: 

Declare a new variable term1 of type real on line 20. 

Replace lines 23 and 24 by: 


term1 := F(1.0) X cos(k); 
sumcos := (F(0)+terml) X 0.5; 
sumsine := 0; 

terml := 2 & (sumcos—term1); 


Replace lines 44, 45 and 49, 50 by: 


prevint2 : seas te ee XK 0.5; 
begin int =hX (aXterm1+bxXsumsinet+gx oddeineys 


Replace lines 76, 77 and 81, 82 by: 
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previntl := (bXsumcos+gXoddcos) X 0.5; 
begin intl := h X (bXsumcos+gXoddcos) ; 

The work described above has been carried out at the National 
Physical Laboratory. 
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ALGORITHM 256 

MODIFIED GRAEFFE METHOD [C2] 

A. A. Grav (Reed. 29 July 1964, 23 Oct. 1964 and 18 
Jan. 1965) 

Northwestern University, Evanston, Illinois 


The algorithm given here mechanizes a modified form of the 
Graeffe process designed to avoid an expanding number range. 
This was discussed in [1]; the notation used below is the same as in 
that article. 

Let the given polynomial be 


Apt” 4- +++ + An | 


the degree n and the array of coefficients a are input parameters of 
the procedure. An additional input parameter w is used to deter- 
mine the number of stages needed to obtain a desired order of 
resolution; this may be considered to be roughly the number of 
significant decimal places expected in the zeros of the polynomial. 

The algorithm finds the moduli, d, (s = 1, --+ , 7), of the zeros 
of the polynomial and the number of stages used for this, p. If the 
algorithm succeeds, the output parameter q is set equal to 0; 
otherwise, the value of q serves as the indicator for the reason of 
failure: ¢ = 1 if the polynomial has a zero-valued coefficient, and 
q = 2 if a zero-valued divisor is encountered somewhere in the 
process. In either case, the moduli of the zeros are not found. 
Apart from these two cases, the algorithm applies generally; this 
includes the cases where some zeros have equal moduli or are 
imaginary. 

The algorithm has been tested with polynomials of degree up to 
10, including ill-conditioned cases such as polynomials with one or 
more sets of multiple or imaginary zeros. The algorithm has been 
compiled as it stands using both the Oak Ridge Aucot Translator 
for the Control Data 1604 and the SHaRE AuGot Translator for the 
IBM 709/7090. In the case of. the latter, one change as noted in a 
comment had to be made; this is presumably no longer necessary 
in a revision of the translator. 

Garwick’s device [2] is used as convergence criterion in both 
root extraction and the basic process. From w and the number of 
stages determined from it, it is possible to conclude whether some 
zeros may be considered to be of equal moduli; in such cases an 
adjustment of their values is possible and is made. 

The quantities used in the modified Graeffe process are related 
to those occurring in the ordinary root-squaring process. This 
implies that in general the limitations of the Graeffe process (sce, 
for example [3, pp. 67-69]) hold also in the modified process; the 
most serious of these is that initially the condition of successive 
polynomials may deteriorate. 

An expanding number range is avoided by introducing at each 
step arithmetic divisions. It follows that if c; is near zero, over- 
and underflow can occur in computing subsequent quantities. In 
the usual machine system, such a condition results in the auto- 
matic termination of computation; in this case this is not serious. 
In an ALGOL system where this is not true, a very unsatisfactory 
arrangement generally, machine-dependent facilities must be 
added to the algorithm to obtain the same effect; the ALGou lan- 
guage contains no way of doing this. Theoretically a bridging 
mechanism is possible to work around near-zero divisors, but this 
has not been attempted here. 
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The modified process can be expected to perform somewhat 
better than the standard process in the case of equal moduli. 
procedure Modified Graeffe (w, n, a, d, p, 9); 

value w,n; integer w, 7, p,q; array a, d; 
begin 

real aa, eps, eps2, h, h1, h2, hh2, m, nh2; 

integer 1, k, k0, k00, s, s3; 

array c(0:n], dl, hh[1:n], e[1:n, 1:n/2]; comment Using the 

SHARE processor, the last subscript bound n/2 was replaced by 


entier (n/2); 
eps := eps2 := 10—5; 
k00 := 40; comment This is the maximum number of stages 


needed on the CDC 1604 where about 10 significant decimal 
figures may be obtained. On the IBM machines it is less, but 
the figure was not changed for such use; 

for s := 0 step 1 until n do 

begin if a[s| = 0 then begin q := 1; 

Determine the number of stages: 
kO := entier (3.56 & w + 3.21); 
if kO > 00 then kO := k00; 


go to out end end; 


Initialization: 
for s := 1 step | until n do 
begin 
ifs +s > n then s3 := n—s else 83 := 8; 
for 7 := 1 step 1 until s3 do 


e(s, 7] := a{s+7] X afs—2]/(afs+7—1] X als—t+1)]); 
di{s] := abs (a[s]/a[s—1]}) 


end; 
c[0] := c[n] := 1; 
mis; 
Main loop: 
for k := 1 step 1 until k0 do 
begin 
m:= m/2; 
for s := 1 step 1 until n—1 do 
begin 
ifs +s > nthen s3 := n—s else 83 := 8; 
h := 0; 
for 7 := s3 step —1 until 1 do 
h := (1—h) X efs, ¢]; 
cls] := 1—2 XA; 
if cls] = 0 then 
begin q := 2; go to out end 
end; 
for s := 1 step 1 until n do 
begin 
ifs +s > nthen s3 := n—s else 83 := 8; 


for 7 := 1 step 1 until s3 do 

begin 
h := (e[s+i]/e[s+i—1]) X efs, 2]; 
els, 7] := (e[s—t]/c[s—z+1]) X e[s, 7] X h; 

end; 

comment _ In the paper [1] on which the algorithm is based, 
there is an error in equation (13) and results derived from 
it. The equation should be 

edtPogep 


Rg iP 
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Root extraction: 
aa := abs (e[s]/c[s—1]); 
comment If the 7 operation is suitably implemented for 
fractional exponent, the following 12 lines may be replaced 
by 
hh{s] := hl := aa f (1/27); 
Al :=h := 1+ (aa-1) X w—-2 Xm; 
nh2 := 1; 
AB: fori := 1 step 1 until k doh :=h X h; 
h2 := (aa/h—1) X m; 
h:=hl :=hl +hl X h2; 
hh2 := abs (h2); 
if hh2 > eps then go to AB; 
if Ah2 < nh2 (A hh2 ¥ 0 then : 


begin 
nh2 := hh2; goto AB 
end; 
hh{s] := Al; 
dl[s] := dl[s] X hl 
end; 
h := 0; 
for s := 1 step 1 until n do 


begin 
hl := abs (hh[s]—1); 
if hl > eps then go to AC; 
ifhl >h thenh := hl 
end; 
ifh < eps2 \ h ¥ O then 
begin eps2 := h; go to AC end; 
go to Root determination; 
AC: end Main loop; 
k := kO; 
Root determination: 


q:=0; p:=k; gs :=1; 
BA: fori := s step 1 until n do 
begin 
if abs (c[t]—1) < eps2 then 
begin k := 7; go to AE end 
end; 
k:=n; 
AE: if k = 8 then 
begin 
d[s] := dl[s]; go to AG 
end 
else 
begin 
aa := 1; 
for 7 := s step 1 until k do 
aa := aa X dl{i]; 
comment If the 7 operation is suitably implemented for 
fractional exponents, the following 13 lines may be replaced 
by 
h := aa fT (1/(k—s+1)); 
hl := dl{[s]; 
nh2 := 1; 
AF:h := 1; 
for 2 := s step 1 until k do 
h:=hX Al; 


h2 := (aa/h—1)/(k-—s+1); 
hl :=h1l + Al X h2; 
hh2 := abs (h2); 
if hh2 > eps then go to AF; 
if hh2 < nh2 A hh2 ¥ 0 then 
begin 

nh2 := hh2; 
end; 
for 7 := s step 1 until k do d[{i] := hl 

end; 


go to AF 
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TABLE 1 


p | Computed Moduli (dg) 


Coefficients (ag) Aciual Zeros 


ReEsuits From CDC 1604 


3.000000000 2.000000000 
« 1.000000000 1.000000000 


1.414213563 1.414213563 
1.000001068  .9999989324 


1.002108373 .9999999404 
9999999400 .9978961853 


5.000000000 4.000000001 
2.999999999 2.000000000 
1.000000000 


1.003023179 
1.000737942 
-9977555433 


5.999999999 5.000000004 
3.999999998 3.000000001 
2.000000000 1.000000000 


1.009739721 1.009739721 
1.000072156 1.000072156 
-9902827716 .9902827715 


o 


4}/1-1-—5 -1 -6 i —2 3 2 


4/12322 +i —1-tt 35 
4/14641 —1 (four-fold) | 35 


5 | 1-5 —15 125 —226 120 1234-65 9 


1.000737942 
9977555433 


5 1151010861 —1 (five-fold) | 21 


1234-5 —6} 9 


6 | 11 —45 35 524 —1236 720 


6 | 16 15 201561 —1 (six-fold) | 22 


Resu.its From IBM 709 


6 /16 15 201561 —1 (six-fold) | 22 | 1.0442011 1.0219216 
1.0219216 97855264 

97855264 95767000 

10 | 1 10 45 120 210 252 210 120) —1 (ten-fold) | 23 ; 1.1896983 1. 1896983 
45 10 1 1.0977241 1.097724] 
1.0001204 1.0001204 

-91099190 -91099190 

84044056 84044056 

10 | 1 —55 1320 —18150 157773 | 1234567 8 10 | 10.001153 8.9947183 
—902055 3416930 —8409500 | 9 10 8.0090868 6 .9926022 
12753576 —10628640 3628800 6.0027695 4.9996995 
3.9999811 2.9999883 

2. 0000007 1.0000000 


AG: if k = n then go to out; 


s:=k+1; 
go to BA; 
out: 


end Modified Graeffe 

Tests. Some of the tests (Table 1) were run on the CDC 1604 
using an earlier version of the algorithm; minor improvements in- 
corporated afterwards should not affect the results substantially. 
The results obtained using the SHARE ALGoL translator and the 
IBM 709 suffer in comparison to those obtained on the 1604 for two 
main reasons: (1) significance of floating-point numbers is 27 bits 
vs. 35, and (2) input conversion routines introduce greater per- 
turbations into input numbers. The last cases given are very 
poorly conditioned, so that the rather poor results should not be 
especially surprising. 

Thanks and acknowledgements are due to several members of 
the Mathematics Division of Oak Ridge National Laboratory for 
running tests on the Control Data 1604, and to Mrs. Virginia 
Klema for running tests on the IBM 709 computer at Northwestern 
University. 
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REMARK ON ALGORITHM 256 [C2] 

MODIFIED GRAEFFE METHOD [A. A. Grau, Comm. 
ACM 8 (June 1965), 379] 

G. STERN (Recd. 8 Mar. 1965 and 24 Mar. 1965) 
University of Bristol Computer Unit, Bristol 8, England 


This procedure was tested on an Elliott 503 using the two 
simplifications noted in the comments on page 380. When the 16th 
line from the bottom of page 380, first column, was changed to read 

hi := aa fT (1/(k-—s+1)); 
(as suggested in a private communication from the author) correct 
results were obtained. 
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ALGORITHM 257 

HAVIE INTEGRATOR [D1] 

Rospert N. Kusik (Reed. 9 June 1964 and 21 Dec. 1964) 
The Babcock & Wilcox Co. Lynchburg, Viriginia 


real procedure havieintegrator (x, a, b, eps, integrand, m); 
value a, b, eps, m; integer m; 
real integrand, x, a, b, eps; 

comment This algorithm performs numerical integration of defi- 
nite integrals using an equidistant sampling of the function and 
repeated halving of the sampling interval. Each halving allows 
the calculation of a trapezium and a tangent formula on a finer 
grid, but also the calculation of several higher order formulas 
which are defined implicitly. The two families of approximate 
solutions will normally bracket the value of the integral and 
from these convergence is tested on each of the several orders of 
approximation. The algorithm is based on a private communica- 
tion from F. HAvie of the Institutt for Atomenergi Kjeller Re- 
search Establishment, Norway. A Fortran version of the al- 
gorithm is in use on the Philco-2000. A few test cases have been 
run on the Burroughs B5000. In particular, a and 6 are the lower 
and upper limits of integration, respectively, eps is the con- 
vergence criterion, integrand is the value of the function to be 
integrated (sampled), and m is the maximum order approxima- 
tion to be considered in attempting to satisfy the eps conver- 
gence criterion. If convergence is not gained, then the value 
returned is that of the nonlocal variable, mask. The parameter 
integrand must be an expression involving the variable of in- 
tegration x. See the driver program of this algorithm for ex- 
amples of the procedure ¢all; 

begin real h, endpts, sumt, sumu, d; 
integer 1, 7, k, n; 
real array t, u, tprev, uprev[1:m]; 


x:= a; endpts := integrand; x«:=b; endpts := 0.5 X 
(integrand+endpts) ; 
sumt := 0.0; t:=n:=1; h:=b—-a; 
estimate: t[1] :=h X (endpis+sumt); sumu := 0.0; 


comment ¢{l] = h X (0.5X/[0]+/[1]+/[2]+---+0.5/[2*-4)); 
z:=a— h/2.0; 
for j := 1 step 1 until n do 
begin 
zx:=at+h; 
end; 
ull] -=h X sumu; k := 1; 
comment ull] = h X (f[l/2]+/[8/2]+---+/[2*—-1)/2]), & 
corresponds to approximate solution with truncation error 
term of order 2k; 
test: if abs(t{k]—u[k]) < eps then 
begin 
havieintegrator := 0.5 X ([k]+ul[k]); 
end; 
if k ~ 71 then 
begin 
ad := 27 Xk); 
tik+1] := (dXt[k]—tprev[k])/(d—1.0); 
tprev[k] := t[kl]; 
u[k+1] := dXulk]—uprev[k])/d—1.0); 
uprev{k] := u[k]; 


sumu := sumu + integrand 


go to exit 
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comment This implicit formulation of the higher order in- 
tegration formulas is given in [RompEerRG, W. Vereinfachte 
Numerische Integration. Det Kong. Norske Videnskabers 
Selskabs Forhandl. 28, 7 (1955), Trondheim; and in STIEFEL, 
EK. Einfiihrung in der Numerische Mathematik. Teubner 
Verlagsges., Stuttgart, 1961, pp. 131-186. (English transla- 
tion: An Introduction to Numerical Mathematics, Academic 
Press, New York, 1968, pp. 149-155)]. See also Algorithm 60 
where the same implicit relationship is used to calculate 
tik+1] only; 


k:=k+41; 

if k = m then 

begin: 

havieintegrator := mask; go to exit 

end; 

go to test 
end; 
h := h/2.0; sumt := sumt + sumu; 
tprev[k] := t[k]; uprev[k] := ulk]; 
ti=t~_til; ni=2X*n; 


go to estimate; 
exit: end havieintegrator 
Following is a driver program to test havieintegrator. 
begin comment First test case, y = 7)? cose dx = 1.0 
(0.9999999981 as executed on the B5000), is an example of the 
higher order approximations yielding fast convergence as in 
Algorithm 60; second test case, y = ['? e~*” dx = 8862269255 
(.8862269739 as executed on the B5000), is an example where 
this algorithm is superior to Algorithm 60 because the higher 
order approximations converge more slowly than the linear 
approximations; see also [THacHER, H. C., Jr., Remark on 
Algorithm 60. Comm. A.C.M.7 (July 1964), 420]; 
real a, b, eps, mask, y, answer; 
a := 0.0; b := 1.5707963; eps := 0.000001; mask := 9.99; 
answer := havieintegrator (y, a, 6, eps, cos(y), 12); 
outreal (1, answer); 
a:= 0.0; 6 := 4.3; 
answer := havieintegrator (y, a, b, eps, exp(—yXy), 12); 
outreal (1, answer); 
end 


CERTIFICATION OF ALGORITHM 257 [D1] 

HAVIE INTEGRATOR [Robert N. Kubik, Comm. ACM 
8 (June 1965), 381] 

Kennetu Hiuistrom (Recd. 28 Feb. 1966, 29 Apr. 1966 
and 15 July 1966) 

Applied Mathematics Division, Argonne National Labora- 
tory, Argonne, Illinois 


Work performed under the auspices of the U.S. Atomic Energy Commission, 


Havie Integrator was coded in CDC 3600 Fortran. This rou- 
tine and a Fortran-coded Romberg integration routine based 
upon Algorithm 60, Romberg Integration [Comm. ACM 4 (June 
1961), 255] were tested with five and four integrands, respectively. 

The results of these tests are tabulated below. (The ALGOL- 
coded Havie routine was transcribed and tested for the two 
integrands used by Kubik, with identical results in both cases.) 
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In the following ‘table, A is the lower limit of the interval ot 
integration, B is the upper limit, EPS the convergence criterion, 
VI the value of the integral and VA the value of the approxima- 
tion, 


ter eam eR ee ee A 


Number 

of 

Integrand A! B EPS VI Routine VA rae 
Evalu- 

ations 

cos x 0 7/2 1076 1.0 Havie 0.9999999981 17 
Romberg 1..000000000 17 

ee 0 4.3 10-8 0.886226924 Havie 0. 886226924 17 
Romberg 0. 886336925 65 

in z 1 10 10-@ 14.0258509 Havie 14.02585084 65 
Romberg  14.02585085 65 

1/2 

(5 0 20 1078 5.7707276 Havie 5. 770724810 32,769 
eee. Romberg _5.770724810 16,385 
cos (42) 0 aw 1078 0.0 Havie 3. 1415926536 3° 


® Since in the Havie procedure, the sample points of the interval, chosen for 
function evaluation, are determined by halving the interval and are, therefore, 
function-independent, there are functions for which the convergence criterion is 
satisfied before the requisite accuracy is obtained. An example is the integrand 
f(z) = cos (42) integrated over the interval (0, 7]. The value obtained from the 
routine is = 7r. The true value of the integral is 0. 

This inherent limitation applies to all integration algorithms that obtain sample 
points in a fixed manner. 
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HAVIE INTEGRATOR [Robert N. Kubik, Comm. 
ACM 8 (June 1965), 381] 

I. Farkas (Recd. 29 Apr. 1966 and 18 Aug. 1966) 

Institute of Computer Science, University of Toronto, 

Toronto 5, Ont., Canada 


Havieintegrator was translated with some modifications into 
Fortran IV and was run on the IBM 7094 ITI at the Institute of 
Computer Science, University of Toronto. To reduce the effect of 
roundoff, the calculations were carried through in double preci- 
sion internally and the result was rounded to single precision. The 
main change made was that the parameters x and integrand in 
havieintegrator were replaced by a single parameter of type FUNC- 
TION in Fortran IV. The other change was that mask was re- 
moved. The maximum order of approximation was kept less than 
or equal to 25, and convergence was obtained in every case. 

The results obtained for the two test cases were in agreement 
with the author’s result. Besides, 14 other successful tests were 
made and those shown in Table I are typical. 


TABLE I 

Integrand A B True value eps Error X 108 Bees d 
er | 0.0 1.0 1.7182818 10-6 0 3 
10-4 240 2 
10-2 3700 2 
wl? 0.01 1.1 . 265559382 10-8 —2 a 
10-4 59 3. 
107? 36041 2 
Vx 0.0 1.0 .66666667 1076 —27 3 
10+ — 1982 2 
10-2 — 126848 2 
V/V2 0.01 1.0  1.8000000 1076 0 3 
10-4 140 2 
2 


10? 790 
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Like other integration algorithms that determine sample points 
in the interval in a deterministic manner, havierntegrator may fail 
in certain instances. For example, any integrand with the property 
that f(a) = f(b) = fl(a + b)/2)] will lead to the value ( _ a)f(a) 
which will in general not be an acceptable approximation to 
Se® f(x) dx. Thus fo” sin? x dx leads to 0. Moreover, fo ze~* dz leads 
to “almost zero” (in fact, 5.7966 107"). 


COLLECTED ALGORITHMS FROM CACM 


ALGORITHM 258 
TRANSPORT [H] 
G. Bayer (Recd. 4 May 1964 and 4 Mar. 1965); 
Technische Hochschule, Braunschweig, Germany 


procedure transport (c, x, a, 6b, m, n, inf, cost); 
value m,n, inf; integer m, n, inf, cost; 
integer array c, x, a, b; 
comment The parameters are c[t,j] array of costs, the quantities 
available a[z], the quantities required b[j7], 7 = 1,---,m,j = 
1,--+ , nm. Sum of afi] = sum of b[7]. inf has to be the greatest 
positive integer within machine capacity, all quantities have to 
be integer. The flows 2[7, 7] are computed by the ‘‘primal-dual- 
algorithm,’’ cited in [Haptey, G. Linear Programming. Read- 
ing, London, 1962, pp. 351-367]. The procedure follows the de- 
scription given on p. 357. Multiple solutions are left out of 
account; 
begin integer 7, j, p, A, k, y, t, 1; 
integer array 0, x87, 8, 7, listv[1:n], u, xis, d, g, istu[l:m]; 
Boolean array xb[1:m, 1:n]; 
integer procedure sum(z, a, b, x); 
integer 7, a, b, x; 
begin integer s; 
s := 0; 
for 7 := a step 1 until b dos :=s +2; 
sum i= 8 
end; 


value a, 0; 


comment Array zb for notation of ‘circled cells,’ Iéstu and 
listv lists of labeled rows and columns. Other notations follow 


Hadley; 
for 7 := 1 step 1 until m do zis[z] := alt]; 
for j := | step 1 until n do xsj[j] := fy]; 
for 7 := | step 1 until m do 
begin h := inf; for j := 1 step 1 until n do 
begin z[7, J] := 0; p:=cli,7]; ifp <hAthenh := pend; 


ult] := A; 
for 7 := 1 step 1 until n do 
xb[t, j] := if c[z, 7] = h then true else false 


end u[2]; 
for 7 := 1 step 1 until n do 
begin h := inf; 

for 1 := 1 step 1 until m do 


begin if zb[i, j] then 
begin v[j] := 0; go to aa end; 
d{i] := p := eft, j] — ult]; 
if p <Athenh := p 
end; 
vj] = h; 
for 71 := 1 step 1 until m do 
begin if d[?] = h then xb[i, 7] := true end; 
aa: 
end [7]; 
for j := 1 step 1 until n do listv[j] := 0; 
for 7 := 1 step 1 until m do listu[z] := 0; 
s2: fori := 1 step 1 until m do 
begin for j := 1 step 1 until n do 
begin if xb[z, 7] then 
begin h := 2[t, j] := if xsj{7] < ats[z] 
then z87[7] else xis[z]; 
«sjlj] := xsjlj] — h; 
zis[i] := ais[iq] — h 
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end 
end 
end; 

803: if sum(j, 1, n, xsj7l7]) = 0 then go to s6; 
for 7 := 1 step 1 until n do s[j] := r[j] := 0; 
h:=0; k:=1; 

83: for 7 := 1 step 1 until m do 
begin if zis[{i] > 0 then 

begin d[t] := xis[i]; g{z] := 2X n; 
for j := 1 step 1 until n do 
begin if xb[z, 7] A rly] = 0 then 
begin s[j] := dt]; r[j] := 7; 
if xsj[j] > h then 
begin h := xsj{7]; 
end 
end 
end 
else d[z] := g[z] := 0 
end; 

863: if k = 1 then go to s13; 
t:=1; 
for k := 1 step 1 until n do 
begin 7 := listv[k]; lstv[k] := 0; 

for 7 := 1 step 1 until m do 
begin if xb[i, j] A zfz, j] > 0 A g[t] = 0 then 
begin d[z] := if z[7, 7] < sly] 
then 2z[2, j] else s[7]; 


lesto[k] := 9; 


p := jend 


if 7 = 0 then go to s33; 


git] := 9; lstull] := 7; l:=2+1 
end 
end 
end; 
833: if 1 = 1 then go to s13; 


k:=1; 

for | := 1 step 1 until m do 

begin 7 := listu{l]; lstu[{l] := 0; if ¢ = 0 then go to s43; 
for j := 1 step 1 until n do 
begin if xb[z, j] A rly] = 0 then 


begin s[j] := d[¢]; r{j]) := 7; Usto[k] := 7; ki=k +1; 
if zsj[j] > h then 
begin h := xsj[j];_ p := j end 
end 
end 
end; 


843: go to 853; 
813:; comment end of labeling process; 
ifh > 0 then go to s4 else 
if sum(j, 1, n, xsj[j]) = 0 then go to s6 else go to 35; 
84: k:= p; 
h := if s{k] < xsj{k] then s[k] else xsj[k]; 


s41: y:= r[k]; xly, k] := aly, k] +h; 
xis[y] := wvis[y] — h; xsj[k] := xsj[k] — h; 
t:= gly]; ift = 2X nthen goto 803; aly, ¢] := 2ly, ¢] — fh; 
zis[y] := zis[y] +h; xsj[t] := asj{t] th; k:=t; goto s4l1; 
sh: h i= inf; 
for 7 := 1 step 1 until m do 


for j := 1 step 1 until n do 
begin if g[¢] = 0 A 7r[j] = 0 then 
begin p := elt, j] — ult] — vf]; 
ifp <hthenhkh := p 
end 
end; 


COLLECTED ALGORITHMS (cont.) 


for 7 := I step 1 until m do 
begin if g[¢7] ~ O then u[z] := ult] + 2 end; 
for 7 := 1 step 1 until n do 
begin if r[{j] # 0 then v[7] := v[y] — A end; 
for 7 := I step 1 until m do 
for j := 1 step 1 until n do 
begin if c{z, 7] = ul¢] + vf[j] then zb[z, 7] := true end; 
go to s03; 
86: cost := sum(i, 1, m, a[i}xXulz]) + sum, 1, n, bly] Xelz)) 


end; 


REMARK ON ALGORITHM 258 [H] 

TRANSPORT [G. Bayer, Comm. ACM & (June 1965), 381] 
G. Bayrr (Recd. 11 June 1965) 

Technische Hochschule, Braunschweig, Germany 


The following correction should be made in the procedure. 


Change the: second line above the label s6 from 

begin if cli,j] = wli]+olj] then zb[7,j] := true end; 
to 

xbli,j] := cli,j] = ult] + oy); 


CERTIFICATION OF: 


ALGORITHM 258 [H] 
TRANSPORT 

[G. Bayer, Comm. ACM 8 (June 1965), 381] 
ALGORITHM 293 [H] 
TRANSPORTATION PROBLEM 

[G. Bayer, Comm. ACM 9 (Dec. 1966), 869] 


Ler 8. Sims (Recd. 21 Feb. 1967 and 17 Mar. 1967) 
Kates, Peat, Marwick & Co., Toronto, Ont., Canada 


Both of these algorithms were coded in Extended ALGou 60 
and tested on a Burroughs B5500. Three problems were solved 
correctly, one of them being of medium size (55 X 167). On this 
larger problem transp1 was found to be about twice as fast as 
transport. 

In coding and debugging transp1 three apparent errors were 
found. In the right-hand column on page 870, after line 27 which is 

t:= listulu}; nloi := nlefe]; 

a line is missing. This line should read 

for s := (t(—1) X n +1 step 1 until nlvi do 
Also in the right-hand column, the line 

S4:; 
should be inserted ahead of line —12, which begins 

comment Step 4. A column j with b[j] has been labeled, b[j] 
On page 871, in the left-hand column, line —22 which reads 

for s := 1 step 1 until n do 
should read 

for s := 1 step 1 until n do 
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COLLECTED ALGORITHMS FROM CACM 


ALGORITHM 259 

LEGENDRE FUNCTIONS FOR ARGUMENTS 
LARGER THAN ONE* [816] 

Water Gautscui (Recd. 5 Mar. 1965) 

Purdue University, Lafayette, Ind. and Argonne National 
Laboratory, Argonne, Ill. 
*Work performed in part under the auspices of the U.S. Atomic Energy 

Commission. 

begin 

comment Control is transferred to a nonlocal label, called 
alarm, whenever the input variables are not .in the intended 
range; 

procedure integer Legendre 1 (x,a, nmazx, P); 
value x, a, nmax; integer a, nmazx; real z; array P; 

comment This procedure generates the associated Legendre 
functions of the first kind, 


(a? — 1)"2 data 


FN gh das 


(a? — 1), 


for n = 0(1)nmaz, assuming a > 0 an integer; and x > 1. The 
results are stored in the array P. The method of computation is 
derived from the (finite) continued fraction 


in + oF, /F,, = Mbetati-) @+ot+)@—xn) 


nx + (n+ La + 
(1+a-+ 2) a@—n—1) 2a-} 
(n+ 2)a, + ax (ens 4), 


where F, = Pa"(x)/(n+a)!, a1 = 2a(x2?—1)~4, and the identity 
Fo +2 aE = [x+(x?—1)4#]*/a}. 


If x is very close to 1, the computation of 2: is subject to can- 
cellation of significant digits. In such cases it would be better 
to use y = x—1 as input variable, and to compute (z?—1)! 
by [y(2+y)]! everywhere in the procedure body; 
begin integer ; real xl, c, sum,r, s; 
array Rr|O:nmaz—1}; 
if*™<1Va<0V nmaz < 0 then go to alarm; 
ifzx =1Va= 0then 
begin 
P[0] := 1; 
go to L 
end; 
for n := a+1 step 1 until nmax do P[n] := 0; 
x1 := sqrt (xf2—-1); 


for n := 1 step 1 until nmaz do P{n] := 0; 


c:= 1; forn := 2 step 1 untiladoc:=n Xe; 

sum := (a+21)fa/c; zl := 2 XK x/rl; 

r:= gs :=0; 

for n := a step — 1 until 1 do 

begin 
r:= (atl—n)/(nX2l+(n+a+l1)Xr); s := r X (2+s); 
if n < nmaz then Rr[n—-1] := r 

end; 


P(O] := ¢ X sum/(1+s); 
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for n := Ostep 1 until if nmaz < a then nmaz—1 else a—1 do 
Pin+1] := (nta+1) X Rrin}] K P[n]; 

L: end integer Legendre 1; 

procedure integer Legendre 2(x, m, nmax, d, Q); 
value x, m, nmaz,d; integer m, nmaz,d; real x; array Q; 

comment This procedure generates to d significant digits the 
associated Legendre functions of the second kind, Qn”(x), for 
n = O(1)nmax, assuming m > 0 an integer, and x > 1. The 
results are stored in the array Q. The procedure first generates 
Qo" (x) from the recurrence relation 

ge = Oy Gee ewener 20 

(2? — 1) (1) 

(r = 1,2,+--,m—1) 
with n = 0, and the initial values 


1 1 
ex) =n tt Qe) = We — 
2 24-1 
Then a variant of the backward recurrence algorithm of J. C. 
P. Miller is applied to the recursion 


(n—M+1)Qru1 -- (2n+1)cQn™ + (n+m)Qra = 0 (2) 
(n=1, 2, 3,° . *) 


(For more details see [2]. See also [4] for a very similar al- 
gorithm.) If m > 1, the leading coefficient in (2) vanishes 
for n = m — 1, which invalidates the theoretical justification 
for the backward recurrence procedure. Nevertheless, it appears 
that the procedure produces valid results for arbitrary m > 0. 
Convergence of the backward recurrence algorithm is slow for 
x near 1, but improves rapidly as x increases; 

begin integer 7, nu, p; real x1, Q0, Q1, Q2, epsilon, 7; 
array Qapproz, Rr{0: nmaz]; 
ifz<1V nmaz <0 Vm <0 then go to alarm; 
xl := sqrt (xf2—-1); 
Q1 := 6X In ((a+1)/(x—-1)); 
if m = 0 then Q[0] := Q1 else 


begin 
Q2 := —1i/r71; zl := 2 X 2/zl; 
for n := 1 step 1 until m — 1 do 
begin 
Q0 := Q1; Ql := Q2; 
Q2:= -—-n X al X Ql — n X (n—-1) XK QO 
end; 
Q[0] := Q2 
end; 
for n := 0 step 1 until nmaz do Qapproz[n] := 0; 


epsilon := .6 X 107(—d); 
nu := 20 + entier (1.25 X nmaz); 


10: r:=0; 
for n := nu step — 1 until 1 do 
begin 
r i= (n+m)/((2Xn+1) Xx— (n—m+1)Xr); 
if n < nmaz then Rr[n—1] := r 
end; 
for n :=0 step 1 until nmar—1 do Q[n+1] :=Rr[n] K Q[n); 
for n := 0 step | until nmaz do 


if abs(Q[n]—Qapproz[n]) > epsilon XK abs(Q[n]) then 
begin 


COLLECTED ALGORITHMS (cont.) 


for p := 0 step 1 until nmaz do Qapproz[p] := Q[p]; 
nu:=nu+10; go to LO 
end 
end integer Legendre 2; 


procedure integer Legendre 3(x, n, mmaz, d, Q); 
value xz, n, mmax,d; integer n, mmaz,d; realz; array Q; 
comment This procedure generates to d significant digits, and 
stores in the array Q, the Legendre functicns of the second kind, 
Q.™(x), for m = 0(1)mmaz, assuming n > 0 an integer, and 
x > 1. The procedure integer Legendre 2 is used to obtain initial 
values @n°, Qn!, and subsequent values are obtained from the 
recursion (1) of the preceding comment; 
begin integer m; real zl; array Q1([0:n); 
ifn <0 V mmazx < 0 then go to alarm; 
integer Legendre 2(x, 0, n, d, Q1); Q[0] := QI1ln]; 
xl := 2 X 2/sqrt(xt2~—1); 
if mmax. > 0 then 
begin 
integer Legendre 2(x, 1, n, d, Q1); Q[1] := Q1[n] 
end, 
for m := 1 step 1 until mmazx—1 do 
Q{[m+1] := —mX 21 X Q{m] — (m+n) X (m—n—-1) K Q[m—-1] 
end integer Legendre 3; 
procedure Legendre 1(x, alpha, nmaz, d, P1); 
value z, alpha, nmaz,d; integer nmaz, a; 
real x, alpha; array P1; 
comment This procedure evaluates to d significant digits the 
Legendre functions 


Ta+n+ 1) "= 
rl(a@ + 1) 


for n = O(1)nmaz, where x > 1 and a is real. The results are 
stored in the array P1. It is assumed that a nonlocal procedure 
gamma be available which evaluates I'(z) for 0< z < 2. (See 
[3].) The procedure first gencrates the quantities f, = P."(x)/ 
U(a+n+1) from the recurrence relation 


Pay [a + (2? — 1)? cos t]% cos né dt 


2nzx Te Sn a 
(i ee 1) Gt 8 Ht ae et 


and the identity 


Sait + fra= 0, 


Cee Op 
fo + adits Tae fea 
applying a variant of the backward recurrence algorithm of 
J. C. P. Miller. (See [2] for more details.) Then P."(x) = 
C(atn+1)f, is obtained recursively. If a < —4,we let a= —a—1 
and compute P,"(x) = Pae"(r). The substitution is made to 
avoid loss of accuracy when z is large. The rate of convergence 
of this procedure decreases as x increases. A general idea of the 
speed of convergence may be obtained from the graphs in [2, §6]. 
ff x is very close to 1, the same changes as mentioned in the 
first procedure are recommended; 
begin integer n, nu,m; reala, epsilon, x1, sum, c, 7, 8; 

array Papproz, Rr[0:nmaz]); 
ifx <1V nmaz <0 V entier(alpha) — alpha = 0 then 
go to alarm; if « = 1 then 
begin 

P1[0] := 1; 

go to Ll 
end; 
a := if alpha < —.5 then — alpha — 1 else alpha; 
for n := 0 step 1 until nmaz do Papproz{n] := 0; 
epsilon := .56 X 10T(~d); 
ifa < 1 thenc := gamma(1+a) else 


for n := 1 step 1 until nmar do Pl{n] : 


I 
= 


begin 
m := entter(a) — 1; ¢ := gamma(a—m); 
for n := 0 step 1 until m doc := (a—n) X ¢ 


end; 
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zl := sqrt (xf2—1); sum := (a+a1)Ta/e; al := 2 X x/cl; 
nu := 20 + entier ((37.26+.1283 X (a+38.26) Xz) Xnmazx/ 
(37.26+ .1283X (a+1) Xzx)); 
IO: r:=s := 0; 


for n := nu step — 1 until 1 do 
begin 
r:= (at1—n)/(nXz1+(n+a+l) Xr); s:= 7 X (2+8); 
if n < nmaz then Rr{[n—1] := r 
end; 
P1[0] := sum/1+s); 
for n := 0 step 1 until nmaz — 1 do 
Pi[n+1] := Rrln] X Pl[nJ; 
for n := 0 step 1 until nmaz do 
if abs (P1[n]—Papproz|n]) > epsilon X abs (P1[n]) then 
begin. 
for m := 0 step 1 until nmaz do Papprox [m] := P\{m]; 
nu := nu + 10; go to LO 
end; 
P1(0] := c X P10]; 
for n := 1 step 1 until nmaz do 
begin 
ec := (atn) Xc; Pl{[n] :=c X Pin] 


end; 
T1: end Legendre 1; 


procedure Legendre 2(x, a, m, nmaz, d, P2); 


value z,a,m, nmax,d; integer m, nmax,d; real z,a; 
array P2; 
comment This procedure evaluates to d significant digits the 


Legendre functions P?.(z) for fixed z > 1, a, m > 0, and for 
n = 0(1)nmaz. The results are stored in the array P2. They are 
obtained recursively from 


2n+2a+1 ntatm 

rnt+a-m+l1 n+a-m+l1 
the initial values being calculated with the help of the proce- 
dure Legendre 1, 

begin integer n; array P1(0: ml]; 
ifm <0 then go to alarm; 


Peale) = £P Pike) = Pos), 


Legendre \(x, a, m,d, P1); P2[(0] := Pilm]; 
if nmazx > 0 then 
begin 
Legendre 1(x, a+1, m,d, P1); P2[1] := P1{m] 
end; 
for n := 1 step 1 until nmax—1 do 

P2[n+1) := ((2Xn+2Xa+l1) XxX P2[n] 


—(nta+m) X P2[n—1])/(n+a-—m+1) 
end Legendre 2; 


procedure conical (x, tau, nmax, d, P); 
valuez, tau,nmaz,d; integernmaz,d; realz,tau; array P; 
comment This is an adaption of the procedure Legendre 1 to the 
case a = —% + ir, where r is real. The procedure thus generates 
Mehler’s conical functions P",,;,(r) to d significant digits for 
n = 0(1)nmaz and zx > 1. The results are stored in the array P. 
To avoid excessively large and excessively small numbers, we 


let fa = Poy4:,(e)/n! and first compute f, from the recurrence 
relation 
Qnx (n —4)?+ 72 
qT 1 j wm Tee ee ie n—., — 0, 
iG asi a ea) 


and the identity 
fot So Anta = [e+ (x?—1)3]-} cos (x In [x+(2?—1)4]), 
n=1 


where 


rg — tr) 
Ta — air + n) J 


| (g + tr) 
An = 1! : 
TG +ir+n) 


COLLECTED ALGORITHMS (cont.) 


The )’s are obtained recursively by 


ar hiss hd alee eet ny 
eee G4 E49)’ 


1 
1+- 


n 


And = 12 re 
eG) 
2n n 


The procedure converges rather slowly if x and + are both large 
(see the graphs in §6 of [2]). If the accuracy requirement as 
specified by d is too stringent the procedure may not converge 
at all due to the accumulation of rounding errors; 

begin integer n, nu, m; real epsilon, t, x1, 22, sum, lambda 1, 
lambda 2, lambda, r,s; array Papprox, Rr[O:nmaz]; 
ifx <1 V nmaz < 0 then go to alarm; 
ifz = 1 then 
begin 

P(0] := 1; for n := 1 step 1 until nmar do P|n] := 0; 

go to L3 


(2rn — Ana) (n = 2,3,---). 


end; 
t := tauf2; 
for n := 0 step 1 until nmaz do Papproz{n] := 0; 


epsilon := 6 X 10T(—d); 
a1 := sqrt(af2—-1); 22 := 2+ 21; 
sum := cos(tauXln(x2))/sqrt(a2); x1 := 2 X 2/zl; 
nu := 30 + entier ((1+(.140+.0246Xtau) & (a—1))Xnmaz); 
£0: n := 2; , 
lambda 1 := 1/(.25+4); 
lambda 2 := (3-4Xt)/((.25+1) X (2.25+1)); 
L1: lambda := (1+1/n) X (2Xlambda 2—lambda 1)/ 
((1-+.5/n)t2 + (tau/n)12); 


ifn < nu then 


begin 
lambda 1 := lambda 2, lambda 2 := lambda; 
n:=n+1; goto Li 
end; 
r:=s:=Q; 
L2: r := —((1—.5/n) 12+ (tau/n)t2)/(c1+(1-+1/n) Xr); 
8 :=r X (lambda 2+s); 


ifn < nmaz then Rr{[n—1] := 1; 
lambda 1 := lambda 2; 
lambda 2 := 2 X lambda 2 — ((1+.5/n)t2+ (tau/n)t2) 
xX lambda/(1+1/n); 
lambda := lambda 1; 
n:i=n-—1; ifn > 1 then go to 12; 
P[O] := sum/(1+s); 
for n := O step 1 until nmaz — 1 do P[n+1] := Rr[n] X Pin); 
for n := 0 step 1 until nmaz do 
if abs (P[n]—Papproz[n]) > epsilon X abs(P{n]) then 
hegin 
for m := 0 step 1 until nmazx do Papprozx{m] := Pim); 
nu := nu + 60; comment To avoid an infinite loop in 
case of divergence the user should provide for an upper 
bound on nu, say 1000, and exit from the procedure when 
nu exceeds this bound, printing an appropriate error 


message ; 
go to LO 

end; 
i:= 1; 
for n := 1 step 1 until nmaz do 
begin 

t:=nXt; Pin] :=t X& Pin) 
end; 
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J: end conical; 
procedure toroidal (x, m, nmaz, d, Q); 
value x, m, nmaz,d; integer m, nmax,d; real x; array Q; 


comment This procedure generates to d significant digits the 
toroidal functions of the second kind, Q™j4n(x), for n = O(1) 
nmax, where z > 1, and m is an integer, positive, negative or 
zero. The method of computation is based on the recurrence 
relation 


(N-—M+4)Q yp n41 (2), — WaxeQ™y4n(e) + (n+m—HQM n(x) = 0, 
and the identity 


i a m/2 
Q™ (x) +2 yy Qn sina) = (-—1)™ ue T(m + 4)(« -ay(E4t) ; 
n=1 2 x-l 

to which a variant of J. C. P. Miller’s backward recurrence 

algorithm is applied. (See [2] for more details.) The convergence 

of this procedure is slow for x near 1, and improves rapidly as x 

increases; . 
begin integer n, nu, p; real epsilon, x1, c, sum, 7, 8; 

array Qapproz, Rr|O:nmaz); 

ifz <1 V nmac < 0 then go te alarm; 

for n := 0 step 1 until nmaz do Qapproz|n] := 0; 

epsilon := .6 X 10f(—d); 

ce := 2,.2214414691; 

ifm > 0 then 

for n := 0 step 1 until m—1 doc := —(n+.5) X ¢ 
else 
for n := 0 step —1 until m+1 doc := —c/(n—.5); 

sum := ¢ X ((e+1)/(e@—1)) tT (m/2)/sqrt@@—-1); zl i= 2X 2; 

nu := 20 + entier ((1.15-+ (.0146-+ .00122 m)/(x—1)) Xnmaz) ; 
IO: r:= 8s := 0; 


for n := nu step —1 until 1 de 

begin 
rs= (n+m—.5)/(nX21l—(n—m+.5)Xr); s :=7rX (2458); 
if n < nmaz¢ then Rr{[n—1] := r 

end; 


Q[0] := sum/(1+s); 
for n := 0 step 1 until nmaz — 1 do Q[n4+1] := Rrin] X Q[n]; 
for n := 0 step 1 until nmaz do 
if abs(Q[n]—Qapproz{n]) > epsilon X abs(Q[n]) then 
begin 
for p := 0 step 1 until nmaz do Qapproz[p] := Q[p]; 
nu :=nu-+i10; goto LO 
end 
end toroidal; 
comment All procedures were tested on the CDC 3600 computer. 
Some of the tests that were run are described below; 
comment The procedures integer Legendre 1-3 were driven to 
print test values to 6 significant digits of P,™(x), Qm™(x), Qn” (2), 
m = 0(1)10, for x = 1.5, 3, 10, and n = O(1)5. As far as pos- 
sible, the results were compared with values tabulated in [5], 
and found to be in complete agreement. Similarly, test values of 
Pr yin (x),m = 0(1)4, were obtained from the procedure Legendre 
1, for = 1.5, 3, 10, and n = 0(1)5. All agreed with values 
tabulated in [5]. More extensive tests could be run by having 
the procedure ‘‘verify” the addition theorem 


Palay—VS (2? — 1) V/(y? — 1)) = Palx)Paly) 


+20 (-b" Ame Pa (x) Pay), 
m=1 


z>l,y>1; 
TMatm+1) a 


comment |. The procedure conical (with d=6) was run to produce 
test values of P™;,;,(z), m = 0, 1, for z = 1.5, 5, 10, 20, and 
+ = 0(10)30. The results agreed to 6 significant digits with 
those in [10], [11]; 

comment The procedure toroidal was driven to generate test 
values to 6 significant digits of Q™yin(z), Q-Tan(z), n = 0(1)5, 


COLLECTED ALGORITHMS (cont.) 


for x = 1.5, 3, 10, and m = 0(1)4. All values of Q™42(x) were 
checked against those in [5]. There were no discrepancies. The 
values of Q-fi.(x) were compared with those of [[(n—m-+4)/ 
T(n+m+$)]Q74n(z). The largest relative error observed was 
1.510 — 9, occurring at m = 4, n = 5, « = 1.5; 

comment Integrals of the form 


n/2 
fr(k?, a) = (1 [ {1 — k? sin? y]« cos 2ny dy, O<k <1, 
0 


are repeatedly encountered in applied mathematics (see, e.g., 


(6]-[9], where a = —$, ora = —3). It is readily seen that 
2 wy) = (— am Fo +1) — f2)a2tPn Lee 
fall, a) = (DF ogy Ot ~ BrP (Taw) 


The program that follows generates (1—k?)fi(k?, a), n = 
0(1)10, for a = —}, —§, and k? = |1, .5, .9, calling for an ac- 
curacy of 6 significant digits. Selected results are shown below. 
(1 — h?) fu(h2, «) 
1.5807576371 
5 .24564404.7210—8 
9 .080164866710—16 
3 .437822884910—1 
2 .82958444.2310—3 
1.8215954&8010—5 
4.861556123710—1 
5.287840870810—2 
8 .810774395410—3 


1.6169191877 

2 396902298410 —7 
7 .339411710610—15 
8 .472130846310—1 
1.494014960510—2 
1.476430268410—4 
4 .9389962376 

9 .707320038310— 1 
2.169517031710—1 


a k2 


-1.5 1 


2 


—_ 


—2.6 1 


— 
N Pe OO O OOnNN FF OM © 


Ne} 
man 


Those for a = —$ were compared with values tabulated in [6]. 
There was agreement in all four decimal places given; 
begin integer n; real alpha, k2,c; array P1(0:10]; 


for alpha := —1.5, —2.5 do 
for k2 := .1, .5, .9 do 
begin 


e := 1.570796327 K (1—k2)f(1+alpha/2); 

Legendre 1 (.5X (2—k2)/sqri(l1—k2), alpha, 10, 6, P1); 

for n := 0 step 1 until 10 do 

begin 
Pitn] := ¢ X Pl{nj; ¢ := —c/(n+alpha+1); 
outreal (1, Plf{n]) 

end 

end; 
go to skip; 
alarm: outstring (1, ‘parameters not in range’); 
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skip: end; 
comment The integrals 


ab) = | (ae wavde: O21 720 50x 
0 


arose in recent radiation field studies ({1]). One has 
Q,(k) = r(1—k)- 44 P_ (74). 


The program below calculates 0;(k) to 8 significant digits for 
k? = .2(.2).8,7 = 0(1)9. The results agree to 8 figures with the 
values tabulated in [1]; 
begin integer j; real k2, x, 71; array P2, omega (0:9); 
for k2 := .2 step .2 until .9 do 
begin 
a := 1/sgrt(1—k272); 
Legendre 2(x, —.5, 0, 9, 8, P2); 
xl := 3.1415926536 X sqrt(z); 
omega [0] := zl & P2(0); 
for j := 1 step 1 until 9 do 
begin 
zl := x X 21; omegaly] := x1 K Plz] 
end; 
for j := 0 step 1 until 9 do owtreal (1, omega[j]) 
end; 
gd to skip; 
alarm: outstring (1, ‘parameters not in range’); 
skip: end 
end 
REFERENCES: 
1. Epstem, L. F., anp HUBBELL, J. H. Evaluation of a general- 
ized elliptic-type integral. J. Research NBS 67B (1963), 1-17. 


2. Gautscu1, W. Computational aspects of three-term recur- 
rence relations. Unpublished. 

3. ——. Algorithm 221—Gamma function. Comm. ACM 7? 
(Mar. 1964), 143. 

4. Hernvon, J. R. Algorithm 62—A set of associate Legendre 
polynomials of the second kind. Comm. ACM 4 (July 1961), 
320-321; Remark on Algorithm 62. Comm. ACM 4 (Dec. 
1961), 544. 

5. NBS Tables of Associated Legendre Functions. Columbia Uni- 
versity Press, New York, 1945. 

G6. Rieceis, F. Formeln und Tabellen fiir ein in der rdumlichen 
Potentialtheorie auftretendes elliptisches Integral. Archiv 
der Mathematik 2 (1949/50), 117-125. 

7. SIEKMANN, J. Concerning an integral occurring in airfoil 
theory. SIAM Review 8 (1961), 243-246. 


8. —-—-. Analysis of ring aerofoils of elliptic cross section, Part 
I: General theory. J. SIAM 11 (1963), 941-963. 
9. ——. Note ona Riegels-type integral. Z. Angew. Math. Phys. 


15 (1964), 79-83. 
10. Zurina, M.I., anp Karmazina, L.N. Tablicy funkcii LeZan- 
dra P_4,;,(z), Vol. I. Akad. Nauk SSSR, Moscow, 1962. 
11. ——, AND Tablicy funkcif LeZandra P1y,;,(x). Akad. 
Nauk SSSR, Moscow, 1963. 
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REMARK ON ALGORITHM 259 


Legendre Functions for Arguments Larger than One [S16] 
[W. Gautschi, Comm. ACM 8, 8 (Aug. 1965), 488-492] 


J.K.M. Jansen [Reed 24 May 1976 and 12 August 1976] 
Technological University, Eindhoven, The Netherlands 


The purpose of the changes presented here is to simplify the procedures developed 
by Gautschi, in particular to remove the necessity of calling the gamma-function 


routine in procedure Legendre 1. 


COLLECTED ALGORITHMS (cont.) 


(1) procedure integer Legendre 1 


(ii) 


(1) Replace F, = P.*(x)/(n + a)! 
by F, = Pyx)Xal/(n + a)! 
(2) Replace Fy + 2>°° 1 F, = [x + (a2 — 1)!” I\2/a! 
by Fo t+ 2) ha Fa = [2 + (a? — 1) 
(3) Replace real 21, c, sum, 1, s; 
. by real #1, sum, ', 8; 
(4) Sak the statements 
= 1; for n i= 2 step 1 until a doc := nXc; 
(5) Hesiaee sum := (x + 21) T a/c; 


by sum e (x + 21) Ta; 
(6) Replace P[0] := cXsum/(1 + 8); 
by P[0| := sum/(1 + 8); 


procedure Legendre 1 

(1) Omit the sentence of the comment 
It is assumed that a nonlocal procedure gamma be available which evaluates 
T(z) for 0 < 2 < 2. (See [3].) 

(2) Replace f, = Poe(x)/I(a + n+ 1) 
by fn = Pat(x)XT(a + 1)/T(a +n + 1) 

(3) Replace fo + 22 una1fm = [2 + (a? — 1)7I*/T(a + 1) 
by fot 2d fan = [e+ (a? — 1)" 

(4) Replace Po(x) = (a +n + 1)fn 
by P.( 2) = [T(a + 2 + 1)/Me + Dif 

(5) Omit the statements 
ifa<1 then. c := gamma(1 + a) else 


begin 
m = entier(a) — 1;¢ := gamma(a — m); 
for n := 0 step 1 until m doc := (a—n)Xc 
end; 
(6) Replace sum := (x + 21) f a/c; 
by sum := (a + 21) Ta; 
(7) Replace P1[0] := cx P1[0]; 
by c:=1; 


(8) During computations it sometimes happens that entéer(alpha) — alpha = 0 
and consequently the process stops. We remark that if entier(alpha) — 
alpha = 0 this algorithm accomplishes the same as the procedure integer 
Legendre 1, although in an inefficient manner. To continue the computa- 
tions we propose to replace 
ifz <1 \V nmaz <0 V entier(alpha) — alpha = 0 then 
by 
ifx <1\ nmaz < 0 then 


The same tests as described by Gautschi were run on the Burroughs B6700 and 
Philips P9200 digital computers of the Computer Center of the Technological Uni- 
versity at Eindhoven and were found to be in complete agreement. 
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COLLECTED ALGORITHMS FROM CACM 


ALGORITHM 260 

6-J SYMBOLS [Z] 

J. H. Gunn (Reed. 13 Nov. 1964) 

Nordisk Institut for Teoretisk Atomfysik, Or aihasin, 
Denmark 


real procedure SJS (J/1, J2, J3, Ll, L2, 13, factorial); 
value J/1, J2, J3, £1, L2, L3; 
integer J1, J2, J3, L1, £2, L3; 
array factorial; 
comment SJS calculates the 6-7 symbols defined by the fol- 
lowing formula 
i j2 a _ A(Jl, 72, 73) A(J1, 12, I3)A(L1, 72, 13)A(11, 12, 73) 
W1203f & do. (—1)#(e +1) (2-71 —j2—78) (2 -j1—12—18)! 
(2—l1—j2—13) !(2—11— 12-73) 1(71+92+11+12—2z)! 
(G2+738+12+-13 —z)!(G8+71+13+11—z) !) 


where 
At er (at+b—o!(a—b+oce!(-a+b+olfh 
ce mas Gite. 1 


and where jl = J1/2, j2 = J2/2, 73 = J3/2,11 = L1/2,12 = L2/2" 
13 = L£3/2. [Reference formula 6.3.7 page 99 of EpMonps, A. R. 
Angular momentum in quantum mechanics. In Investigations 
in Physics, 4, Princeton U. Press, 1957]. The parameters of the 
procedure J1, J2, J3, D1, £2, £3 are interpreted as being twice 
their physical value, so that actual parameters may be inserted 
as integers. Thus to calculate the 6-7 symbol 


as, 


2 2 Of 


the call would be SJS (4, 4, 0, 4, 4, 0, factorial). The proce- 
dure chécks that the triangle conditions for the existence of a 
coefficient are satisfied and that jl + 72 + 78, jl + 12 + 23, 
11 + j2 + 13 and /1 + 12 + 78 are integral. If the conditions 
are not satisfied the value of the procedure is zero. The parame- 
ter factorial is an array containing the factorials from 0 up to 
at least 1 + largest of 71 + j2 + 78,71 + 12 + 23,11 + j2+ 28 
and 71 + 12 + 73. Since in actual calculations the procedure 
SJS will be called many times it is more economical to have the 
factorials in a global arrav rather than compute them on every 
entry to the procedure. The notation is consistent with that 
used in the procedure for calculating Vector-coupling coeffi- 
cients. See Algorithm 252, Vector Coupling or Clebsch-Gordan 
Coefficients [Comm. ACM 8 (Apr. 1965), 217]; 
begin integer w, wmin, wmaz; 
real omega; 
xeal procedure delia (a, b, c); 
value a, b, c; 
integer a, b, c; 
begin delta := sqrt (factorial [(a+b—c) +2] 
X factorial [(a—b-+c) +2] 
X factorial [(—a+b+c)+2]/factorial [(a-+b+e+2) +2]) 
end delta; 
if Ji+J2 <J3 V abs(J1 — J2) > J3 JV JI 4+J24+ 53 = 
2X ((J1+J2+J3) +2) 
VJ1+ 12 < £3 V absWJ1—L2) > D3 VJI+ 22+ 03 ~# 2x 
((J1+£2+L3) +2) 
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V Li+ J2 < L38 V abs(L1—J2) > 13 \V LI +J2+ 18 #2*xX 
((L1+J2+L3) +2) 
V Li+ £2 < J3 V abs(L1-L2) > J3V 1+ 224+ 43 ~2&X 
((L1+L2+J3) +2) 
then S/S := 0 else 
begin 
omega := 0; 
wmin := J1 + J/2+ J3; 
if wmin <J1+ 124+ 23 then wmin 
if wmin < L1 + J2+ 23 then wmin : 
if wmin < L1 + 12+ J3 then wmin : 
wmax := J1+J24+ D1 + £2; 
if wmax > J24+J/3+ 124+ £23 then wmaz 
L2 + LB; 
“af wmaz > J34+-714 2384+ 21 then wmaz 
£3 + 21; 
for w := wmin step 2 until wmaz do 
omega := omega + (if w=4X (w+4) then 1 else —1) 
X factorial [w+2+1]/(factorial .[(w—J1—J2—J3) +2] 
X factorial [(w—J1—L2—L3) +2] 
X factorial [(w-L1—J2—L3) +2] 
X factorial [(w—L1—L2—J3) +2] 
X factorial [((J1+/2+21+L2—w) +2] 
X factorial [((J2+J/3+L2+13—w) +2] 
X factorial [(J/84+J1+£3+L1—w) +2]); 
SJS := delta (J1, J2, J3) X delta (J1, L2, L8) 
X delia (L1, J2, L3) X delta (L1, L2, J3) X omega; 
end 
end SJS 


‘a J1 4+ 12 + 13; 
Li + J2 + 13; 
Li + £2 + J3; 


{| 


= J2+ 73 + 


<= J3+J1+ 
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ALGORITHM 261 then NJS := 0 else 

9-J SYMBOLS [Z| begin NJ := 0; 

J. H. Gunn (Reed. 18 Nov. 1964) ; aH Ge bap onaer then kmin := abs(J11~J33); 

Nordisk Institut for Teoretisk Atomfysik, Copenhagen, if kmin < abs(J12—J23) then kmin := abs(J12—J23); 
Denmark kmax := J21 + J32; 

real procedure NJS(J11, J12, J13, /21, J22, J23, J31, J32, J33, if kmax > J11 + J33 then kmax := J11 + J33; 


if kmax > J12 + J23 then kmazr := J12 + J23; 
for k := kmin step 2 until kmaz do 
NJ := NJ + (if k=2X (k+2) then 1 else —1) X (k+1) X 
SJS(J11, J21, J31, J32, J33, k, factorial) X 
SJS(J12, J22, J32, /21, k, J23, factorial) < 
SJS(J18, J23, J33, k, J11, J12, factorial) ; 


factorial) ; 
value J11, J12, J13, J21, J22, J23, J31, J32, J33; 
integer J11, J12, J13, J21, J22, J23, J31, J32, J33; 
array factorial; 
comment NJ/S calculates the 9-7 symbols defined by the follow- 
ing formula 


NJS := NJ 
jll 12 713) adh eles = 17 end 
j2l 722 j23}= > (—ymarpayey jai 731\ end NJS 
531 732 733 j32 733k J 


jl2 j22 3732\ Ay 523 

j21_ k jaatlk gll 712s. 
where jll = J11/2, j12 = J12/2, 718 = J18/2, j21 = J21/2, 
j22 = J22/2, 723 = J23/2, j31 = J31/2, j32 = J32/2, 733 = 
J33/2 [Reference formula 6.4.3 page 101 of Epmonps, A. R. 
Angular momentum in quantum mechanics. In Investigations 
in Physics, 4, Princeton U. Press, 1957]. The parameters of the 
procedure J11, J12, J18, J21, J22, /23, J31, J32, J33 are inter- 
preted as being twice their physical value, so that actual param- 
eters may be inserted as integers. Thus to calculate the 9-7 


symbol 
2 2 0 
2 2 0 
000 


the call would be NJS (4, 4, 0, 4, 4, 0, 0, 0, 0, factorial). The 

procedure checks that the triangle conditions for the existence 

of a coefficient are satisfied and that 711 + j21 + 731, j21 + 

j22 + 728, 78l + 782 + 783, JIL + 712 + 718, 712 + 722 + 

32, j13 + j23 + 733 are integral. If the conditions are not 

satisfied the value of the procedure is zero. The parameter fac- 

torial is an array containing the factorials from 0 up to at least 

1 + largest of gli + j21 + 731, j21 + 722 + 728, 731 + 732 + 

9338, gll + g12 + 718, 712 + j22 + 782, 718 +- 723 + 733. The 

procedure makes use of the procedure S/S [Algorithm 260, 

6-7 symbols, Comm. ACM 8 (Aug. 1965), 492], for calculating 

6-7 symbols; 

begin integer k, kmin, kmax; 

real NJ; 

if Jil + J21 < J31 V abs(J11—J21) > J31 V Jil + J21 + 
J3l #2 & (JiN+/214+J781) +2) 

V J21 + J22 < J23 V abs(J21—J22) > J23 VV J21 + J22 + 
J23 % 2X ((J21+J22+./23) +2) 

V J31 + J382 < J33 V abs(J31—J32) > J33 VV J31 + J32 + 
J33 # 2X ((J31+J7382+/33) +2) 

V Jil 4+ J12 < J13 *V abs(J11—J12) > J13 V J11 + J12 + 
J13 # 2X (VJ11+J12+./13) +2) 

V J12 + J22 < J382 V abs(J12—J22) > J382 V J12 + J22 + 
J32 % 2X (J12+/22+732) +2) 

V J13 + J23 < J383 V abs(J13—J23) > J33 V J13 + J23 + 
J33 4 2 & ((J13+/23+/33) +2) 
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ALGORITHM 262 

NUMBER OF RESTRICTED PARTITIONS OF N 
[Al] 

J. K.S. McKay (Recd. 7.Dec. 1964 and 9 Mar. 1965) 

Computer Unit, University of Edinburgh, Scotland 


procedure set (p, N); integer N; integer array p; 
comment The number of. partitions of n with parts less than 
or equal to m is set in p[n, m] for all n, m such that N > n => 
m> 0. 
REFERENCES: 
1. Gupra, H., Gwytuer, C. E., anp Miuurr, J. C. P. Tables of 


partitions. In Royal Society Mathematical Tables, vol. 4, 
Cambridge U. Press, 1958. 
2. Harpy, G. H., anp Wrieut, KE. M. The Theory of Numbers. 
Ch. 19, 4th ed., Clarendon Press, Oxford, 1960; 
begin integer m,n; 
plO, 0] := 1; 
for n := 1 step 1 until N do 
begin pjn, 0] := 0; 
for m := 1 step 1 until n do 
pin, m] := pin, m—1] + 
pi[n—m, if n—m<m then n—m else m] 
end 
end set 
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COLLECTED ALGORITHMS FROM CACM 


ALGORITHM 263 

PARTITION GENERATOR [A1} 

J. K.S. McKay (Reed. 7 Dec. 1964 and 9 Mar. 1965) 
Computer Unit, University of Edinburgh, Scotland. 


procedure generate (p, N, position, pin, length); 
integer array p, pin; integer JN, length, position; 
comment The partitions of N may be mapped in their natural 
order, 1 — 1, onto the consecutive integers frdm 0 to P(N)—1 
where P(N) (=p[N, N]) is the number of unrestricted partitions 
of N. The array p is set by the procedure set [Algorithm 262, 
Number of Restricted Partitions of N, Comm. ACM 8 (Aug. 
1965), 493]. On entry posttion. contains the integer into which 
the partition is mapped. On exit length contains the number of 
parts and pin[1: length] contains the as of the partition in 
descending order. 
REFERENCE: 
1. Lirttewoop, D. E. The Theory of Group Chuitaclers: Ch. 5, 
2nd ed., Clarendon Press, Oxford, 1958; 
begin integer m, n, psn; 
n:= N; psn := position; length := 0; 
A: length := length +1; m:= 1; 
B: if p[n,m] < psnthen beginm := m-+1; goto Bend else 
if p[n, m] > psn then : 


C: begin 
pin[length] := m; psn := psn — p[n, m—1]; ni=n-— mM; 
ifn ~ 0 then goto A; goto D 
end 


else m := m+I1; gotoC; 
D: end generate 
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COLLECTED ALGORITHMS FROM CACM 


ALGORITHM 263A 

GOMORY 1 [H]} 

H. LANG@MAACK (Recd. 17 June 1964 and 13 May 1965) 

Mathematisches Institut der Technischen Hochschule, 
Miinchen, Germany 


When testing Algorithm 153 GOMORY [F. L. Bauer, Comm. 


ACM 6 (Feb. 1963), 68] in ALaou on the SIEMENS 2002 and 
TELEFUNKEN TR4 computers and in PROSA (assembler 
code) on the SIEMENS 2002 computer I found that some correc- 
tions were necessary. After discussions with Prof. Dr. Bauer I 
wish to submit the following remarks on Algorithm 153 GOMORY 
and on Certification of Algorithm 153 GOMORY [B. Lefkowitz and 
D. A. D’Esopo, Comm. ACM 6 (Aug. 1963), 449]. The improved 
algorithm GOMORY 1 is presented below. 

1. The evaluation of the integer number ¢[j] in the algorithm 
GOMORY or t in the revised form of the algorithm GOMORY is 
not correct, since é[j] (or t) must be the largest integer number 
such that.column j of the matrix a is not lexicographically less 
than column | multiplied by ¢[j] (or 4), provided such a ¢[j] (or ¢) 
exists. A suitable change is incorporated in the algorithm 
GOMORY 1 given below. 

2. The second remark deals with the fact that a theoretically 
correct ALGOL program may not necessarily run correctly when 
translated into a particular machine language and run on that 
machine. In general real numbers are represented only approxi- 
mately and the mathematical division indicated by the ALGoL 
operator / is transformed into the approximate operation of ma- 
chine division. There are two possibilities that the algorithm 
GOMORY might fail: 

A. The lambda calculated by 


abs (alr, j]/¢t[9]) 
in the algorithm GOMORY (or by 

—afr, ji/t 
in the revised form of the algorithm GOMORY) may be less than 
the exact theoretical value of lambda. This may lead to columns 
which are lexicographically negative, but this situation is not 


allowed. 
B. The quantities c[j] (or c) calculated by 


entier (a[r, j]/lambda) 


may be different from the exact values, a situation which may 
lead to incorrect matrix transformations. 

To avoid these unwanted effects the author suggests remedying 
the problem in the following way: 

a. Since lambda is only an intermediate result, it is proposed 
to keep the numerators and denominators of the candidates for 
lambda separate and to compare them by cross multiplication. 

b. It is preferable to compute 

A/lambda 
by 


(A X denominator of lambda)/numerator of lambda 


where A is an integer type expression. 
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c. In the algorithm GOMORY there are statements of the 
form 


C := entier (A/B) 


where C is an integer variable, and A and B are integer type 
expressions. In order to prevent roundoff errors the result C 
should be checked to make sure that 


CXB<A<CXB+B 


and corrected if these inequalities are not satisfied. 

The corrections, a, b, c, lead to a program which cannot fail 
unless the products developed should overflow. However, anyone 
who wishes to use the algorithm may prefer to do some analysis 
of the particular division his computer performs and seek an al- 
ternative which is not as time-consuming. Many machines have a 
built-in Euclidean division instruction for integer numbers 
which would be very useful for Gomory’s algorithm. Unfortu- 
nately ALGOL translators are not likely to produce this instruction 
in their object programs since an arithmetical expression A/B is 
a real type expression by definition. 


procedure Gomory 1 (m, n) transient: (a) exit: (no solution); 
value m, 7; 
integer ™, n; 
integer array a; 
label no solution; 
comment Gomory 1 algorithm for all-integer programming. 
The objective of this procedure is to determine the integer solu- 
tion z{1], --- ,z[n—1] of alinear programming problem with inte- 
ger coefficients only. In other words: The problem is to find 
integer numbers 
z(l], --- , z[n—1) 
minimizing the objective function 
alO, 1] X 2[1] + --- + 4/0, n—1] X 2[n—1] 
under the constraints 
z{1] > 0,--- ,2[n—1} 2 0 
and 
alt, 1] X x[{l] + --- + afz, n--1] X 2[n—1] < alt, n) 
for? =1,---,m—n+l (2<n<™m). 

The tableau matrix a used by the procedure consists of m+1 
rows and n columns. The components are alt, j] for 7 = 0, 1, 
Ss »mj= 1l,--- ,n. 

The input values for the components are given partly by the 
problem itself (see above). The remaining components must 
have been previously assigned in the following manner: 

a[0, n] := 0 
and 

afi, j]) := ifi =j7 +m—n-+1 then —1 else 0 
fori = m—n+2,---,m,j = 1,---,n. The tableau columns, 
with the exception of the last column, have to be lexicographically 
positive. 

The algorithm is finished if all entries in the last column, 
except the topmost entry, are non-negative. Then —a(0, n) is 
the value of the objective function. The optimal solution 
z(1], --: ,z[n—1] is given by the n—1 components a[m—n+2, n], 

- ,a[m, n] of the last column of a. 

The exit no solution is used if a row is found which has a 
negative entry in the last column, but otherwise only non- 


negative entries; 


COLLECTED ALGORITHMS (cont.) 


begin integer i, k, j, 1, r, c, t, s, lambda num, lambda denom; 
integer procedure Euclid (u, v); 
value 4%, v; 
integer 4, 0; 
begin integer w; 
w := entier (u/v); 
[8: if w Xv > uthen 


begin w := w—1; go to L8 end; 


L9: if (wt+1) x »v < u then 
begin w := w+1; go to L9 end; 


Euclid := w 
end Zuclid; 
L1: fori := 1 step 1 until m do if a [i, n] < 0 then 


begin r := 7; 
go to end; 
L2: for k := 1 step 1 until n—1 do if a[r, k] < 0 then go to L4; 
go to no solution; 


go to L2 end; 


4: l:=k; 
for j := k+1 step 1 until n—1 do if a[r, j] < 0 then 
begin 7 := 0; 


L3: if alt, j] < a[z, 1] then 1 := 7 else 
if alt, 7] = a[z, l] then 


begin 2 := i+1; goto L3 end 
end; 
8s := 0; 
L5: if als, 1] = 0 then 
begin s := s+1; go to L5 end; 


lambda num := —al[r, I]; 
lambda denom := 1; 
for j := 1 step 1 until /—1, 1+1 step 1 until n—1 do 
if alr, j] < 0 then 
begin 
for 7 := 0 step 1 until s—1 do if a[, j] ¥ 0 then go to L7; 
t := Euclid (a[s, j], als, l)); 
if (Xal[s, 1] = als, j]) A (£>1) then 
begin 7 := s; 
6: t:= 7t+1; 
if ¢Xal[z, 1] = alt, j] then go to L6 else 
if ¢Xalz, 1] > alt, 7] then ¢ := t—1 


end; 

if —al[r, j] X lambda denom > t X lambda num then 

begin lambda num := —al[r, j]; lambda denom := t,end; 
L7: end; 


for j := 1 step 1 until /—1, 1+1 step 1 until n do 
begin c := Euclid (a[r, j] X lambda denom, lambda num); 
if c # 0 then 
for 1 := 0 step 1 until m do 
alt, j] := aft, J] +e X alt, 1] 
end; 
go to LI; 
end: 
end 


CERTIFICATION OF ALGORITHM 263A [H] 

GOMORY 1 [H. Langmaack, Comm. ACM 8 (Oct. 
1965), 601-602] 

L. G. Protu (Reed. 15 Sept. 1969) 

Department of Mathematics, University of 
Southampton, U.K. 

KEY WORDS AND PHRASES: 


variables, dual method 
CR CATEGORIES: 5.41 


Algorithm 263A was coded in Augou for an ICL 1907 computer 
and ran successfully without alteration. Execution times and 


linear programming, integer 
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pivot counts for a sample of 12 published examples are given in 
Table I. 

Problem 1 is taken from Haley [1,-p. 127]. Problems 2, 3, and 4 
are Balas [2, ex. 2; 3, 4] in which the variables were not restricted 
to be 0 or 1. Problems 5-10 are IBM [8, test problems 1-5 and 9]. 
Problems 11 and 12 are Pierce [4, ex. 1, 2]. 

Wilson [5] has shown that it is possible to derive potentially 
stronger cuts than those of Gomory with little extra computation. 


TABLE I 
Problem m n No. of pivots Time (sec.) 
1 13 10 13 1 
“2 17 11 8 1 
3 13 10 35 1 
4 18 13 600T 
5 14 8 9 1 
6 14 8 16 1 
7 10 8 16 1 
8 30 16 17 2 
9 30 16 2569 248 
10 65 16 600T 
11 4l 32 5 2 
12 3l 27 5 2 


t termination not reached 


Wilson’s cuts can be incorporated into GOMORY 1 by means of 

the following alterations: 

(a) in the declarations at the head of the procedure body, insert 
Boolean null, nflag; 


(b) in the line commencing L4: 1 := k; insert the stat ment 
null := true; 
(c) replace L7: end; by 
Li: end 
else null := false; 
ce := Euclid (alr, n]Xlambda denom, lambda num); 
s:= —(c+1); t:= —alr,n]; nflag := true; 


if null then go to L10; 

for j := 1 step 1 until n — 1 do if a[r, j] > 0 then 

begin c := Euclid (a[r, j|Xlambda denom, lambda num); 
if s X afr, j] << tXcthen 
begin ¢ := a[r, j]; s := ¢; 

end; 


T10: if s X lambda num < t X lambda denom then 
begin lambda num := if nflag then 100 X ¢ — 1 else i; 
lambda denom := if nflag then 100 X s else s 
end; 
(d) replace the line commencing 
begin c := Euclid(a[r, j]xlambda denom, lambda num); 
by 

begin c := if lambda denom ¥ 0 then Euclid(a[r, j]< 
lambda denom, lambda num) 
else if alr, j] < 0 then —1 else 0; 


nflag := false end 


TABLE II 
Problem No. of pivots Time (sec.) 
7 7 1 
9 2238 235 
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With these alterations some reduction in the number of pivots 


needed to solve problems 7, 9 was observed. New pivot counts 
and execution times for these problems are given in Table IT. 
Execution times for the problems not listed in Table II were 
unaltered. 


I. 


2. 


on 


REFERENCES: 

Hautey, K. B. Mathematical Programming for Business and 
Industry. Macmillan, New York,. 1968. 

Bauas, E. An additive algorithm for solving linear programs 
with zero-one variables. Oper. Res. 13 (1965), 517-545. 


. Haupt, J. 25 integer programming test problems. Working 


Paper No. 48, Grad. Sch. of Bus., Stanford U., Stanford, 
Calif., 1964. 


. Prerce, J. F. Application of combinatorial programming to a 


class of all zero-one integer programming problems. Man. 
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. Wiuson, R. B. Stronger cuts in Gomory’s all-integer integer 


programming algorithm. Oper. Res. 16-(1967), 155-156. 
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ALGORITHM 264 

MAP OF PARTITIONS INTO INTEGERS [Al] 
J.K.S. McKay (Recd. 7 Dec. 1964 and 9 Mar. 1965) 
Computer Unit, University of Edinburgh, Scotland 


integer procedure place(p, n, pin); value n; 
integer array D; pin; integer n; 

comment place is the inverse of. the procedure generate [Al- 
gorithm 263, Partition Generator, Comm. ACM 8 (Aug. 1965), 
493]. The array p is set by the procedure sét [Algorithm 262, 
Number of Restricted Partitions of N, Comm. ACM 8 (Aug. 
1965), 493]. The procedure produces the integer into which 
the partition of n, stored in ‘descending order: of parts in ptn[1] 
onwards, is mapped; 

begin integer j, d; 


ad :=0; 
if n = 0 then go to B; 
vey, 


A: j:=j+1; d:= pln, pinfj]-1] + 4; n= n — ptnlj); 
if n ~ 0 then go to A; 

B: place := d 

end place 
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ALGORITHM 264A 

INTERPOLATION IN A TABLE [E1] 

J. STAFFORD (Recd. 16 Nov. 1964 and 7 June 1965) 

Westland Aircraft Ltd., Saunders-Roe Division, Hast 
Cowes, Isle of Wight, England 


real procedure [IN POL(T, X, I, N, OUT, XOUT, EX POL); 
value X, N; array 7, X; integer I; integer array N; 
real XOUT, EXPOL; Boolean OUT; 

comment Evaluation of a function by polynomial interpola- 
tion in a table of values. 

The values may be specified at arbitrary intervals, at nodes 
of a multidimensional rectangular grid. The interpolation is 
by Neville’s process, repeated in each dimension. 

The given values are arranged in a one-dimensional real array 
T, as follows. The first value in the table, T[0], is D, the number 
of independent variables (or dimensions). It will normally be 
integral (although of type real), but if not then its integral part 
is taken. 7[1], T[2],--- , T[D] are the numbers of values of 
X1, X2,--- , XD, and must be integral. These are followed by 
T{1] values of X1, T[2] values of X2,--: T[D] values of XD. 
The values of each of these independent variables must all be 
distinct and must be arranged in monotonic order. Finally 
come the 71] X T[2] xX --- X T[D] values of the dependent 
variable F(X1, X2,--- , XD), arranged as T[D] sets of T[D—1] 
sets of --- of T[2] sets of 7[1] values of F'. 

The table is represented by a one-dimensional array because 
it is not feasible to use a general D-dimensional array. 

The given values of the independent variables are X{[I] 
([=1, 2,--- , D). N{Z] of the tabulated values X[I] are used to 
interpolate in the Jth dimension. IN POL is the required value 
of the function. The actual parameter corresponding to the 
formal parameter EX POL should be an expression which pro- 
vides the value of JN POL if any of the X[I] is outside the range 
covered by the array 7. If this occurs XOUT is the particular 
value of X[Z] concerned, The variables J, OUT and XOUT are 
declared as formal parameters of JIN POL so that they may be 
used in the actual parameter corresponding to the formal pa- 
rameter EX POL. 

An example of a call of INPOL is Z := INPOL(A, X, K, N, 
OUT, Y, if K=1 then EXTRAPOLATE (A, 1, N, OUT, Y) 
else if K=2 then LIMTAB (A, 2, OUT, Y) else Y—2). If 
X(1] is outside the range covered by the array A this statement 
will use the extrapolatory procedure EXTRAPOLATE (given 
below) to provide a value for ZNPOL. If X[2] is out of range 
the procedure LIMTAB (also given below) will be used to 
replace the value of X[2] by its value at the nearer edge of the 
table, before returning to JN POL to continue the interpolation. 
If some other variable (X(3], say) is out of range the value of 
IN POL is taken as X[(3] — 2. 

The procedures INPOL, EXTRAPOLATE and LIMTAB 
were tested on an ICT Atlas computer. They were also tested 
on a National-Elliott 803 computer, after being altered to 
conform to the restrictions of the 803 ALGoL compiler. The 
tests were for D = 0, 1, 2 and 3, and included all special cases; 

begin integer D, J, K, L, M,Q, XI; 


procedure FORS3(N, P, V, UB); 
value N; integer NV; procedure P; 
integer array V, UB; 
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comment Nesting of for statements, adapted from procedure 
Fors 1 [Algorithm 137, Comm. ACM 6 (Nov. 1962), 555]; 
begin integer J; 
if N = 0 then P else for J := 1 step 1 until UB[N] do 
begin V[N] := J; FORS3(N—1, P, V, UB) end 
end FORS3; 


real procedure NEV(X, AX, SAX, AY, SAY, N); 
value X, SAX,SAY, N; real X; integer SAX, SAY,N; 
array AX, AY; 
comment One-dimensional interpolation by Neville’s process. 
N values of the independent variable are used in the inter- 
polation, namely, N consecutive elements of array AX start- 
ing at subscript SAX. The corresponding values of the de- 
pendent variable are the N consecutive elements of array 
AY starting at subscript SAY. X is the value of the inde- 
pendent variable for which the value of the dependent vari- 
able (namely, NEV) is to be interpolated; 
begin integer I,J, NJ, KI; array F{0: N—1]; 
for J := 0 step 1 until N — 1 do FiJ] := AY[SAY + J]; 
for J := 1 step 1 until N — 1 do 
begin 
NJ :=N—J-—1; 
for J := 0 step 1 until NJ do 
begin 
KI := SAX +7; 
FU) := (FU+1)—-FU)) K (X—AX[KI))/ 
(AX[KI+J]—AX[KI])+F I] 
end; 
NEV := F(0] 
end NEV; 


D := entier (T[0]); 

comment D = number of dimensions. The special case D = 0 
implies that the tabulated function F is a constant, the value 
of which is 7[1]. The same value is taken if D < 0; 

if D < 1 then INPOL := T[1] else 


begin XJ := 1; 
for J := 1 step 1 until D do 
begin 


if N(J] < 2then N[I] := 2; 

if N(Z) > T[] then N[J] := Tl); 

comment Adjustment of number of points used for inter- 
polation. Normally N[J] must be at least 2, and if N[I] < 2 
it is set equal to 2. N[I] also may not exceed the number 
of values of the independent variable in the correspond- 
ing dimension (namely, T[Z]), and if it does so it is re- 
duced accordingly. 

The combination of these two tests, in this order, 
permits as a special case one-point interpolation in any 
particular dimension (I, say), if T[Z] = 1. This implies 
that the dependent variable is independent of X[JZ]. If 
this is intended then the actual parameter corresponding 
to the formal parameter EXPOL must be a function 
designator which (if called for) replaces the value of 
XOUT by the single value of the 7th variable from the 
array 7. (Procedure LIMTAB may be used for this 
purpose. ) 

Since array N is called by value none of these adjust- 
ments affects the values of N[Z] in the nonlocal array N; 

XI := XI+ Nj 
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end J; 
begin array F(1: XIJ—N{[I1]]; integer array V, XINIT, 
YINC [1: D}; 


procedure ONEWAY; 
comment Performs an interpolation in the first dimen- 
sion. If this is the last of a set of N{2] such interpola- 
tions, a further interpolation is performed in the second 
dimension, and so on to as many higher levels as neces- 
sary; 
begin F[V[1]] := NEV (X[1], T, XINIT(I1], T, Q, L); 
I:=1; M :=0; 
for K := 1 step 1 until D — 1do 
begin Q := Q + YINC[K]; 
if V[K] + N[K] then go to CONTINUE else 
begin M := M + N[K]; 
F[IM+V[K+1]] := NEV (X[K+1], 7, XINIT(K+1], 
F, I, N{K)); 
I:=I+ N[K] 
end 
end; 
CONTINUE: 
end ONEWAY; 


Q:= XI:=D+1; M :=1; 
for [ := 1 step 1 until D do 
begin K := XI + T{Z] — 1; 
OUT := (X[Z] — T[XI]) X (X[Z] — T[K]) > 0; 


if OUT then 
begin 
XOUT := X[I]; INPOL:= EXPOL; X[{I|:= XOUT; 
if T(0] < 0 
then begin K := K + T[0]; T[0) := D end 
end; 


comment If X[J] is outside the range covered by the 
table, the extrapolatory expression E.X POL is evalu- 
ated. It is expected that it will often be or contain one 
or more function designators, together with criteria 
for choosing between them, as in the example above. 
EX POL may incorporate, e.g., any of the following 
alternatives. In the first and third of these the side 
effects are the important ones, the value assigned to 

EX POL being merely a dummy to conform with Section 

5.4.4 of the Revised Report on Algol 60 [Comm. ACM 6 

(Jan. 1963), 1-17]. 

1. EX POL may be a function designator which uses 
the interpolatory formula to extrapolate by executing 
the statement OUT := false and returning to IN POL. 
The last N[Z] values of X[Z] are used in the formula, 
but BX POL may arrange to use the first N[Z] values 
instead (which will usually be preferable if X[J] lies 
beyond the lower limit of the table) by executing the 
statement 7[0] := N[Z] — T[Z] (in which the value of 
the local N[J] is to be used if it differs from that of 
the nonlocal N[I]). The procedure EXTRAPOLATE 
(given below) may be used for this purpose. 

2. HX POL may use some other formula to extrapolate, 
after which it must return to IN POL without altering 
the value of the Boolean variable OUT’. If this is all 
that is required the actual parameter corresponding 
to EX POL may be an ordinary arithmetic expression 
containing no function designators. 

3. EX POL may be a function designator which con- 
strains X [I] to lie within range by replacing it by the 
value of the Jth variable at the nearer limit of the 
table (or by some other value). In doing this it must 
operate on the value of XOUT and not directly on 
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X{[I]. The nonlocal array X will not be affected. 
EXPOL must also execute the statement OUT := 
false before returning to ZNPOL. The procedure 
‘LIMT AB (given below) may be used for this purpose. 
4. EXPOL may do something else and continue the 
program without returning to INPOL (e.g., by a go 
‘to statement referring to a nonlocal label). This 
should be considered an error exit as the value of 
INPOL will be undefined (see Section 5.4.4 of the 
Revised Report on Algol 60); 
‘if OUT then go to B; 
comment If OUT = true on exit from JN POL then 
extrapolation has occurred: The converse is not neces- 
sarily true, as it depends on the nature of the actual 
parameter corresponding to the formal parameter 
EX POL; 
J := XI; 
DL := J+K) + 2; 
if (X[(J)—T\]) < (X{)-T[L]) > 0 then J := L else 


K :=1; 
if K —J> |1then goto A; comment Find X[/] in 
table; 


L:= K -— N{I} + 2; 
if L < XI then L := XI else 
begin 

K:=XI+ TI) -—-NiQ; if 2> KthenLl := K 
end Adjustment near edge of table; 
Q:=Q24+T7TN+(@h-XD XM; XINIT | :=L; 
XI := XI + 7); 
YINC(] := M X (T[I| — Gif 7=1 then 0 else N[J})); 


M:=MXT(I] 
end I; 
V[(D) :=1; LD := Nfl]; 


for I := 1 step 1 until D — 1 do N{[J] := N{I+1); 
FORS3(D—1, ONEWAY,V,N); INPOL := F[M-+1| 
end scope of F 
end D> 1; 


end IN POL; 
real procedure EXTRAPOLATE(T, I, N, OUT, XOUT); 


array T; integer J; integer array VN; Boolean OUT; 
real XOUT; 


comment This function designator is intended for use in the 


actual parameter corresponding to the formal parameter 
EX POL in a call of procedure JINPOL. The parameters have 
the same significance as in INPOL. 

EXTRAPOLATE arranges for the interpolatory formula to 
be used to extrapolate for the Jth variable, and for the first 
N[I] values of this variable to be used in the formula instead 
of its last N[J] values if it lies beyond the lower limit of the 
table; 


begin integer J, K; 


OUT := false; EXTRAPOLATE := 0; 
comment This statement assigns a dummy value to EXTRA P- 
OLATE to conform with Section 5.4.4 of the Revised Report 
on Algol 60; 
J:=1; for K := 0step1 until J —1doJ :=J+ 7[K); 
if T[J] = 1 then XOUT := Tl[J] else 
if abs(XOUT — T{J]) < abs(XOUT—T[J+T[I]—1]) then 
begin K := N[I]; 
if K < 2then K := 2; 
if K > T{I) then K := T{J); 
T[0] := K — Td] 


end 


end EXTRAPOLATE; 


COLLECTED ALGORITHMS (con’t.) 


real procedure LIMTARB(T, I, OUT, XOUT); 
array T; integer J; Boolean OUT; real XOUT; 
comment This function designator is intended for use in the 
actual parameter corresponding to the formal parameter 
EXPOL in a call of procedure IN POL. The parameters have 
the same significance as in IN POL. 

LIMTAB replaces the value of XOUT, which is outside the 
range of the table, by the value of the 7th variable at the nearer 
edge of the table; 

begin integer J, K; 
J:=1; for K := 0step1 until J—1doJ :=J + T[K]; 
K:=J+T7 I) — 1; 
LIMTAB := XOUT := if abs(XOUT-—T{J]) > 
abs(XOUT—T[K]) then T[K] else Tl}; 
comment This statement assigns a dummy value to LIMTAB 
to conform with Section 5.4.4 of the Revised Report on 

Algol 60; 

OUT := false 
end LIMTAB 
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ALGORITHM 265 

FIND PRECEDENCE FUNCTIONS [L2] 

Nixuaus Wirtu (Recd. 14 Dec. 1964 and 22 Dec. 1964) 
Computer Science Dept., Stanford U., Stanford, Calif. 


procedure Precedence (M, f, 9, », fail); 

value n; integer ; integer array M, f,g; label fail; 

comment M is a given n X n matrix of integers designating one 
of the four relations <, =, >, °. The identifiers Js, eq, gr des- 
ignate variables declared outside the procedure to which distinct 
integers representing the relations <, =, > have been assigned. 
This procedure then determines integers f[{1] . . J{n] and g[1] 
. . .g{n] such that for all z, 7, f(t] M{z, 7] gly] is true and so that the 
smallest of these integers is +1. o designates the empty relation, 
so that x oy is true for arbitrary 2, y. If M is such that no f and g 
exist which satisfy all n? relations, then control is transferred to 
the label parameter fazl. This procedure has been used to deter- 
mine the precedence functions of symbols in a given precedence 


grammar (see [FLtoyp, R. Syntactic analysis and operator 


precedence. J.ACM 10 (1963), 316-333]); 
begin integer i, j, k, kl, fmin, gmin; 

procedure fizrow (i, 1,2); valuei,l,z; integer 1, l, x; 
begin integer J; f{¢] := g(t] :-+ 2; 

ifk = kithen 

begin if M[i, k] = Ils (A f[¢] > g{k] then go to fail else 

if M[i, k] = eq Af{t] glk] then go to fail 

end; 

for j := kl step —1 until 1 do 

if M[i, j] = ls A fli] => gj] then fixcol (7,7, 1) else 

if M[t,j] = eq A flt] ¥ glJ] then fizcol (i, 7, 0) 
end fixrow; 
procedure fixcol (1, j, x); value l, j, x; integer l,j, x; 
begin integer 7; g[j] := f{U] +2; 

if k ¥ ki then 

begin if M[k,j] = gr A f[k] < g[j] then go to fail else 

if M{k,j] = eq A flk] ¥ glj] then go to fail 

end; 

fori := k step —1 until 1 do 

if M[t,j] = gr A fit] < gf] then fizrow (i, j, 1) else 

if M[z,j] = eq A fli] ¥ gfj] then fizrow (i, j, 0) 


end fixcol; 

kl := 0; 

for k := 1 step 1 until n do 
begin fmin := 1; 


for j := 1 step 1 until k1 do 
if M[k, 7] = gr A fmin < g[j] then fmin := g{j]+1 else 
if M[k, j] = eq A fmin < g[{j] then fmin := g[J]; 

flk] := fmin; 

for 7 := k1 step —1 until 1 do 
if Mk, 7) = ls A fmin => g{j] then fixcol (k, j, 1) else 
if M{k, j].= eq \fmin > g{j] then fixcol (k, j, 0); 

kl := ki+1; gmin := 1; 

fori := 1 step 1 until k do 
if M[i, k] = Ils A ft] = gmin then gmin := f[t]+1 else 
if M[¢, k] = eq A flt] > gmin then gmin := f[1]; 

glk] := gmin,; 

for i := k step —1 until 1 do 
if M(z,k] = gr A f[t] < gmin then fizrow (i, k, 1) else 
if M[i, k] = eq A flt] < gmin then fizrow (i, k, 0) 

end < 
end Precedence 


It 
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ALGORITHM 266 
PSEUDO-RANDOM NUMBERS [G5] 
M. C. Pike anno I. D. Hitn 
(Reed. 15 Feb. 1965 and 6 July 1965) 
Medical Research Council, London, England 


real procedure random (a, b, y); 
reala, 6; integer y; 
comment random generates a pseudo-random number in the 
open interval (a, b) where a < 6. The procedure assumes that 
integer arithmetic up to 3125 X 67108863 = 209715196875 is 
available. The actual parameter corresponding to y must be an 
integer identifier, and at the first call of the procedure its value 
must be an odd integer within the limits 1 to 67108863 inclusive. 
If a correct sequence is to be generated, the value of this inte- 
ger identifier must not be changed between successive calls of 
the procedure; 
begin 
y := 38125 X y; y := y — (y+67108864) x 67108864; 
random := y/67108864.0 X (6~a) +a 
end random 


Coveyou [2] showed that for multiplicative congruential 
methods of generating pseudorandom numbers, the correlation 
between successive numbers will be approximately the reciprocal 
of the multiplying factor. Greenberger [3] showed further that the 
factor should he considerably less than the square root of the 
raodulus. 

The method of Algorithm 133 [1] satisfies Greenberger’s condi- 
tion, but since the reciprocal of its multiplying factor is as high as 
0.2, Coveyou’s result shows that it is very unsatisfactory for pur- 
poses requiring statistically independent consecutive random 
numbers. 

Algorithms 133 and 266 have both been tested by computing a 
number of sets of 2000 successive random integers between 0 and 9, 
dividing each set into 400 groups of 5, and performing the poker 
test [4]. The results were classified in the following seven cate- 
gories: 


(i) all different 
(it) 1 pair 
(tit) 2 pairs 
(iv) 3o0fakind 
(v) 3 of a kind and 1 pair 
(vt) 4o0f a kind 
(viz) 5 of a kind, 
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The following tables resulted: 
ALGORITHM 133 


Run | Starting Value (t) (it) (tit) (iv) (v) (vi) | (vit) 
1 13421773 | 114 193 42 37 7 7 0 
2 22369621 | 111 181 46 40 14 8 0 
3 33554433 | 130 178 48 28 7 6 3 
4 6871947673 | 118 179 51 35 10 5 2 
5 11453246123 | 128 189 44 28 6 4 1 
6 17179869185 | 135 155 45 52 6 5 2 


Expected for each 
Run 


Expected for 725 .76)1209 60/259 . 20/172 . 80/21 .60)10.80/ 0.24 


Total 
ALGORITHM 266 

Run Starting Value (t) (it) (iit) (iv) (v) (vt) (vit) 
1 13421773 132 191 35 38 2 2 0 

2 22369621 140 187 45 27 0 1 0 

3 33554433 129 198 44 25 4 0 0 

4 8426219 107 202 50 37 2 2 0 

5 42758321 101 207 60 25 5 2 0 

6 56237485 | 118 203 42 34 1 2 0 

7 62104023 | 119 206 41 27 6 1 0 

Expected foreach | 120.96) 201.60) 43.20) 28.80; 3.60) 1.80/-0.04 


Run 


Total for 7 Runs 846 


Expected for 846 . 72/1411 . 20/302. 40/201 .60/25.20/12.60) 0.28 


Total 


Combining categories (vz) and (viz) in each case, the observed 
totals give x? values (on 5 degrees of freedom) of 159.0 for Algo- 
rithm 133, and of 3.28 for Algorithm 266. 
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REMARK ON ALGORITHM 266 [G5] 
PSEUDO-RANDOM NUMBERS [M. C. Pike and I. D. 
Hill, Comm. ACM 8 (Oct. 1965), 605] 
M. C. Pixs ann I. D. Hitt (Recd. 9 Sept. 1965) 
Medical Research Council, London, England 

Algorithm 266 assumes that integer arithmetic up to 3125 x 
67108863 = 209715196875 is available, which is not so on many 
computers. The difficulty arises in the statements 


y = 8125 X y; y := y — (y+67108864) & 67108864; 
They may be replaced by 


integer k; 
for k := (for list) do 
begin 
y= ky; 
y := y — (y+67108864) * 67108864 
end; 


where the (for list) may be 

125, 25 (requiring integer arithmetic up to less than 233) 

25, 25, 5 (requiring integer arithmetic up to less than 2?) 
or 

5, 5, 5, 5, 5 (requiring integer arithmetic up to less than 2”) 
according to the maximum integer allowable. The first is appro- 
priate for the ICT Atlas. [And also for the IBM 7090, the second 
for the IBM System/360... Ref.] 

Note. There are frequently machine-dependent instructions 
available which will give the same values as the above statements 
much more quickly, if speed is of much importance. 


REMARK ON ALGORITHM 266 [G5] 
PSEUDO-RANDOM NUMBERS [M. C. Pike and I. D. 
Hill, Comm. ACM 8 (Oct. 1965), 605] 

L. Hansson (Recd. 25 Jan. 1966) 

DAEC, Riso, Denmark 


As stated in Algorithm 266, that algorithm assumes that integer 
arithmetic up to 3125 X 67108863 = 209715196875 is available. Since 
this is frequently not the case, the same algorithm with the con- 
stants 125 and 2796203 may be useful. In this case the procedure 
should read 


real procedure random (a, b, y); 
reala,b; integer y; 

begin 
y = 125 X ys y := y — (y+2796203) X 2796203; 
random := y/2796203 X (b—a) +a 


end 


The necessary available integer arithmetic is 125 « 2796203 = 
348525375 < 2 f 29. With this procedure body, any start value 
within the limits 1 to 2796202 inclusive will do. 

Seven typical runs of the poker-test gave the results: 


start value all different 1 pair 2 pairs 3 3 4- pair 4 3 
100001 129 199 39 3l 2 0 0 
1082857 115 206 45 3l 2 1 0 
724768 120 195 49 32 3 1 0 
78363 130 198 36 31 5 0 0 
1074985 127 189 44 34 4. 2 0 
2567517 124 193 50 28 3 2 0 
2245723 119 202 49 24 4 1 1 


Totals for 7 runs: 
864 1382 312 211 23 7 1 
Totals for 100 consecutive runs with first start value 100001 : 
12023 20297 4301 2837 «863858 —Sss«d8LS 8 
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Certification of Algorithm 266 [G5] 
Pseudo-Random Numbers [M.C. Pike and I.D. Hill, 
Comm. ACM 8 (Oct. 1965), 605] 


Walter L. Sullins [Recd. 12 Feb. 1971] 
School of Education, Indiana State University 
Terre Haute, IN 47809 


Key Words and Phrases: pseudo-random numbers, testing 
random number generators 
CR Categories: 5.5 


The Pike and Hill Algorithm 266 [2] generates pseudo-random 
numbers in a prescribed open interval. Pike and Hill presented 
favorable evidence for the ‘serial and poker tests [1] but omitted 
discussion of frequency tests. 

The purpose of the present certification was to test the hy- 


‘pothesis that the numbers generated by the algorithm are rec- 


tangularly distributed. Nine sequences of numbers in the interval 


(0, 1) were generated, and each was divided into 500 blocks of 


various lengths. In each case the distribution of numbers was tested 
against a uniform distribution, with .1 interval width, by com- 
puting x? on nine degrees of freedom for each of the 500 blocks 
within the sequence. The results are given in the table below. 


Run Starting Block Sequence  Pro- 


value length length portion 

1 32347753 400 200,000 .012 
2 52142147 600 300,000 .018 
3 - 52142123 640 320,000 .014 
4 53214215 960 480,000  .008 
5 23521425 1000 500,000 .006 
6 42321479 1040 520,000  .006 
7 20302541 1560 780,000  .006 
8 32524125 1600 800,000  .010 
9 


42152159 2600 1,300,000 .004 


' The proportions reported are the proportions of the 500 blocks 


‘which produced significant chi-square values when the probability 


of incorrectly rejecting the hypothesis of uniformity was set at .01. 
Thus there is considerable assurance that the numbers generated 
by the algorithm are rectangularly distributed. These findings also 
support the algorithm with respect to Yule’s [3] recommendation 
that block sums be compared with expectation. 


References 

1. Kendall, M.G., and Babington-Smith, B. Randomness and 
random sampling numbers. J. Royal Statist. Soc. 101 (1938), 
147-166. 

2. Pike, M.C., and Hill, I.D. Algorithm 266: Pseudo-random 
numbers. Comm. ACM 8 (Oct. 1965), 605. 

3. Yule, G. Udny. A test of Tippett’s random sampling numbers. 
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RANDOM NORMAL DEVIATE [G5] 

M. C. Prxe (Recd. 3 May 1965 and 6 July 1965) 
Medical Research Council, London, England 


procedure RN D(z1, 22, Random); 
real procedure Random; real <1, 22; 

comment ND uses two calls of the real procedure Random 
which is any pseudo-random number generator which will 
produce at each call a random number lying strictly between 0 
and 1. A suitable procedure is given by Algorithm 266, Pseudo- 
Random Numbers (Comm. ACM 8(Oct. 1965), 605] if one chooses 
a = 0, 6 = 1 and initializes y to some large odd number, such as 
13421773. RND produces two independent random variables x1 
and x2 each from the normal distribution with mean 0 and 
variance 1. The method used is given by Box, G.E.P., anp 
Mu.uuEr, M.E., A note on the generation of random normal 
Jeviates. |Ann. Math. Stat. 29 (1958), 610-611]; 

begin real /; 
xl := sqrt(—2.0 X In(Random)); 
t := 6.2831853072 X Random; 
comment 6.2831853072 = 2 & pi; 
a2 := 21 X sin(t); vl := x1 X cos(t) 

end RND 


Algorithm 121, NormDev [Comm. ACM 6 (Sept. 1962), 482; 8 
(Sept. 1965), 556] also produces random normal deviates and 
Algorithm 200, NORMAL RANDOM [Comm. ACM 6 (Aug. 1963), 
444; 8 (Sept. 1965), 556] produces random deviates with an approxi- 
mate normal distribution, but the procedure RND seems pref- 
erable to both of them. 

We may compare NORM AL RANDOM to RND (which is exact) 
by noting that at recommended minimum n NORMAL RANDOM 
requires 10 calls of Random while RND gets two independent 
normal deviates from 2 calls of Random and one call each of sqrt, 
In, sin and cos. Under the stated test conditions a single call of 
NORMAL RANDOM (with n = 10) took 20 percent more comput- 
ing time than a single call of RND when the real procedure Random 
was given by Algorithm 266. 

To compare NormDev to RND in the same way, we have first to 
calculate the expected number of calls of ln, sqrt, exp and Random 
?or each call of NormDev. This may be done by noting that there is 
(1) an initial single call of Random, then (2) with probability 0.68 
a random normal deviate restricted to (0, 1) has to be found and 
this requires on average 1.36 calls of Random and 1.18 calls of exp, 
and (3) with probability 0.32 a random normal deviate restricted to 
(1, ©) has to be found and this requires on average 2.04 calls of 
Random and 1.52 calls of each of In and sqrt. NormDev thus requires 
on average 2.58 calls of Random, 0.80 calls of exp, 0.49 calls of In 
and 0.49 calls of sqrt. (Note: NormDev requires one further call of 
Random if a signed normal deviate is required.) Under the stated 
test. conditions a single call of NormDev took virtually the same 
amount of computing time as a single call of RND when the real 
procedure Random was as above. 

(Note: In testing NormDev the procedure was speeded up by re- 
placing A by 0:6826894 wherever it occurred and removing it from 
the parameter list. In testing NORMAL RANDOM Mean, Sigma, 
n were replaced by 0, 1.0 and 10 respectively and removed from the 
parameter list.) 
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ALGORITHM 268 
ALGOL 60 REFERENCE LANGUAGE EDITOR [R2] 
W. M. McKereman* (Recd. 9 Dec. 1964, 23 Feb. 1965 and 
17 May 1965) 
Computer Science Department, Stanford University, 
Stanford, California 
* Supported in part by the Office of Naval Research under 
Contract Nonr 225(87), NR 044-211. 
The author expresses his thanks to the referee for several 
valuable suggestions. 


procedure Algoledit(characterset, 
string  characterset; 

integer  linelimit; 

comment If this procedure is presented an ALcou 60 program 
or procedure in the form of a sequence of basic'symbols, it will 
transmit to the output medium a copy of the text with indenta- 
tions between each begin-end pair and some rearrangement of the 
blank spaces within the text. This procedure is an example of its 
own output. It is used to edit ALGOL 60 text that is difficult to read 
because, for example, the ALGoL has been transcribed from 
printed documents, or written by inexperienced programmers, or 
stored in compressed form (i.e., with all redundant blank spaces 
removed). The integer ‘“—1’’ will represent the nonbasic symbol 
“carriage return’’, ‘“—2”’’ will represent an end-of-file mark, other 
symbols will have the integer value corresponding to their position 
in the parametric string ‘‘characterset’’. The string must contain 
exactly the 116 basic symbols of Aucou 60. The parameter ‘‘line- 
limit’? sets an upper bound on the number of basic symbols that 
the user wishes to appear on a line of output. The identifiers 
‘“Isq’’ and ‘‘rsq’’ will be used in place of strings of length one whose 


linelimit) ; 


only elements are ‘‘ ‘’”’ and ‘‘’ ’’, respectively; 

begin integer array spacesbefore, spacesafter[{1 : 116], 
buffer[1 : linelimit]; 
integer itabstop, symbol, i, symbolcount, level; 
Boolean = newline; 
integer procedure  val(s); 
string 5; 
comment The value of this procedure is the integer 


corresponding to the position in the string ‘‘characterset’’ 
of the symbol in the string ‘‘s’’. The body of the 
procedure must be expressed in code; 


procedure  get(symbol); 

integer symbol; 

begin insymbol(2, characterset, symbol); 
if symbol = — 2 then goto eof 

end get; 

procedure send(symbol); 

integer symbol; 

begin comment ‘“‘send’’ must not break identifiers 
across lines or insert spurious characters into 
strings; 
integer 1, U, 2; 
if symbol = ~1 V_~— symbolcount > linelimit 
then 
begin v :=  tabstop; 


if newline then goto £; 
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if level = 0 then 
begin comment Inside a string; 
for i := 1- step 1. until 
symbolcount do  outsymbol(1, 
characterset, buffer|t]); 
outsymbol(1, characterset, — 1); 
v:= 0 
end else 
begin wu := symbolcount; 
newline := true; 
if symbol = — 1 then goto D,; 


comment Find a convenient place to 
break the line; 
for u := symbolcount — 1 step — 


1 until 1 do if bufferfu + 1] = 
val(‘u’) \V buffer[u] = val(rsq) then 
go to D; 

u := symbolcount; 


comment Send the line; 


D:for 1:=1 step 1 until u do 
outsymbol(1, characterset, buffer|t]); 
outsymbol(1, characterset, — 1); 


comment Find a non-blank character 
to start the next line; 

for 7 := u+1 step 1 until 
symbolcount do if buffer(t] # val(‘u’) 
then goto F; 

go to G; 

comment Move a new line to the 
head of the buffer area; 

F : for i := 7 step 1 until 
symbolcount do 

begin v := 041; 


newline := false; 
buffer[v] := buffer[z] 
end; 
comment Insert blanks for tab stops; 
G :for i1:= 1 step 1 until 
tabstop do buffer[t] := val(‘u’) 
end; 
E : symbolcount := v 


end; 
comment Now we can put the new symbol in the 
buffer array; 


if symbol # —1 A -7 (newline / symbol 
= val(‘u’)) then 
begin symbolcount := symbolcount + 1; 
newline := false; 
buffer[symbolcount] := symbol 
end 
end send; 
for symbol := 1 step 1 until 116 do 
spacesbefore[symbol] := spacesafter[symbol] := 0; 
for symbol := val(‘+’), val(‘—’), val(‘-’), val(‘:’), 


val(‘:=’), val(‘<’), val(‘<’), val(‘=’), val(‘’), - 
val(‘>’), val(‘>’) do. spacesbefore[symbol] := 
spacesafter|symbol] := 1; 


0 
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for symbol := val(‘A’), ral(VV’), val(‘D’), val(‘=’), 
val(‘then’), val(‘else’), val(‘step’), ral(‘until’), 
val(‘while’), val(‘do’) do spacesbefore[symbol] := 


spacesafter[symbol| := 2; 
for symbol := val(‘go to’), 
ral(‘for’), val(‘procedure’), 
valCreal’), val(‘Boolean’), 
val (‘switeh’), val(‘lahel’), 
spacesafter|symbol] := 2; 
lerel := symbolcount := 
newline := true; 
nextsymbol : deblank : get(symbol); 
scanned : if symbol = cral(‘u’) V 
then goto = deblank; 
if symbol = val(‘begin’) then 
if symbol = valC‘end’) then 
begin tabstop := tabstop — 5; 
send( — 1) 


val (‘begin’), 
val (‘value’), 


val (‘string’), 


tabstop := 0; 


send( 


end, 


for 27 := 1 step Ll until spacesbefore| 
send(val (‘wu’) ); 
send(symbol) ; 
for «+ := 1 step 1. until 
send (val (‘u’)); 
if symbol = ral(‘comment’) then 
begin 
for 7 := 1 while symbol # val(; 
begin = get(symbol); 
send (symbol) 
end 
else if symbol = val(‘end’) then 
comment ‘end’? comments; 
for 7 := 1 while symbol # val(‘; 


yet(symbol) ; 


end 
begin 


begin 


val (‘integer’), 


spacesafter[symbol | 


val (‘it’), 


val(‘own’), 


do 


val (‘,’) 


symbol = — 1 


— 1) else 


symbol] do 


do 


comment Pass comments on unchanged; 


*) do 


*) do 


if symbol = val(else’) VV symbol = 


then 


val (‘end’) 
send (symbol) 


go to 


end 


end else if symbol = val(isq) then 

begin 

level := 1; 

for ¢ := 1 
get(symbol) ; 
send (symbol) ; 

if symbol = val(lsq) 

+ 1 else if 


then lerel := 


level # 0 do 


while 
begin 


then 


level ~ J 
end 

end; 

if symbol = val(‘begin’) 

symbol = ral(‘;’) 

go to nextsymbol; 

eof : send( — 1); 

outsymbol(1,  characterset, 

Algoledit 


tabstop 
send( 


then 


else if then 


3D) 


end 


level 
symbol = val(rsq) 


scanned; 


comment Pass strings on unchanged; 


:= lerel 


:= labstop + 5 


— 1); 


val(‘array’), 
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REMARK ON ALGORITHAL 268 [R2] 

ALGOL 60 REFERENCE LANGUAGE EDITOR 
(W. M. MeKeeman, Comm. ACA 8 (Nov. 1965), 667] 

CG. Saver (Reed. 23 Dec. 1968) 

Institut fiir Theoretische Physik der Justus-Liebig-Uni- 
versitét, 63 Giessen, West Germany 

KEY WORDS AND PHRASES: 

CR CATEGORIES: 4.49 


symbol manipulation 


In the procedure send, replace the line 

1 until 1 do if bufferfu+l1] = 
with the line 

1 until tabstop do if buffer[u+l] = (') 

The published version fails to clear the buffer when a line to be 
printed contains no blanks and tabsfop > 0, causing an array 
bounds violation. Knowing biuffer[fabstop+1] never to contain a 


blank character, the search for blanks may be stopped at u = 
fabstop + 1. 


(') The author is indebted to the referee for suggesting this brief 
form. 
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DETERMINANT EVALUATION [F3] 

JAROSLAV PFANN AND JOSEF STRAKA 
(Reed. 10 Sept. 1964 and 29 Dec. 1964) 

Institute of Nuclear Research, Rez by Prague, Czecho- 
slovakia 


real procedure determinant (A, n); array A; integer n; 
comment This procedure evaluates a determinant by triangu- 
larization with searching for pivot in row and with scaling of 
the rows of the matrix before the triangularization. This was 
done as in procedure EQUILIBRATE of the ‘Algorithm 135 
[Comm. ACM 6 (Nov. 1962), 553]; , 
begin real product, temp; integer 7, 7, 7, 8; 
array mult[1:n}; 
procedure EQUILIBRATE(A, n, mult); 
integer n; array A, mult; 
begin integer 7,7; real mz; 
for 7 := 1 step 1 until n do 
begin mz := 0.0; 
for j := 1 step 1 until n do 
if abs(A[t, j]) > mz then mz := abs(A[i, j]); 
if mz = 0.0 then 
begin determinant := 0; go to RETURN end; 
mult{t} := mz; comment := base f ex for exact scaling; 
if mz ~ 1.0 then 
for j := 1 step 1 until n do Afi, j] := Al[é, j]/mz; 
end 
end EQUILIBRATE; 
EQUILIBRATE(A, n, mult); 
product := 1; 
for r := 1 step 1 until n—1 do 
begin s := r; temp := abs(Al[r, 1r]); 
for j := r+ 1 step 1 until n do 
if temp < abs(A[r, j]) then 
begin temp := abs(Alr, j]); s := j end; 
if temp = 0 then begin determinant := 0; 
end; 
ifs ~ rthen 
begin product := — product; 
for 7. := r step 1 until n do : 
begin temp := Alt, rj]; Ali, r] := Ald, s];° 
Alt, s] := temp 
end 
end; 
product := product X Al[r, 7]; 
comment Be on guard against overflow or underflow here; 
for 1 := r+1 step 1 until n do 
begin temp := Ali, r]/Al[r, 7]; 
for 7 := r+1 step 1 until n do 
A(t, j] := Alt, 7] — Alr, j] X temp 
end 
end; 
temp := product X A[n, n]; 
for r := 1 step 1 until n do temp := temp X* mult [r]; 
comment Again danger of overflow or underflow; 
determinant := temp; 


go to RETURN 
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RETURN: 

end determinant 
REFERENCE: 

McKereman, W.M. Algorithm 135—Crout with equilibration and 
iteration. Comm. ACM 6 (Nov. 1962), 553. 


CERTIFICATION OF: 
ALGORITHM 41 [F3] 
EVALUATION OF DETERMINANT 
[Josef G. Solomon, Comm. ACM 4 (Apr. 1961), 171] 
ALGORITHM 269 [F3] 
DETERMINANT EVALUATION 
[Jaroslav Pfann and Josef Straka, Comm. ACM 8 
(Nov. 1965), 668] 
A. Brereson, (Recd. 4 Jan. 1966 and 4 Apr. 1966) 
Computing Lab., Sunderland Technical College, 
Sunderland, Co. Durham, England 


Algorithms 41 and 269 were coded in 803 ALGOL and run on a 
National-Elliott 803 (with automatic floating-point unit). 


The following changes were made: 

(i) value n; was added to both Algorithms; 

Gi) In Algorithm 269, since procedure HQUILIBRATE is only 
called once, it was not written as a procedure, but actually written 
into the procedure determinant body. 


The following times were recorded for determinants of order N 
(excluding input and output), using the same driver program and 
data. 


N T1 T2 
Algorithm 41 Algorithm 269 
(minutes) 

10 0.87 0.78 
15 2.77 2.18 
20 6.47 4.78 
25 12.47 8.99 
30 21.37 14.98 


From a plot of In(7;) against In(N) it was found that 
T, = 0.00104N?2-%2, 
Similarly, | 


T, = 0.00153N?-70 


From a plot of T; against T2, it was found that Algorithm 269 
was 30.8 percent faster than Algorithm 41, but Algorithm 41 
required less storage. 
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FINDING EIGENVECTORS BY GAUSSIAN ELIMI- 
NATION [F2] 

ALBERT NewuousE (Recd. 3 May 1965 and 16 July 1965) 

University of Houston, Houston, Texas 


procedure NULLSPACE (n, a, ec, eps); 
n,ec; realeps; arraya; 

comment NULLSPACE computes the vectors x of order n such 
that xa = z, where a is an nXn matrix, z is the zero-vector of 
order n, eps is a small positive number such that if the maxi- 
mum pivot element is numerically less than eps the procedure 
considers it zero. The ec vectors x are to be found in the first 
ec rows of the matrix @ upon exit from this procedure; 

comment In finding the eigenvectors z of an nXn matrix B 
after having found the eigenvalues \ of B by any of the many 
available methods, it is often desirable to start from the original 
matrix B and not from its transform from which the )\’s were 
obtained. Whereas the resulting eigenvectors will still be in- 
fluenced by errors in the d’s, the eigenvectors would not be 
influenced by errors in the transformed matrix. 

Since AJ — B = A isa singular matrix of rank r the problem is 
to find ec = » — r vectors x which form a basis of the left null 
space of A. 

Note: If the right null space is desired the matrix A should 
be transposed. 

The following algorithm finds these n—r linearly independent 
vectors by the Gauss-Jordan elimination in place using the 
maximal available element for the pivot. The process will termi- 
nate after r steps, since the maximal available elements for 
pivoting are then equal to zero. 

Now, replacing these zero pivot elements by unity, the rows 
of the matrix, from whieh no nonzero element has been chosen, 
are the basis of the null space of A, that is, if xz is such a row 
then <A = z, the zero vector of order n. 

The proof for this is established by the fact that the elimina- 
tion amounts to premultiplying B by a matrix A’, a product of 
elementary matrices, such that A’A is a matrix with ones on 
r of the diagonal positions and zeros everywhere else. 

Test results. A version of this procedure acceptable to the 
IBM 7094 (ALCOR-ILLINOIS 7090 ALGOL Compiler) was 
tested. 

With eps = 10~* the results for the 5X5 matrix 


1 2 3 4 6 
6 7 8 9 10 
Wl 12 138 14 15 
16 17 18 19 20 
21 22 23 24 25 


showed the dimension of the null space as 3 having as a basis 


value n, eps; integer 


a = (—.75, 1.00, 0.00, 0.00, —.25) 
t2 = (—.50, 0.00, 1.00, 0.00, — .50) 
Z3 = (—.25, 0.00, 0.00, 1.00, —.75) 
exact to 6 decimal places; 
begin integer array r, c[l:n]; 
real max, temp; 
for 7 := 1 step 1 until 2 do r[¢] := c[z] := 0; 
for m := 1 step 1 until n do 


integer 7, j,k, m, jj, kk, t; 
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begin maz := 0; 
for k := 1 step 1 until n do 
begin if r[k] # 0 then go to L else 
for j := 1 step 1 until n do 
if c[j7] = OA abs(a[k, j|) > max then 
begin kk := k; jj := j; max := abs(alk, j}) 
end 7 loop; 
L: end k loop; 
if max < eps then go to SORT; 
cljj) := kk; [kk] := 97; temp := 1/alkk, jj]; alkk, jj] := 1: 
for j := 1 step 1 until n do alkk, j] := afkk, j] X temp; 
for k := 1 step 1 until kk — 1, kk + 1 step 1 until n do 
begin temp := alk, jj}; alk, jj] := 9; 
for 7 := 1 step 1 until n do 


begin 
alk, j| := alk, 7] — temp X alkk, J]; 
if abs(alk, j]) < eps then a[k, 7] := 0 
end; 
end k loop; 


end m loop; 

SORT: for j := 1 step 1 until n do 
begin 
REPEAT: 
begin 

for k := 1 step 1 until n do 
if rik] = 0 then 
begin temp := alk, j]; 


if c[j] ~ OAj # clj] then 


alk, 7] := alk, efj]]; alk, clj]] := temp 
end & loop; 
t:= ef7]; elj] := clt]; clit] := ¢; goto REPEAT 
end; 
end conditional and j loop; 
ec := Q; 


for k := 1 step Il until n do 
if r[k] = 0 then 


begin ec := ec +1; afk, k] := 1; 
if ec ~ k then 
begin 
for j := 1 step 1 until do alec, j| := afk, 7} 
end; 


end conditional and k& loop; 
comment The first ec rows of the matrix @ are the vectors 
which are orthogonal to the columns of the matrix a; 
end NULLSPACE 
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ALGORITHM 271 
QUICKERSORT [M1] 
R. S. Scowen* (Recd. 22 Mar. 1965 and 30 June 1965) 
National Physical Laboratory, Teddington, England 

* The work described below was started while the author was 
at English Electric Co. Ltd, completed as part of the research 
programme of the National Physical Laboratory and is published 
by permission of the Director of the Laboratory. 


procedure quickersort(a, j); 
value j; integer j; array a; 

begin integer 1, k, q, m, p; 
lt{1 ln (abs (7) +2) /In (2) +0.01}; 

comment The procedure sorts the elements of the array a[1:7] 
into ascending order. It uses a method similar to that of QUICK- 
SORT by C. A. R. Hoare [1], i.e., by continually splitting the 
array into parts such that all elements of one part are less than 
all elements of the other, with a third part in the middle con- 
sisting of a single element. I am grateful to the referce for point- 
ing out that QUICKERSORT also bears a marked resemblance 
to sorting algorithms proposed by T. N. Hibbard [2, 3]. In par- 
ticular, the elimination of explicit recursion by choosing the 
shortest sub-sequence for the secondary sort was introduced by 
Hibbard in [2]. 

An element with value ¢ is chosen arbitrarily (in QUICKER- 
SORT the middle clement is chosen, in QUICKSORT a random 
element is chosen). ¢ and 7 give the lower and upper limits of 
the segment being split. After the split has taken place a value 
qg will have been found such that a[q] = t and afZ] < t < al/] 
for all J, J such that 7 < I <q < J < j. The program then 
performs operations on the two segments a[z:¢—1] and a[g+1:7} 
as follows. The smaller segment is split and the position of the 
larger segment is stored in the lt and wt arrays (/t and ut are 
mnemonics for lower temporary and upper temporary). If the 
segment to be split has two or fewer elements it. is sorted and 
another segment obtained from the /é and ut arrays. When no 
more scgments remain, the array is completely sorted. 

REFERENCES: 
1. Hoare, C. A. R. Algorithms 63 and 64. Comm. ACM 4 (July 

1961), 321. 

2. Hrpparp, Tuomas N. Some combinatorial properties of 


real ¢, 2; integer array ut, 


certain trees with applications to searching and sorting. 


J. ACM 9 (Jan. 1962), 18. 
3. ——. An empirical study of minimal storage sorting. Comm. 
ACM 6 (May 1963), 206-213; 
i:=m:=1; 
N:if 7-7 > 1 then 
begin comment This segment has more than two elements, 
so split it; 
p := (tt) + 2; 
comment p is the position of an arbitrary element in the 
segment al{i:j]. The best possible value of p would be one 
which splits the segment into two halves of equal size, thus 
if the array (segment) is roughly sorted, the middle ele- 
ment is an excellent choice. If the array is completely 
random the middle element is as good as any other. 

If however the array a[1:j] is such that the parts a[1:j+ 2] 
and a{j+2+1:j] are both sorted the middle element could 
be very bad. Accordingly in some _ circumstances 
p := (i+j) + 2 should be replaced by p := (i+3Xj) + 4 
or p’:= RANDOM(, j) as in QUICKSORT; 
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t := alp]; 
alp} := alt]; 
Q:= J; 


for k := i+ 1 step 1 until g do 
begin comment Search for an element alk] > ¢ starting 
from the beginning of the segment; 
if alk] > t then 
begin comment Such an a[k] has been found; 
for q := q step —1 until k do 
begin comment Now search for alq] < ¢ starting from 
the end of the segment; 
if alg) < ¢ then 
begin comment 


af{q] and a[k]; 


alg] has been found, so exchange 


x := alk]; 
alk] := alq]; 
alg] := 2; 
q:= 9-1; 
comment Search for another pair to exchange; 
go to L 
end 
end for q; 
q:=k-—1; 


comment g was undefined according to Para. 4.6.5 of 
the Revised Atcou 60 Report (Comm. ACM 6 (Jan. 
1963), 1-17]; 
go to M 
end; 
L: end fork; 
comment We reach the label M when the search going up- 
wards meets the search coming down; 
M: alt] := alq); 
alq] := ¢; 
comment The segment has been split into the three parts 
(the middle part has only one element), now store the 
position of the largest segment in the lt and wt arrays and 
reset 7 and j to give the position of the next largest segment; 
if2X q>i+jthen 
begin 


end; 
comment Update m and split this new smaller segment; 
m:= mt+i; 
go to N 

end 

else if 1 > 7 then 

begin comment This segment has less than two elements: 
go to P 

end 

else 

begin comment This is the case when the segment has just 

two elements, so sort a[z] and a[j] where 7 = i + 1; 

if a[z] > a[j] then 
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begin 
az := alt]; 
alt} := aly]; 
alg| 7= x 
end; 


comment If the lt and ut arrays contain more segments 
to be sorted then repeat the process by splitting the smallest 
of these. If no more segments remain the array has been 
completely sorted; 
P. m:= m-1; 
if m > 0 then 
begin 
a := It{m); 
j = ut[m]; 
go to N 
end; 
end 
end qutckersort 


CERTIFICATION OF ALGORITHM 271 (M1) 

QUICKERSORT [R. 8. Scowen, Comm. ACM 8 (Nov. 
1965), 669] 

CuHaRLEs R. Buiarr (Reed. 11 Jan. 1966) 

Department of Defense, Washington, D.C. 
QUICKERSORT compiled and ran without correction through 

the ALDAP translator for the CDC 1604A. Comparison of 

average sorting times, shown in Table I, with other recently pub- 

lished algorithms demonstrates QUICKERSORT’s superior per- 


formance. 


Algorithm 201 Algorithm 207 Algorithm 245 Algorithm 271 


Nuinber Shellsort Stringsort Treesort 3 Quickersort 
of items | = ie Bee be te ee Se - 
{ntegers | Reals | Integers | Reals | Integers | Reals | Integers | Reals 
10 | 0.01 | 0.01 0.03 | 0.08 0.02 | 0.02 0.01 | 0.01 
20) 0.02 | 0.02 0.05 | 0.05 0.04 | 0.04 0.02 | 0.02 
50 | 0.08 ; 0.08 0.20 | 0.20} 0.11 | 0.12 0.06 | 0.06 
100 |} 0.19 } 0.22 0.39 | 0.40 | 0.26 | 0.27 0.13 | 0.13 
200 0.48 | 0.58 1.0 1.1 0.59 | 0.62 | 0.28 | 0.30 
500 | 1.5 | 1.7 2.8 2.9 1.7 1.8 0.80 | 0.85 
1000 3.7 4.2 6.6 6.9 3.7 4.0 1.8 1.9 
2000 | 9.1 /10. 18. 14 8.2 8.7 3.9 4.1 
5000 | 27. 130. | 40. {41 23. 24 11. 12. 
10000 | 65 172 938. 97 49 52 23 25 
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ALGORITHM 272 
PROCEDURE FOR THE NORMAL DISTRIBUTION 
FUNCTIONS* [S15] 
M. D. MacLaren 
(Recd. 28 July 1964, 17 Nov. 1964 and 26 July 1965) 
Argonne National Laboratory, Argonne, Ill., and Boeing 
Scientific Research Laboratories, Seattle,’ Wash. 
* Work performed in part under the auspices of the US Atomic 
Energy Commission. 
real procedure phi(a, k); value a, k; real a; integer k; 
comment Before use, this procedure must be called once with 
k = 8 to initialize own variables. Thereafter for k = 1 the 
procedure gives 


if a 
&(a) = al exp (—#/2) dt, 


and for k = 2 it gives 
&*(a) 


2(@(| a|) — .5) 


2\? piel 
7 [ exp (—?/2) dt; 
T. 0 

begin own integer JN; 


own real B, EPS, EPS2, EPS83, ONE, DELTA, DELT AQ, PI2; 

comment &*(a)is computed by Taylor’s series expansion in the 
interval 0 < a < B, and by asymptotic series in the interval 
B <a. The Taylor’s series expanson is made about one of the 
points 0, B/N, 2B/N, --- , B, and the coefficients in the series 
are computed using the recursion formula for' Hermite poly- 
nomials. The number of terms to take in the series is deter- 
mined by an error estimate based on a majorizing series. This 
procedure, which is essentially the familiar one of interpolat- 
ing in a stored table of values, gives a fast program and can be 
used effectively for many functions. In this case another sig- 
nificant increase in speed could be obtained by also storing a 
table of values of the first derivative cf é*. The own variables 
B, EPS and N might be called program parameters. By suit- 
ably choosing their values the programmer may make the 
procedure as accurate as desired and may increase the speed 
of the procedure at the cost of extra storage space. This is the 
advantage of this procedure over others previously published 
in this journal (see [1-4]). 

The values of these program parameters are determined 
when the procedure is coded, not when it is called. They are 
set by means of an initializing call with k = 3. The other own 
variables are computed from B, EPS and N when the initializ- 
ing call is made. If Forrran IV were used, all the own vari- 
ables could be set by use of a DATA statement, An alternative 
to either method is to replace all occurrences of the parameters 
by the appropriate constants. 

The choice of the parameter N depends mainly on speed 
versus storage considerations. The larger N is, the faster the 
procedure will be and the more storage will be needed. Note, 
however, that N must be chosen large enough so that 
B?(1/(2N) + 1/(4N?2)) < 1, for otherwise the method of estimat- 
ing the error in the Taylor’s series may fail. The choice of B 
may also affect the speed, because for smaller values uf a the 


ll 
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asymptotic series for 6*(a) will take longer than the Taylor’s 
series. The choice of B depends, however, mainly on the error 
desired. Neglecting roundoff, the maximum error in the com- 
puted value of &*(a) will be HPS if a < B or max (EPS, 5(a)/2) 
if B < a, where 6(a) is the absolute value of the smallest term 
in the asymptotic series for *(a). Some values of (a) are: 
5(4) = 3.0 & 10-8, 5(5) = 3.0 X 107, 6(5.5) = 1.4 & 10-4, and 
6(6) = 4.4 & 10717. If N is large enough, roundoff will be no 
problem. (The referee has pointed out that the computation 
for B < a could be made by continued fractions, as in Algo- 
rithm 180. The advantage of this would be that the continued 
fraction expansion converges for all a > 0, but roundoff errors 
may be significant for smaller values of a.) 

With the program parameters having the values given 
below, the procedure was compiled as a ForTRAN IIT subroutine 
on the IBM 1620, using eight-digit floating point arithmetic, 
and tested for many values of a. The error never exceeded 
2 X 10-8. The program was also compiled with B = 6.0, HPS = 
2 X.107'5 and N = 60, using 15 digit arithmetic. Spot checks 
turned up no errors greater than 2 X 1075; 

own real array C([0:16]; 

comment The array C must give the value of ®*(a) at the 
point of expansion, i.e., C[m] must equal 6*(mB/N). Tables of 
©*(a) to fifteen decimal places are published by the National 
Bureau of Standards [5]. The upper bound for the array must 
equal the value of the program parameter NV; 

real f, f1, f2, x, y, z, t, #2, zt; 

integer ™; 

real procedure max(z, y); value x, y; realz, y; 

begin max := ifx < y then y elsez; 

end maz; 

if k = 3 then 

begin comment initialize own variables; 
EPS := .00000002; B:= 4.0; N := 16; C{0] := 0.0: 

C{l] := .19741265; 


C[2] := .88292492; C[3] := .5467530; 
C[4] := 68268049; 

C[5] := .78870045;  C[6] := .86638560; 
C[7] := .91988169; 

C(8] := .95449974; C9] := .97555105; 
C[10] := .98758067; 

C{11] := .99404047;  C[12] := .99730020; 
C[13] := .99884595; 

C[14] := .99953474; C[15] := .99982317; 


C{16] := .99993666; 

ONE := .99999999; 

comment ONE is the largest number less than 1 which may 
be represented in the machine. This prevents loss of ac- 
curacy in some implementations of floating point sub- 
traction; 

PI2 := .797884560802865 ; 

comment PI2 = (2/r)'?; 

DELTA := B/N; 

DELT A2 := 6 X DELTA; 

EPS3 := 2.0 X EPS; 

2 := max(BXDELTA, sqrt(2.0) X DELT A2); 
= DELTA2 X (B+DELTA2); 

za:= (t+sqrt(t)) X exp(.bXo); 
:= 12 & (1.04#2) & exp(.5X#2 T 2); 

ee <1Ay<2zthen EPS2 := EPS/y else EPS2 := EPS/z; 


0 
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phi := 0 
end initialization 
else 


begin comment compute &(@); 
y := abs(a); 
ify > B then 
begin comment computation by asymptotic series; 
ei=y 72; f:= PI2X% exp(—.5Xz)/y; 
xi 1.0/z; 2:=f;fl:= —f X22; 
for m := 3,m + 2 while abs(f1) < abs(f) do 
begin z:=z+f1; f:=fl; fl:= —-flxmxa; 
ifabs(f{) < EPS3 then goto L1 


end; 
LA: z2:= ONE —z2+ .5xXf 
end asymptotic computation 
else 


begin comment Taylor’s scries computation; 
m := entier (y/DELT A); 
xi=mMX DELTA; t:=y-2; 
if DELTA2 < tthen 


beginm:=m-+1; x:=x2+ DELTA; t:=y—xend; 
ati=x2Xt; W:=t 7 2; 

fl:=tX PIl2 X exrp(—.5Xx T 2); 

f2:= —.5X xt X fl; 


z:= C[m]+fl+fa; 


for m := 3, m+ 1 while (n—1) X EPS2 < maz(abs(f1), 
abs(f2)) do 
begin 
fi= (—atXf2—t2X (m—2)Xf1/(m—1))/m; 
Zo eb fy fl t= 7/2) f2= 7: 
end 
end Taylor’s series computation; 
ifk = 1 then 
begin 
z:=ifO<athen 5+ .5X zelse .5— 5X z 
end; 
phi := 2 
end computation 
end phi 
REFERENCES: 


1, Crawrorp, M., ann Tecno, R. Algorithm 123, Real error 
function, ERF (x). Comm, ACM 6 (Sept. 1962), 482. 

2. THacHEer, H.C., Jr. Algorithm 180, Error function—large X. 
Comm. ACM 6 (June 1963), 314. 

3. InppEtson, D. Algorithm 209, Gauss. Comm. ACM 6 (Oct. 
1963), 616. 

4. Crvin, S. J. Algorithm 226, Normal distribution function. 
Comm. ACM 7 (May 1964), 295. 

5. NATIONAL BUREAU OF STANDARDS. Tables of Normal Probability 
Functions. Applied Math. Series, No. 23, US Government 
Printing Off., Washington, D.C., 1953. 


REMARKS ON: 

ALGORITHM. 123 [S15] 

REAL ERROR FUNCTION, ERF(z) 
[Martin Crawford and Robert Techo Comm. ACM 6 
(Sept. 1962), 483] 


ALGORITHM 180 [815] 
ERROR FUNCTION—LARGE X 
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ALGORITHM 181 [S15] 
COMPLEMENTARY ERROR FUNCTION— 
LARGE X 

[Henry C. Thacher Jr. Comm. ACM 6 (June 1963), 315] 


ALGORITHM 209 [S15] 
GAUSS 
[D. Ibbetson. Comm. ACM 6 (Oct. 1963), 616] 


ALGORITHM 226 [S15] 
NORMAL DISTRIBUTION FUNCTION 
(S. J. Cyvin. Comm. ACM 7 (May 1964), 295] 


ALGORITHM 272 [S815] . 
PROCEDURE FOR THE NORMAL DISTRIBUTION 
FUNCTIONS 

[M. D. MacLaren. Comm. ACM 8 (Dec. 1965), 789] 


ALGORITHM 304 [S15] 

NORMAL CURVE INTEGRAL 
[I. D. Hill and S. A. Joyce. Comm. ACM 10 (June 
1967), 374| 


I. D. Hitt anv 8. A. Joyce (Recd. 21 Nov. 1966) 

Medical Research Council, 

Statistical Research Unit, 115 Gower Street, London 
W.C.1., England 


These algorithms were tested on the ICT Atlas computer using 
the Atlas ALGou compiler. The following amendments were made 
and results found: 


ALGORITHM 123 

(i) value x; was inserted. 

(ii) abs(T) < »w—-10 was changed to Y— T= Y 
both these amendments being as suggested in [1]. 

(iii) The labels 1 and 2 were changed to Li and L2, the go to 
statements being similarly amended. 

(iv) The constant was lengthened to 1.12837916710. 

(v) The extra statement xz := 0.707106731187 * x was made 
the first statement of the algorithm, so as to derive the 
normal integral instead of the error function. 

The results were accurate to 10 decimal places at all points 
tested except z = 1.0 where only 2 decimal accuracy was found, as 
noted in [2]. There seems to be no simple way of overcoming the 
difficulty [3], and any search for a method of doing so would 
hardly be worthwhile, as the algorithm is slower than Algorithm 
304 without being any more accurate. 


ALGORITHM 180 
(i) JT := —0.56418958/z/exp(v) was changed to 
T := —0.564189583548 X exp(—v)/xz. This is faster and also 
has the advantage, when »v is very large, of merely giving 0 
as the answer instead of causing overflow. 
(ii) The extra statement x := 0.707106781187 K x was made 
as in (v) of Algorithm 123. 
(iii) form := m-+1= waschanged to form := m+2. m+1 
is &@ misprint, and gives incorrect answers. 
The greatest error observed was 2 in the 11th decimal place. 


ALGORITHM 181 
(i) Similar to G) of Algorithm 180 (except for the minus sign). 
(ii) Similar to (ii) of Algorithm 180. 
(iii) m was declared as real instead of integer, as an alternative 
to the amendment suggested in [4]. 
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The results were accurate to 9 significant figures for z < 8, 
but to only 8 significant figures for z = 10 and x =. 20. 


ALGORITHM 209 
No modification was made. The results were accurate to 7 decins 
places. 


ALGORITHM 226 
(i) 10 fT m/(480Xsqrt(2X3.14159265)) was changed to 
10 fT m X 0.000831129750836. 
(ii) for 7 := 1 step 1 until 2 X » do was changed to 
m:=2Xn; fort := 1 step 1 until m do. 
(iii) —(¢Xb/n) T 2/8 was changed to —(iXb/n) T 2 X 0.125. 
(iv) if7 =2X* 2-1 waschanged to ift = m-—1 
(v) b/(6XnxX sqrt(2X3.14159265)) was changed to 
b/(15.0897696478X n). 
Tests were made with m = 7 and m = 11 with the following 
results: 


Number of significant Number of decimal 
7 jigures correct places correct 
m= 7 m= Il m = 7 m= ll 
—0.5 i 11 7 Il 
—-1.0 7 10 7 10 
—1.5 7 10 8 10 
—2.0 7 9 8 10 
—2.5 6 9 8 11 
—3.0 6 7 8 9 
—4.0 5 7 10 11 
—6.0 2 1 12 10 
—8.0 0 0 11 9 


Perhaps the comment with this algorithm should have referred 
to decimal places and not significant figures. To ask for 11 sig- 
nificant figures is stretching the machine’s ability to the limit, 
and where 10 significant figures are correct, this may be regarded 
as acceptable. 


ALGORITHM 272 
The constant .99999999 was lengthened to .9999999999. 

The accuracy was 8 decimal places at most of the points tested, 
but was only 5 decimal places at x = 0.8. 


ALGORITHM 304 \ 
No modification was made. The errors in the 11th significant figure 
were: 


abs (x) x > 0 = upper x > 0 + upper 
0.5 1 1 
1.0 1 2 
1.5 21°(5) 2 
2.0 25° (0) 4 
3.0 0 0 
4.0 2 3 
6.0 6 0 
8.0 14 0 
10.0 23 0 
20.0 35 0 
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® Due to the subtraction error mentioned in the comment section 
of the algorithm. Changing the constant 2.32 to 1.28 resulted in 
the figures shown in brackets. 


To test the claim that the algorithm works virtually to the 
accuracy of the machine, it was translated into double-length 
instructions of Mercury Autocode and run on the Atlas using the 
EXCHLF compiler (the constant being lengthened to 
0.398942280401432677939946). The results were compared with 
hand calculations using Table II of [5]. The errors in the 22nd 
significant figure were: 


abs (x) x > 0 = upper x > 0 # upper 
1.0 2 3 
2.0 7 1 
4.0 2 0 
8.0 8 0 


Timings. Timings of these algorithms were made in terms of 
the Atlas “Instruction Count,’’ while evaluating the function 100 
times. The figures are not directly applicable to any other com- 
puter, but the relative times are likely to be much the same on 
other machines. 


Instruction Count For 100 EVALUATIONS 


Algorithm number 


abs(x) L : Ret 
123 180 | 181 209 226 272 3042 3045 
m= 7 
0.5 58 8 97 24 25 24 
1.0 65¢ 8 176 24 29 29 
1.5 164 128 127 9 273 25 35 35 


2.0 | 194 78 90 8 387 24 39 3u 
2.5 | 252 54 68 10 515 24 131 44 
3.0 


0. 42 51 9 628 25 97 50 
4.0 27 39 9 9004 | 25 67 44 
6.0 15 30 6 14004 | 16 49 23 
8.0 9 28 7 21004 18 44 1] 

10.0 10 25 5 27004 16 38 1] 
20.0 9 22 5 65009 16 32 1] 
30.0 9 9 5 } 109004 16 11 1] 


® Readings refer to x > 0 = upper. 
»b Readings refer to x > 0 # upper. 
* Time to produce incorrect answer. A count of 120 would fit a 
smooth curve with surrounding values. 

4100 times Instruction Count for 1 evaluation. 


Opinion. There are advantages in having two algorithms 
available for normal curve tail areas. One should be very fast and 
reasonably accurate, the other very accurate and reasonably 
fast. We conclude that Algorithm 209 is the best for the first 
requirement, and Algorithm 304 for the second. 

Algorithms 180 and 181 are faster than Algorithm 304 and may 
be preferred for this reason, but the method used shows itself in 
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Algorithm 181 to be not quite as accurate, and the introduction 
of this method solely for the circumstances in which Algorithm 
180 is applicable hardly seems worth while. 


Acknowledgment. Thanks are due to Miss I. Allen for her 
help with the double-length hand calculations. 
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REMARK ON ALGORITHM 272 

PROCEDURE FOR THE NORMAL DISTRIBUTION 
FUNCTIONS [S15] [M. D. MacLaren, Comm. ACM 8 
(Dec. 1965), 789] 

M. D. MacLaren (Reed. 26 Dec. 1967) 

Argonne National Laboratory, Argonne, Ill. 60439 


KEY WORDS AND PHRASES: normal distribution function, 
error function, normal function, normal curve integral 
CR CATEGORIES: 5.5, 5.12 


In (1] Hill and Joyce report that the value produced by 
Algorithm 272 for the argument a = 0.8 is correct only to 5 decimal 
places, although the algorithm specifies an accuracy of 2 X 10-8. 
Upon checking we have found that the source of this inaccuracy is 
a typographical error in the section beginning ‘‘begin comment 
initialize own variables;”’ The statement initializing C[3] should 
be changed to “‘C[3] = .54674530.”’ With this change the published 
algorithm is, as far as we know, accurate within the specified error 
limit of 2 * 107°. 

In the first comment of the algorithm the lower limit of the first 
integral should be minus infinity and not merely a minus sign. 

REFERENCE: 

1, Hitz, I. D., ann Joyce, 8. A. Remark on algorithm 123. 
Comm. ACM 10 (June 1967), 377. 
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ALGORITHM 273 

SERREV [C1] 

Henry C. THacuer, Jr. (Recd. 2 Apr. 1965) 
Argonne National Laboratory, Argonne, Illinois 
(Work supported by the US Atomic Energy Commission.) 


procedure SERREV (A, B, C, N); 
value N; integer N; array A, B, C; 

comment This procedure produces in the array C the coefficients 
of the power series yi = )_ 4; C,,;2', where y is the solution of 


N N ; 
fy) = »» Avy = g(x) = 2 Bix 


and A, = 1. The arrays A and B are linear, with bounds 1 and 
M 2 N. The array C' is square, with bounds 1:M, 1:M. Ele- 
ments above the diagonal are not used. The derivation of the 
method is given in [1]; 
begin integer J, J, K, LIM; real T; 
for J := 1 step 1 until N do 
begin for J := J—1 step —1 until 1 do 
begin T := 0; LIM := I-—J; 
for K := 1 step 1 until LIM do T := C[K,1] x ClUI-K,J/] 
+7; Ci,J+1):=T7 
end for J; 
T := Bi); 
for J := 2 step 1 until J do T := T-—Al) X ClU,J/); 
Ci,1] := 7 ; 
end for I 
end 
REFERENCE: 
1. THacuer, H.C., Jr. Solution of transcendental equations by 
series reversion. Comm. ACM 9 (Jan. 1966), 10-11. 
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ALGORITHM 274 
GENERATION OF HILBERT DERIVED TEST 
MATRIX [F1] 
J. Booturoyp (Recd. 19 May 1965 and 27 Aug. 1965) 
University of Tasmania, Hobart, Tas., Australia 
procedure festmz(a,n); value n; integer n; array a; 
comment T.J. Dekker, “Evaluation of Determinants, Solution 
of Avstems of Linear Equations and Matrix Inversion”’ [Rep. 
No. MR638, Mathematical Centre, Amsterdam] describes a test 
matrix W{lin, lin] with the following propertics: 
(a) elements AZ[z,7] are positive integers, 
(b) the inverse has elements (—1) fT (+7) X M{t,7], 
(ec) the degree of ill-econdition increases rapidly with increas- 


ing n. 
Such matrices may be formed by AZ = FG UHG where F is a 
diagonal matrix diag(fi) with ft = factorial (n+1—1)/(factorzal 


(a—1372)/factorial(n—1), H is the order n segment of a Hilbert 
matrix and G is diagonal, diag (gi), with gi derived from the prime 
decomposition of fi by: 

a pip” ses pkmk, : pkmke?, 
This procedure forms matrices a[{l:n, 1:n] of this type and fol- 
lows Dekker in principle but not in detail. Factorials are avoided 
by evaluating the f7 with a recursion sequence 


gi = nee po a 


fl} := nm, fle+t) := fle] X (nf2—-772) + 72 


@=1, 2, aca ft); 
permitting the exact computation of fz for much larger n than 
would otherwise be possible. In the evaluation of expressions 
of the form (ab) + c, where the result is integral but ¢ is not 
a factor of cither a or 6, numcrator integer overflow is avoided 
by the simple device 


expression := q X 6 + (rxXb) +c where a=qX cr. 


Test matrices for 2 < n S 15 have been computed on a machine 
with » 39-bit integer register. During tests of the procedure the 
specification of the array paramcter was changed from real to 
integer and the results checked by matrix multiplication using 
an exact double precision integer inner-product routine. The 
unit matrix was obtained in all cases. As real arrays these 
matrices will find use. only for values of m such that all integer 
elements have an exact floating point representation. For 
10 =< » S 15 the values of the elements of largest modulus are: 


n Aft, 7}max 

10 1616615 
{1 49884120 
J2 108636528 
13 490804314 
14 1859890032 
15 22096817600; 


begin integer 7, j, k, ft, gi, d, ¢, r; Boolean even; 


integer array f, g{l:n]; 


comment First we compute F = diag(ft); 
foi=flll:=n; jie nXn; 
for 7 := 1 step | until n—1 do 
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hbegind := 1X1; k:= j—d; 
giz fti+d; r:= fi-qxXd; 
fli+l] := fi:=qxXk+ (xk) +d 

end; 

comment And now, using a modified prime factors algorithm 
to obtain @ = diag(gi), we compute FG, whose clements re- 
place those of F; 


for 7 := 1 step 1 until n do 

hegind := gi:=1; q:=ft:= fli; jg := 2; 
new]: even := false; 
next: if g 2 j then 


begin g := ft + J; 
if fc #~qX7 then 


begin 7 := j+d; d:= 2; go lo neuj end; 


if even then gt := gt X Jj; even := - even; 
ft :=q; goto next 
end; 
gli] := gts fli} = flr] + ge 
end; 
comment Finally, in one operation (/G!)HG where H is a 
nonexistent Tlilbert matrix whose reciprocal elements, 
i-+j—l, are computed as we go; 
for) := 1 step 1 until x do 


begin fi := f{iJ; 
for 7 := 1 step 1 until n do 
begin gi := gly]; & t= t4+7-1; 


qis fi +k; r:i=fi-—qxXk; 
at, jl}:=@X git (@Xgi) +k 
end 
end 


end tesimz 


REMARK ON ALGORITHALI 274 [F1] 
GENERATION OF HILBERT DERIVED 
MATRIX [J. Boothroyd, Comm. ACAI 9 (Jan. 1966), 11] 
J. Boornroyp (Reed. 7 Jan. 1969) 

University of Tasmania, Hobart, Tasmania, Australia 
KEY WORDS AND PHRASIES: 
CR CATEGORIES: 5.14 
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test matrix, Hilbert matrix 


An alternative, simpler, and more efficient procedure for gen- 
erating test matrices having the same properties as those gen- 
erated by Algorithm 274 is given below. The method, like that of 
Algorithm 274, is due to T. J. Dekker and may be described as 
follows. 

The elements of the inverse of a segment of a Hilbert matrix 
are given by 


(HH) = (-1)"9 & fi & F/G +7 - 1) 
where 


fi = factorial (n +1 — 1)/(factorial (@ — 1)) Tf 2/factorial (n — 7). 


COLLECTED ALGORITHMS (cont.) 


The f; may be factored as fi = fia X fiz, in which 


+i- | n—- 1 
ja = (* i- 1 \ xn, jel =). 


Test matrices 7 are constructed by T = D\HD, where Di = 
diag (fn), D2 = diag (fiz), and H is the Hilbert matrix segment 
Hy; = 1/@ + j — 1). It may be seen that this is equivalent to 
defining the 7’ matrices by: 


Tis = GOGD/G +7 —Y, 
; ama | 
we(*tizt}xn a= (7)): 


with fi, f7 given by the recurrence relations: 
(fii =n, (finn = (ft)i X (mn + t)/1, 
G1 = 41, fin = Gi)i K (n = 5/7. 


That the condition K(T) of these matrices is severe may be 
seen from an observation of the referee, who notes that 


K(T) = || Tix | TOU, 
> (max ¢:,;)12 = (transi) + 2)72 ~ (278n/138n) 12, 
L<iicn 

where || -- || is the Za , Le , L,, , or the Euclidean matrix norm. 


Other properties of these matrices shared by those of Algorithm 
274 are: 

(a) Each matrix has unit determinant; 

(b) The eigenvalues form a set A1 ,A2, °°: , 1/d2 , 1/1 , So that 
odd order matrices have one eigenvalue of unity. 

The procedure testmz1 below has been tested on an Elliott 
503 (positive integer word length of 38 bits) and matrices of all 
orders up to 13 were generated before integer overflow occurred 
with n = 14. 


procedure testmzl (a, n); value n; integer n; array a; 
comment generates in a[l :n, 1 : n] test matrices with integer 
elements given by 


nba n—-l1 . . 
is = ( i-1 \xax(*7t\ari-n 


and such that the elements of J’ inverse are (—1)**7 X t:,; . 

To determine for a particular computer that limit on » which 
permits the exact machine representation of all elements of 
these matrices, the following maximum values are listed: 


n t;,; (max) 
8 163800 
9 1178100 
10 8314020 
11 61108047 
12 440936496 ; 
begin 
integer 7, j, fi, fj, tless1; 
fi := n; tlessl := 0; 
for 1 := 1 step 1 until n do 
begin 
fg := 1; 
for j := 1 step 1 until n do 
begin 


alt, j] := (ftxf7) + (ilessl+jJ); 
Spe Nag) AI) 9 


end; 


274 P2 O 
fi := ((nti)Xfi) + 1; tlessl := 1 
end 
end testmzl 


Proofs that the test matrices described above have integer 
elements and checkerboard inverses follow the lines of similar 
proofs given in [1]. 

Acknowledgments: Thanks are due to T. J. Dekker for com- 
municating details of this method and to the referee for the con- 
tribution mentioned. 


REFERENCE: 

1, Dexxer, T. J. Evaluation of determinants, solution of sys- 
tems of linear equations and matrix inversion. Rep. No. 
MR63, Mathematical Centre, Amsterdam, June 1963, pp. 
8 and 9. 
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ALGORITHM 275 

EXPONENTIAL CURVE FIT [E2] 

GERARD R. Detny (Recd. 27 July 1964 and 16 Apr. 1965) 
US Department of Defense, Washington, D. C. 

(Now with HRB-Singer, Inc., State College, Pa.) 


procedure EX PCRVFT (a, b,c, E squared, n, x, y, epsilon, | maz, 
flag) ; 
integer n, 1 maz, flag; 
real a, b, c, E squared, epsilon; 
real array 2, y; 
comment This algorithm will fit a curve defined by the equation 
y=aX exp(bX xz) + ctoaset {x , y;} of m data points. The 
Taylor series modification of the classical least squares method 
is utilized to approximate a solution to the system of nonlinear 
equations of condition. After every iteration, the statistic E 
squared is computed as a measure of the goodness of fit. Com- 
mencing with the second iteration, the successive values of # 
squared are differenced, and when the difference in absolute 
value becomes less than epsilon, the calculations cease. If the 
number of iterations necessary to achieve this result exceeds 
| maz, a flag is set to a nonzero value and the procedure is termi- 
nated; 
begin 
integer 2, 1, m; 
comment Computation of initial estimates follows; 
b = 2X In(abs(((yln] — y[n—1]) X (2[2] — 2[1]))/ 
((y[2] — y[1]) X (ln] — 2[n—1]))))/ 
(z[n] + z[n—1] — 2[2] — xf1]); 
a := (y[n] — y[n—1])/(@ln] — z[n—1]) 
xX exp((6 X (a[n] + 2[n—1]))/2) X b); 
m:= (n+1) + 2; 
c i= ylm] — a X exp(b X x[m}); 
E squared := 0; 
for 7 := 1 step 1 until n do 
E squared := E squared + (y[t] — c — a X exp(b X 2[2}))12; 
comment Computation of corrections follows; 
for 1 := 1 step 1 until / maz do 
begin 
real sumex1, sumex2, sumxtex1, sumaxiexs2, sumxi2er2, sumyt, 
sumytexl, sumayier|, dil, d12, d13, d22, d28, d33, e1, e2, ¢3, 
deltall1, delta12, deltal3, delta22, delta23, delta33, delta, u, v, w, 


save; 
sumecl := sumex2 := sumaiexl := sumxiex2 := sumxi2der2 := 
sumyt := sumytexl := sumzyierl| := 0; 
for 7 := 1 step 1 until n do 
begin 


real ez1, ex2, xiexl, xiex2, xi2ex2; 
exl := exp(b X x[t]); 


ex2 := exlf2; 

ziexl = := aft] X exl; 

ziex2 := ali] X ex2; 

wi2ex2 := aft] & xiex2; 

sumexl := sumexl + ez1; 
sumez2 := sumex2 + ex2; 
sumazierl := sumxiexl + xiezl; 
sumaiex2 := sumaxier2 + xiex2; 


sumai2ex2 := sumaxi2ex2 + xi2ex2; 
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ll 


sumyt sumyi + yi}; 
sumyiexl := sumyiexl + y(t] X exl; 
sumxytexl := sumaxyierl + y[t] X xiexl; 
end computation of sum terms in normal equations; 
dll := sumex2; 
d12 := sumaiex2 X a; 
di3 := sumexl; 
d22 := sumxi2ex2 KX a2; 


d23 := sumaiexl X a; 

d33 := n; 

el := — sumex2 X a — sumexl X c + sumyierl; 

e2 := — sumxiex2 X at2 — sumaiexl X cXat 
sumxyiexl X a; 

es := — sumetl Xa—nXect+ sumyi; 


deltall := d22 X d33 — d23 72; 
deltal2 := d13 X d23 — dl2 X 433; 
deltal3 := d12 * d23 — d13 X d22; 
delta22 := dll X d33 — d13 72; 
delta23 := di2 X d13 — dll X d23; 
delta33 := dil XK d22 — di27 2; 
delta := dll X deliall + dl2 X deltal2 + d13 X delial3; 
u := (el X deltall + e2 X delial2 + €3 X deltal3) /delta; 
v := (el X& deltal2 + e2 X delia22 + €3 X delta23)/delta; 
w:= (el X deltal3 + e2 X delta23 + e3 X delta33)/delia; 
@a:=at+u; 
b :=b+y; 
Cc = e+ U; 
E squared := 0; 
for 7 := 1 step 1 until n do 
E squared := E squared + (y[i] —¢ — a X exp(b X alt})) F 2; 
if ] = 1 then go to retry; 
if abs(save — E squared) < epsilon 
then go to 73 
else if 1 < 1 maz 
then go to retry 
else go to unfurl; 


retry: save := E squared; 


end computation of corrected values of a, b, and c; 


unfurl: flag := 1; 
73: end least squares curve fit to y = a X exp(b X 2) +e 
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CONSTRAINED EXPONENTIAL CURVE FIT [E2] 


GERARD R, Detny (Recd. 27 July 1964 and 16 Apr. 1965) 
US Department of Defense, Washington, D.'C. 
(Now with HRB-Singer, Inc., State College, Pa.) 


procedure CSXPCVFT (a,b,c, E pada n, 2 ¥, k, 2, pal 
Ll max, flag, jump); 
integer 1n,k,1 maz, flag, jump; . 
real a,b,c, E squared, z, scalar 
real array 12, Y; 

comment This algorithm will fit a curve defined by the equation 
y=aX ep Xx) +c toaset {x;, ys} of n data points, and 
constrain the curve so it contains the point (2, , z). The Taylor 
series modification of the classical least squares. method is 
utilized to approximate a solution to the system of nonlinear 
equations of condition. After every iteration, the statistic E 
squared is computed as a measure of the goodness of fit. Com- 
mencing with the second iteration, the successive values of 
squared are differenced, and when the difference in absolute 
value becomes less than epsilon, the calculations cease. If the 
number of iterations necessary to achieve this result exceeds 
1 maz, a flag is set to a nonzero value and the procedure is termi- 
nated. In normal usage, the jump parameter is brought in as a 
ZERO. 

With certain data sets, convergence diffcultips will be ex- 
perienced. In these cases it is sometimes helpful to first utilize 
the procedure EXPCRVFT [Algorithm 275, Comm. ACM 9 
(Feb. 1966), 85] to obtain initial values for b and c, and then 
bring the jump parameter in as a ONE in order. to bypass the 
following starting value computations for b and c¢.; 

begin 

integer 1, |, m; 

real exp factor; 

if jump = 1 then go to entry; 

comment Computation of initial estimates follows; 

b := 2X In(abs((Gyin] — yln—1]) & G2] — xf1)))/ | 
((y{2] — yfl]l) X G@f{n] — z[n—1]))))/ 
(a[n] + aln—1] — x[2] — 2[1)); 

m := (n+l) + 2; 

exp factor := exp(b X (x|m] — x{k])); 

c := (y[m] — z X exp factor)/(1 — exp factor); 

a:= (2 —c) X exp(—b X alk)); 

E squared := 0; 

for i := 1 step 1 until n do 

E squared := E squared + (y[t] — c — a X exp(b X alt])) T 2; 
comment Computation of corrections follows; 

entry: for 1 := 1 step 1 until 1 maz do 
begin 
real sumexl, sumex2, sumgexl, sumgex2, sumgexilsex2, 
sumq2ex2, sumyt, sumyiexl, sumgytexl, zlsc, dil, d12, d22, 
el, e2, delta, v, w, save; 

sumexl := sumex2 := sumgexl := sumger2 := ative = 

sumg2ex2 := sumyi := sumyierl := sumgyierl := 0; 
for 7 := 1 step 1 until n do 

begin 
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real q, exl, ex2, gexl, gex2, qexllsex2, q2ex2; 


q = 2[t] — ak]; 
exl := exp(b X QQ); 
ex2  := exit 2; 


gexl := q X exl; 
gex2 := q X ex2; 


gexilsex2 := gexl — ger2;- 

q2ex2 = gex2 X q; 

sumexl := sumecl + eal; 

sumex2 := sumex2 + ex2; 

sumgexl := sumgexi + geri; 

sumgex2 += sumgex2 4+ gex2; 
sumgexilsex2 := sumgexllsex2 + geallsex2; 
sumg2ex2 := sumg2ex2 + q2ex2; 

sumyi := sumyt + y(t; 

‘sumytiexl := sumyterl + exl X ylt]; 


sumqyiexl := sumqyiexl + gexl X ylt]; 


end computation of sum terms in normal equations; 


zlsc := 2 — €; 

dil := sumq2ex2 X alse f 2; 

di2 := sumgexiisex2 X alsc; 

d22 :=n — 2X sumezi + sumez2; 


el := sumgyiexl X zlsc — sumgex2 X zlseT2 — 


sumgexl X alse X ¢; 


e2 := sumyt — sumyiexl + sumexl X (2 Xe — 2) + 


sumex2 X zlse —n X c€; 
delta := dll X d22 — di2T2; 


v i= (el & d22 — e2 X dl2)/delta; 
w:= (e2-* dil — el xX d12) /delta, 
b= b+; 
ec :=e+w; 


a:=(— é) X exp(—b X zk]; 
E squared := 0; 
for 1 := 1 step 1 until n do 

E squared := E squared + 

(ylt] — ¢ — a X exp(b X 2lt])) T 2; 

if 1 = 1 then go to retry; 
if abs(save — E squared) < epsilon 
then go to 73 
else if 1-< 1 max 

then. go to retry | 

else go to unfurl; 


retry: save := E squared;. 


unfurl: flag := 


73: 


end constrained least squares fit to y = 


end computation of corrected values of a, b, and c; 


aX exp(b Xx) + 


0 
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tion, is available to the user. For instance, in the example below 
with N = 8 the number A|[7] is the difference between A[1] for 
N =4 and A[1]) for N = 8. 

PARAMETER EXPLANATION. If the function F is odd or even 
then the Boolean parameters ODD or EVEN should be true 
respectively in which case every other coefficient in the array 
A will be zero. The array A will contain the coefficients of the 
truncated series with N-+1 terms. 

EXAMPLE. For the function F(z) = e* the following values 


ALGORITHM 277 

COMPUTATION OF CHEBYSHEV SERIES 

COEFFICIENTS [C6] 

Lyte B. Suita (Recd. 15 July 1965, 23 July 1965 and 20 
Sept. 1965) 

Stanford University, Stanford, California 


procedure CHEBCOEFF (F, N, ODD, EVEN, A); 


value \; : ; 

: . ; were computed for A[n] with N = 4and N = 8. The computations 
ieee ee AVES were done using this procedure written in Extended ALGou for 
Liane 7 the Burroughs B5500 computer. Also shown are computed values 
re aaa ne for the coefficients of the ‘‘best’’ polynomial of degree 8 from [4] 
array A; 


comment This procedure approximates the first N+1 coeffi- 
cients, ad, , of the infinite Chebyshev series expansion of a func- 
tion F(z) defined on [~1, 1]. 


F(x) = >0'a,T,(z), (1) 
n=0 
where ye denotes a sum whose first term is halved, and 17’, (zx) 
denotes the Chebyshev polynomial of the first kind of degree n, 
defined by 


T,(2) = cos n6, x = cos 0 (n = 0, 1, 2, ---). 


The truncated series 32% an7'n(x), gives an approximation to 
F(x) which has maximum crror almost as small as that of the 
“best”? polynomial approximation of degree N, see [1]. In this 
procedure the coefficients, a, , are closely approximated by 
Baw, = OC)N, which are the coefficients of a “Lagrangian” 
interpolation polynomial coincident with F(x) at the points 
z;,v = OU)N where x; = cos(ri/N), see [2]. The B,,y are given 
by 


2x 2x 
Buy = N >,” F@) T(z) = v ” Fai) Ts (tn), 


t= i=0 


where >,” denotes a sum whose first and last terms are halved. 
The Ba,w are evaluated by a recurrence relation described by 
Clenshaw in [1] and improved by John Rice [5]. This recurrence 
relation can also be used to evaluate the truncated series, 
ee Aanl'n(x), onee CHEBCOEFF has found values for the 
coefficients. For even N a relation between Brawn and Brin 
(pointed out by Clenshaw [3, p. 27]) is used in computing Bn.y . 
For large N, Br,w is very close to a, . In [2] the relation is given 
as 


Brin = A, + be (Gopn—n + Gopn+n)- (2) 


This shows that $By.n approximates ay quite well for large N 
sinee from (2) we see that 


$By,w = Qn + sn +e: (3) 


For even N a simple check on the accuracy is available. Since 
the relation 


Biw = Bawa — By-ay , n = O(1)N/2-1 (4) 
is used in the computation, the difference 
Brawn — Baw = By-nn ; (5) 


which measures in some sense the accuracy of the approxima- 


(digits differing from the correct result are in italics). 


n| A{n] with N = 4 | Alu] with V = 8 |‘‘Best” a, from [4]/Correct ay from [1] 


2.53213 21689/2.53213 17555)2.53213 17555|2.53213 17555 
1.18032 14175)1.13031 82080/1.13031 82080/1.13031 82080 
0.27164 03174)0.27149 53395/0.27149 53395/0.27149 53395 
0.04487 97762/0.04433 68498/0.04433 68498/0.04433 68498 
0.00547 42404/0.00547 42404/0.00547 42404/0.00547 42404 
0.00054 29263/0.00054 29263/0.00054 29263 
0.00004 49779)0.00004 49773/0.00004 49773 
0.00000 32096/0.00000 31984)0.00000 31984 
0.00000 01992/0.00000 01998/0.00000 01992; 


CON OO BR WND eK OC 


begin 


integer 7, m, N2, S1, S2, T1; 
real 00, b1, b2, pi, TWOX, FXN2; 
array FX, X[(0:N]; 
Boolean TEST; 
pi := 314159265359; 
N2 := N + 2; 
comment If N is even TEST is set to true; 
if 2* N2 = N then TEST := true 
else TEST := false; 
comment Compute the necessary function values; 
for 7 := 0 step 1 until N do 
begin 
X(t] := cos(pi * 1/N); 
FX{i] := F(X{t)); 
end; 
S2:=1; Sl := 0; 
comment If F(z) is odd or even initialize accordingly; 
if ODD then 


begin 
for m := 0 step 2 until N do 
Alm] := 0; 
S2:= 2; S1:= 1; 
end else 
if EVEN then 
begin 
for m := 1 step 2 until N do 
Alm] := 0; 
S2:= 2; Sl := 0; 
end; 
comment If TEST is true the coefficients are computed in 
two steps; 


FXN2 := FX[(N\/2.0; 
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if TEST then 
begin 
for m := S81 step S2 until N2 do 
begin 
b1 := 0; 
b0 := FXN2; 
TWOX := 2.0 X X[2 XK m}; 
for 1 := N—2 step —2 until 2 do 


begin 
62 := b1; b1 := b0; 
60 := TWOX X b1—b2 + FX{i); 
end; 
Alm] := 2.0 X (X[2Xm]xb0—b1+F X[0]/2.0)/N2; 
end; 
A[N2] := A[N2]/2.0; 
T1 := 81; 
ifODD \V EVEN then 
begin 


if 2 (N2 + 2) = N2 
then S1 := N2 + 2 — SI 
else Sl := N2+1+S1; 
end 
else Sl := N2 +1; 
end; 
comment Compute the desired coefficients; 
for m := S1 step S2 until N do 


begin 
b1 := 0; 
bO := FXN2; 


TWOX := 2.0 X X[m]; 
fori := N—-1 step — 1 until 1 do 


begin 
b2 := 61; bl := bO; 
b0 := TWOX X bl — 62 4+ FX{z); 
end; 
Alm] := 2.0 X (X{[m]xb0—b1+FX[0]/2.0)/N; 
end; 
if TEST then 
begin 


for 7 := T1 step S2 until N2—1 do 
Aft] := Aft] — A[N—7]; 
end; 
A[N] := A[N]/2.0; 
end CHEBCOEFF 
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ALGORITHM 278 escape: if abs(xmaz—axmin) < xerror VV abs(ymax—ymin) < 
GRAPH PLOTTER. [J6] yerror then go to label; 
P. Luoyp (Recd. 4 June 1965) | ee ee 
Queen Mary College, London, England for j := 1 step 1 until S do plot[i,j] := 2; 
procedure graphplotter (N, x, y, m, n, xerror, yerror, g, L, 8S, EM, for i := 1, L do 
CO, C1,.C2, C3, C4, label); for j := 1 step 1 until S do plot{z,j] := 1; 
value N, m, n, xerror, yerror, g, L, 8; for 7 := 2 step 1 until L—1 do 
array x, Y; for 7 := 1, S do plot{i,j] := 1; 
integer N, 9g, m,n, L, 8S; ifn = 1 then go to N1B; 
real xerror, yerror; for 2 := 1 step 1 until m do 
string HM, CO, Cl, C2, C3, C4; for j := n step —1 until 1 do 
label label; plot{1+entier (0.5+pX (x[t]—xzmin)), 
comment This procedure is intended to be used to give an ap- 1+entier (0.5-+ 4X (y[t,7|—ymin))] := 7+2; 
proximate graphical display of a multivalued function, y[z, j] of plotter: 
z{t], on a line printer. Output channel N is selected for all out- for 7 := 1 step | until L do 
put from graphplotter. The display is confined to points for which begin 
L1<ti<mandil<j<nwhere2<n< 4. lin = 1, then y is NEWLINE (N,1); SPACE(N,g); 
considered to be a one-dimensional array y[i] and the display is comment NEWLINE and SPACE must be declared 
again given for 1 <7 < m. The format of the print out is ar- globally to graphplotter, NEWLINE(N,p) outputs p car- 
ranged so that a margin of g spaces separates the display from riage returns and p line feeds on channel N, SPACE(N,p) 
the left-hand side of the page. L and S denote the number of outputs p blank character positions on channel N; 
lines down the page and the number of spaces across the page for 7 := 1 step 1 until S do 
which the display will occupy. The graph is plotted so that lines begin 
1 and £ correspond to the minimum and maximum values of z, switch SW := SW1, SW2, SW3, SW4, SW5, SW6; 
and the spaces 1 and S correspond to the minimum and maxi- go to SW[plot{i,7]]; 
mum values of y, that is, y is plotted across the page and x down SWI: outstring(N,EM); go to fin; 
the page. After the graph has been plotted, the ranges of z and SW2: outstring(N,C0); go to fin; 
y for which the display is given are printed out in the order as SW3: outstring(N,Cl); go to fin; 
above, separated from the display by a blank line. The strings SW4: outstring(N,C2); go to fin; 
EM --- C4 must be such that they occupy only one character SW5: outstring(N,C3); go to fin; 
position when printed out. The characters of Cl C2 C3 C4 repre- SW6: outstring(N,C4); 
sent y(z,1] y[z,2] y[¢,3] y[t,4]. HM is the character printed out fin: 
round the perimeter of the display. C0 is printed at empty end 
positions. At coincident, points the order of precedence of the end of display output; 
characters is Cl C2 C3 C4 EM C0. For the special case n=1 the NEWLINE(N,2); SPACE(N,g); outreal(N,xmin); 
character Cl represents y(t]. Control is passed from the pro- outreal (N ,amazx) ; 
cedure to the point labeled label if the interval between the outreal(N ymin); outreal(N ymax); 
maximum value and minimum values of 2[i] is less than zerror, go to end; 
or if the range of y is less than yerror. If the values of x[z] occur N1A: 
at equal intervals, choosing L=m will make one line equivalent ymax := ymin := y[1); 
to one interval of x; for 7 := 2 step 1 until m do 
begin begin 
real p, q, xmaz, xmin, ymax, ymin; if y[i] > ymax then ymax := y{t]; 
integer 7, j; if y[t] < ymin then ymin := y{t] 
integer array plot(1:L,1:S]; end of hunt for maximum and minimum values of y when 
xmax := xemin := x{l]); n=1; 
for 7 := 2 step 1 until m do go to escape; 
begin NIB: 
if z[¢] > xmazx then zmax := 2[1]; for 1 := 1 step 1 until m do 
if x[z] < amin then xmin := z{t] plot(itentter (0.5-+pX (x[t]—xmin)), 
end of hunt for maximum and minimum values of 2; l+entter (0.5+¢X (y[t]—ymin))] := 3; 
if n=1 then go to NIA; go to plotter; 
ymax := ymin := y(1,1); end: 
for 71 := 1 step | until m do end of graphplotter 
for j := 1 step 1 until n do 
begin 


if y{i,j] > ymaz then ymaz := y{t,J]; 
if yli,j] < ymin then ymin := ylt,J] 
end of hunt for maximum and minimum values of y; 
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ALGORITHM 279 
CHEBYSHEV QUADRATURE [D1] 

F. R. A. Hopcoop and C. LirHeruanp (Recd. 31 July 
1964, 1 Dec. 1964, 16 Aug. 1965 and 29 Nov. 1965) 
Atlas Computer Laboratory, 8.R.C., Chilton, Berks, 

England 


real procedure cheb(a, b, error, nmaz, f); 
value a, b, error, nmax; reala, b, error; 
procedure f; 
comment This routine evaluates the integral of f(z) with lower 
and upper limits set to a and b respectively.:The method is 
that suggested by Curtis and Clenshaw [Num. Math. 2 197-205 
(1960),]. The method consists of fitting 2 t n +/1 point Cheby- 
shev pcelynomial to integrand and thus finding integral. 7 is 
tried equal to 2 and increased by 1 if error, thé relative error, 
is too large. If n reaches maximum nmaz without required ac- 
curacy obtained a message is printed. Accuracy is determined by 
assuming that error is less than the contribution to the integral 
of the last term in the integrated Chebyshev polynomial. After 
n = 2 has been tried, an estimate of the integral is available 
and subsequently the last term in the Chebyshev polynomial is 
found first and this saves evaluating whole polynomial if ac- 
curacy not obtained. An extra check is that the: next two terms 
are also tested allowing up to 8 times error on previous term in 
each case. A reasonable value for nmaz is probably 7. Integrals 
requiring many more points than this would probably be better 


integer nmaz; real 


tackled using some method which subdivides the range. Also. 


the temporary storage required increases considerably for larger 
values of nmazx. For example nmax = 10 requires 2048 words; 
begin 
real armin1, aradd1, bmina, badda, br, bsum, es; csadd1, csadd2, 
esterr, x, estint, inidv2, twodun, twotr, verror; 
integer j,k, m, r, 8, mmax, mmazxd2, rk; 
k := 2 (nmaz — 2); 


mmaxd2 := 2X k; 
mmax := 2X mmaxd2; 
begin 
real array func, cosine [0:mmaz]; 
bmina := .6 X (b—a); 
badda := .56 X (bXa); 
‘nodine & := 1; m:= 4; 


comment m-+1 is number of points used in Chebyshev fit; 
start: twodun := .5  twodvn; 
bsum := aradd1 := 0; 
k := k+2; 
j := if m = 4 then 0 else k; 
fnrein: if 7 S mmaad2 then 
begin pas * 
cosine [j] := if m = 4 then cos (3.14159265 X j/mmaz) 
else if j = k then sqrt ((1 + cosine[2 X j])/2) 
else (cosine[j — k] + cosine[j + k])/ @ x cosine|k]) ; 
cosine [mmax ~ j] := —cosinelj] 
end; 
z= bmina X cosine [j] + badda; 
func [j] := if j = mmaz then .5 X f(z) else f(z); 
Ji= 2X k+9; 
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comment Evaluates remaining values of integrand required 
storing .56 X lower bound for easier use in Cr recurrence 
formula; 

if mmax = j then go to fnrein; 


ifm =4thenk :=2Xk; 
verror := error; 
ri=™; 


rk := mmaz; 
comment  verror is the error allowed in Chebyshev coefficient 
compared with estimate of integral; 
brretn: twotr := 2 & cosine[rk]; 


csadd2 := Q; 
csaddl := 0; 
8 := mmazx; 


cretn: cs := twotr X csaddl — csadd2 + fune[s]; 
if s ~ 0 then 


begin 
csadd2 := csadd1; 
csaddl := cs; 
s:=s—k; 
go to crétn 


end recurrence to evaluate next Chebyshev coefficient of 
pe function; 
armin1l := .5 X twodun * (cs — csadd2) X (if r = m then 
5 elde 1.0); - 
br := 6X (armini — aradd\)/( + 1); 
comment bris Chebyshev coefficient of integrated function; 


bsum := bsum + br; 
aradd1 := arminli; 
comment integral = (6 — a) X (01+ 63 + --- +.5& bn); 


ifr = m then esterr := br; 
comment error assumed less than last term added in br sum; 
if m + 4 74383 m + mmaz 743838 r 2m — 4 then 


begin: 
if abs(6r) 2 verror X. estint then 
begin 
newm: m:=2Xm; 
go to start 
end; 
verror := 8 X verror 


end Checks last 3 coefficients to ensure within allowed 
error bounds; 
if r ~ 0 then 
begin 
r:=r—2; 
rk := rk —-2Xk; 
go to brrein 
end; 
inidv2 := bsum X bmina; 
estint := abs(bsum); 
if error X estint < abs(esterr) then 
begin 
if m ~ mmaz then go to newm; 
outstring (1, ‘Accuracy not obtained’); 
end; 
cheb := 2 X intdv2 
end 
end cheb 
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REMARK ON ALGORITHM 279 
CHEBYSHEV QUADRATURE [D1] 
F. R. A. Hopgood and C. Litherland 
‘Comm. ACM 9 (Apr. 1966), 270] 


The 38rd line of the second column on page 270 should read: 
ifm #4 Am # mmar Ar > m — 4 then 
A printing error showed A as 7433. 


CERTIFICATION OF ALGORITHM 279 [D1] 

CHEBYSHEV QUADRATURE [F. R. A. Hopgood and 
C. Litherland, Comm. ACM 9, 4 (Apr. 1966), 270] 

KENNETH Hitustrom (Recd. 16 Dec. 1966 and 30 Jan. 
1967) 

Applied Mathematics Division, Argonne National Labora- 
tory, Argonne, Illinois 


Work performed under the auspices of the US Atomic Energy Commission 


The 40th line of the first column on page 270 should read: 
badda := .5 X (b+a); 

So corrected, Chebyshev quadrature was coded in CDC 3600 
Au@ou. A modified version of this quadrature scheme was coded 
in 3600 Compass language. In this modification the cosine values 
are program constants, with 3600 single-precision accuracy, as 
opposed to program generated values, which tests show have 
raaximum absolute errors of 2-35, These errors are carried into the 
integrand argument evaluation, resulting in large relative errors 
in the integrand evaluation, for functions bounded by unity over 
the interval of integration, for example, e-* over (0, 4.3) and sin(z) 
over (0, 27), which in turn delays convergence. 

Since 8600 Compass does not permit dynamic allocation of 
storage, the dimension of the cosine array must be fixed. The 
choice of 129 = 27 + 1 terms is based on the recommendation in 
the comments of Algorithm 279, ‘‘A reasonable value for nmax is 
probably 7.”’ 

The Chebyshev quadrature 3600 ALGOL program, the modified 
3600 Compass routine, and 3600 Forrran-coded Romberg and 
Ilavie integration routines were tested with six integrands. The 
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TABLE I 

Num- 

ber 

of 

Integrand A} B|EPS VI Routine VA func- 
tion 

evalu- 
ations 

Havie 0. 886226924 17 

e2? 0 | 4.3] 10-6 | 0.886226924 Romberg 0. 886226925 65 
Chebyshev 0.886095576 | 129 

Chebyshev (Rev.)} 0.886226926 | 17 

Havie 6. 268233308 | 129 

sin (2) +1 0 | 27 | 10-8 | 6.283185308] Romberg 6. 268233309 | 129 
Chebyshev 6.282993876 | 129 


Chebyshev (Rev.)| 6.288185309 5 


: Havie 5.034254231 | 129 
(z)-a)In(=)} 0 | 1 | 10-8 | 6.0 Romberg 5.034254231 | 129 
Chebyshev 5.829597734 | 129 


Chebyshev (Rev.)} 5.701177427 ) 129 


14. 02585084 65 


Havie 
In (x) 1 {10 | 10-6 }14.02585088 |} Romberg 14. 02585085 65 
Chebyshev 14.02585096 17 


Chebyshev (Rev.)| 14.02585097 17 


‘ Havie 1.979745104 | 129 

in (5) 0) 1 | 10-* | 2.0 Romberg 1.979745104 | 129 
Chebyshev 1.999599461 | 129 

Chebyshev (Rev.)| 1.997983436 | 129 

Havie 1.582238946 | 17 

1 —1|1 | 10-6 | 1.5322329* | Romberg 1.582238046 | 17 
(24+ 2240.9) Chebyshev 1.582232967 | 17 
Chebyshev (Rev.); 1.582232967 | 17 


d 
® The value /7) Saco = 1,5822329 is obtained from C. W. Clenshaw and 


A. R. Curtis, ‘A method for numerical integration on an automatic computer,’”’ 
Numer. Math. 2 (1960), 2038. 


Romberg and Havie routines are based upon Algorithm 60, Rom- 
berg Integration [Comm. ACM 4, (June 1961), 225], and Algorithm 
257, Havie Integration [Comm. ACM 8 (June 1965), 381]. 

The results of these tests are tabulated in Table I. In the table, 
A is the lower limit of the interval of integration, B is the upper 
limit, EPS the convergence criterion, VJ the value of the integral, 
and VA the value of the approximation. 

Due to storage requirements, Chebyshev quadrature is re- 
stricted to a maximum of 129 function evaluations. For reasons 
of comparison, this limit is also imposed on Romberg and Havie 
quadratures. Thus, in some cases the accuracy called for was not 
obtained. 


REMARK ON CORRECTION TO CERTITICATION 
OF ALGORITHM 279 [D1] 

CHEBYSHEV QUADRATURE [F.R.A. Hopgood and 
C. Litherland, Comm. ACM 9 (Apr. 1966), 270 and 10 
(May 1967), 294] 

KennetH Hiuustrom (Recd. 26 June 1967) 

Applied Mathematics Division, Argonne National Labora- 
tory, Argonne, Illinois 


There are two corrections that should be appended to the certi- 
fication of Algorithm 279. 

Due to programming error, the integrand function routines for 
e and sin(x)+1, used by the Chebyshev routine, incorrectly 
evaluated the functions at x = 0, thus delaying convergence. 
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The revised Chebyshev routine still converges more rapidly 
than the original scheme in the first two examples, but the ad- 
vantage is muct ‘ess pronounced than previously indicated. 

The amended Table I should read as follows, with the numerical 


corrections italicized. 


TABLE I 

Number 

of func- 

Integrand | A | B | EPS VI Routine VA tion 
evalu- 

ations 

a 0 | 4.3} 10-6 | 0.886226924| Havie 0.886226924 17 
Romberg 0. 886226925 65 

Chebyshev 0.8862269261 83 

Chebyshev (Rev.) | 0.8862269258 17 

sin(a)+1 O | 27] 10-6 | 6.283185308] Havie 6283185307 3 
Romberg 6. 283185307 3 

Chebyshev 6 52831853086 9 

Chebyshev (Rev.) | 6.2881853089 5 
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ALGORITHM 280 

ABSCISSAS AND WEIGHTS FOR GREGORY 
QUADRATURE [D1] 

Joun H. Weuscu (Recd. 27 Apr. 1965, 14 May 1965, 14 
Sept. 1965 and 9 Dec. 1965) 

Computation Center, Stanford University, Stanford, Calli- 
fornia, 


procedure gregoryrule (n, 7, t, w); 
value n, 7; integer n,7; real array t, w; 
comment. Computes the abscissas and weights of the Gregory 


quadrature rule with r differences: 


bn 1 1 h 
f@ dt = i (Ep +h + faa +34) — 9 VA Afo) 


to 


ce ms (Vfn + A*fo) wg wee, eee heru(V'fn + A’fo) 


7 oy w; f(t), 
j=0 
where h = (tn — &)/n, and thec;* are given in Henrici [1964]. The 
number r must be an integer from 0 to n, the number of sub- 
divisions. The left and right endpoints must be in ¢[0] and ¢[n] 
respectively. The abscissas are returned in ¢[0] to ¢[n] and the 
corresponding weights in w[0] to w[n]. 

If r = 0 the Gregory rule is the same as the repeated trapezoid 
rule, and if r = n the same as the Newton-Cotes rule (closed 
type). The order p of the quadrature rule is p = r + 1 forr 
odd and. p = r + 2 for r even. For n 2 9 and large r some of the 
weights can be negative. 

For n S$ 82 andr S 24, the numerical integration of powers 
(less than r) of z on the interval (0, 1] gave 9 significant digits 
correct in an 11-digit mantissa. Since the binomial coefficients 
are generated in the local integer array b, integer overflow may 
occur for large values of r. The type of b can be changed to real 
to prevent this with no change in the results stated above. 
REFERENCES: 

HILDEBRAND, F. B. Introduction to Numericai Analysis. 

McGraw-Hill, New York, 1956, p. 155. 
Henrict, Peter. Elements of Numerical Analysis. Wiley, 
New York, 1964, p. 252.; 
begin integer i,j; real h, cj; 
integer array b[0: 7]; real array c(0: n + 1]; 
b{0]) := 1; cf0] := 1.0; ell] := —0.5; b[n] := 0; 
h := (t{n] — t{[0])/n; w[0] := wn] := 0.5; 
for? := n—I1 step —1 until 1 do 

begin w[t] := 1.0; é(¢7] :-=7Xh+ [0]; bf] := Oend; 
ifr >nthenr := n; 
for 7 := 1 step 1 until 7 do 
begin cf := 0.5 X c[j]; 

for 1 := j step —1 until 1 do b[z] := b{z| — bf¢—1); 

for 7 := 3 step 1 until j + 2 do cj := cj + cly+2-—7]/7; 

e[j-+1] := —aj; 

for 1 := 0 step 1 until n do 

wlt] := wt] — ej X (b[n — 2] + Bz); 
end; 
for 7 := 0 step 1 until n do w[z] := wit] XA 
end gregoryrule 
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ALGORITHM 281 

ABSCISSAS AND WEIGHTS FOR ROMBERG 
QUADRATURE [D1] 

Joun H. Weiscu (Reed. 27 Apr. 1965, 14 May 1965, 14 
Sept. 1965 and 9 Dec. 1965) 

Computation Center, Stanford University, Stanford, Cali- 
fornia 


procedure rombergrule (n, p, t, w); 
value n, p; integer n,p; real array ¢, w;_ 

comment Computes the abscissas and weights of the pth order 
Romberg quadrature rule which features equally spaced ab- 
scissas and positive weights lying between 0.484 & h and 1.4524 
x h. (h = subdivision length). The number of subdivisions n 
must be a power of 2 (say 2f g) and p aneven number from 2 to 


2q+2. Romberg integration is normally given as the extrapola- 
tion to the limit of the trapezoid rule. Let 


1 
To =h (r +h tees + fob +5 je) and 7” 


arn (k+1 he 
A Tint — Tint 
4m— 1? 


then the Romberg quadrature rule gives 


tn 
i fd) dt = TY = 
to 


where n = 24, m = (p — 2)/2,andk = q—m. The left and right 
endpoints must be in ¢[0] and ¢[n] respectively. The abscissas 
are returned in ¢[0] to ¢[n] and the corresponding weights in 
w[0] to w{n]. 

If p = 2 the Romberg rule is the same as the repeated trape- 
zoid rule, and if p = 4, the same as the repeated Simpson rule. 

For n S 128 and p S 16, the numerical integration of powers 
(less than p) of x on the interval [0, 1] gave answers correct to 
one round off error in an 11-digit mantissa. 
REFERENCE: Bauer, F. L., Rutishauser, H., and. Stiefel, E. 
New aspects in nuimerioal quindpabare: Proc. of Symp. in Appl. 
Math., Vol. 15: High speed computing and experimental arith- 
metic. Amer. Math. Soc., Providence, R. I. , 1963, ‘pp. 199-218; 

begin integer 7, j, m, ml, ine 83 

real h, ci; real array c[0: (p — 2)/2]; 
his (¢(r] — ¢[0})/n; w[0] := w[n] := 0; 
for 7 := n—1 step —1 untilldo — 

begin w{t] := c[t]:=0; ti] :=7xh+?[0] end; 


p> w; f(ts), 


jm 


= (p — 2)/2; c[0]:= 1.0; s:=m4:=1; ais = 0; 
if m > In(n)/In(2) then m := In(n)/In(2); 
for j := 1 step 1 until m do . 
begin m4 := 4 X m4; ml := m4 —1; 

for 7 := j step —1 until 1 do | 
cli] := (m4 X cli] — eft — 1})/ml; 
c[0] := c[0] X enka 
-end; 
for i := 0 step 1 until m do 
begin ci := clt] X 8; 
for j := 0 step s until n do w[j] := wly]+cz; 
=2Xs 


end; 
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w[0] := wl[n] := 0.5Xw(0); 
for j := O step 1 until n do wij] := wij] XA; 
end rombergrule 


REMARK ON ALGORITHM 281 [D1] 

ABSCISSAS AND WEIGHTS FOR ROMBERG 
QUADRATURE [John H. Welsch, Comm. ACM 9 
(Apr. 1966), 273] 

J. Booturoyp (Recd. 13 Sept. 1966 and 14 Nov. 1966) 
University of Tasmania, Hobart, Tasmania, Australia 


The following changes which effect two minor improvements and 

correct two errors are recommended: 

1. The expression (p — 2)/2, which occurs twice, should 
preferably be written (p — 2)+2 

2. Delete c[i] := from the left part list of the statement 
wii] := efi] := 0 which occurs within the scope of the first for 
statement 

3. Delete the statement c[n] := 0; 

4, Add, immediately following m1 := 
ment clj] := 0; 


These changes have been tested by the author of Algorithm 281 
using B5500 ALGOL. 


m 4 — 1, the state- 
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ALGORITHM 282 

DERIVATIVES OF e*/z, 
[S22] 

WALTER GautTscui (Recd. 19 Aug. 1965) 

Argonne National Laboratory, Argonne, III. 


* Work performed under the auspices of the U.S. Atomic Energy 
Jommission. Author’s present address is Purdue University. 


cos (x)/x, AND sin (x)/x* 


procedure dsubn(xz, nmazx, d); 
value 2, nmax; integer nmax; realz; array d; 
comment This procedure generates the derivatives 


da” z 
OO ae (£) (a = 0,1, 2, +++, nmaz) 
dz*\ 2x 


using the recurrence relation 
dn(w) = (e? — ndy«(x))/2 (n = 1, 2,3, +++). 


The results are stored in the array d. If x = 0, there is an error 
exit to a global label called alarm; 
begin integer 7; real e; 
if x = 0 then go to alarm; 
e:= exp(z); d[0] := e/z; 
for n := 1 step 1 until nmaz do 
d[n] := (€e-—nXd[n — 1))/x 
end dsubn; 
procedure csubn (xz, nmaz, c); 
value z,nmaz; integer nmaz; realz; array c; 
comment This procedure obtains the derivatives 


. an COs x 
dx* x 


from the recurrence relation 


Cn(z) = (ta(z) — nepsr(x))/2 (n = 1, 2,3, --+ ), 


cn(x) ) (n = 0,1, 2, ---, nmaz) 


where {ra(z)}ne1 = {—sin x, —cos 2, sin x, cos x, —sin 2, -++}. 
The results are stored in the array c. If x = 0, there i is an error 
exit to a global label called alarm; 
begin integer n; array tau(l: 4): 
if z = 0 then go to alarm; 
tau{1] := ~—sin(x); tau[2] := —cos(z); 
tau[3] := —tau[1]; tau[4] := —tau[2]; 
c[0] := tau[4]/z; 
for n := 1 step 1 until nmaz do 
ce[n] := (tau[n—4X ((n—1) + 4)] — nXel[n—1])/x 
end csubn; 
procedure ssubn(z, nmaz, d, 8); 
value x, nmaz,d; integer nmaz,d; realz; array 8; 
comment This procedure generates to d significant digits the 


derivatives 

d" } 

Bel (fli (n = 0,1,2,---,mmaz), 
dx” x 


Sa(z) = 


and stores the results in the array s. The method of computation 
is based on the recurrence relation 
8n (x) = (on (x) — NS8n-1(x))/z (n = 1, 2,3, a i 


where {on(t)}nei = {cos x, —sin x, —cos x, sin x, cos x,+--}. 
The recurrence relation is applied in forward direction as long 
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as n S |x|, and in backward direction for the remaining values 
of n, starting with an appropriately large n = v. A detailed dis- 
cussion of the method will be published elsewhere. It is assumed 
that a global real procedure t(y) is available, which evaluates 
the inverse function t = t(y) of y = t ln t to low accuracy for 
y = 0.(See W. Gautschi, Algorithm 236, Bessel functions of 
the first kind, Comm. ACM 7 (Aug. 1964), 479 Gautschi, W. 
Computation of successive derivatives of f (z)/z, in press; 
begin integer n,70,nu; realzi,dl,sl; array sigma [1: 4]; 


1 := abs(x); 
sigma [1] := cos(x); sigma [2] := —sin(z); 
sigma [3] := —sigma [1]; sigma [4] := —sigma [2]; 


n0 := entier (x1); 8(0] := ifz ~ 0 then sigma [4]/z else 1; 
for n := 1 step | until if 20 S nmaz then n0 else nmaz do 
s[n] := (sigma[n — 4 X ((n — 1) + 4)] — n X 8[n — 1))/2; 
if n0 < nmaz then 
begin 
sl := 0; dl := 2.3026 x d + .6931; 
= if nmax S 2.7183 K x10 then 
1 + entier (2.7183 & x1 X t(.86788 K-d1/x1)) else 
1 + entier (nmax X i(dl/nmaz)); 
for n := nu step —1 until n0+2 do 
begin 
= (sigma{n — 4 X ((n — 1) + 4)] — & X 81)/n; 
ifn S nmax + 1 then s[n—1] := sl 
end 
end 
end ssubn 


REMARK ON ALGORITHM 282* [822] 
DERIVATIVES OF e’/z, cos (z)/z, AND sin (z)/z 
[Walter Gautschi, Comm. ACM 9 (April 1966), 272] 
WALTER GAUTSCHI AND Bruce J. KuEIn (Recd. 12 May 
1969) 

Computer Sciences Department, Purdue University, La- 
fayette, IN 47907 and College of Arts and Sciences, 
Virginia Polytechnic Institute, Blacksburg, VA 24061 


* Work supported by the National Aeronautics and Space 
Administration NASA under Grant NGR 15-005-039. 


KEY WORDS AND PHRASES: 
cessive derivatives, error control 
CR CATEGORIES: 5.11, 5.12 


recursive computation, suc- 


For large values of x, and derivatives of order n > 2, the first 
two procedures of Algorithm 282 incur substantial loss of accuracy. 
The reasons for this, as well as remedial measures, are described 
in the companion article [1]. The following revised procedures, 
based on this article, are believed to preserve accuracy as far as 
seems possible. Both procedures call upon the real prgeedure t of 
Algorithm 236 [2]. 


procedure dsubn (x, nmaz, acc, machacc, d, error); 
value z, nmax, acc, machacc; integer nmaz, acc, machacc; 
real z; array d; label error; 
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comment Givenx ~ 0, nmaz, and the number machacc of decimal 
digits available in the mantissa of machine floating-point num- 
bers, this procedure generates the derivatives 


dx x 


an (e 
d(x) = — (5) », n=0,1,2,---, nmaz, 


to an accuracy of acc significant decimal digits, except near a 
zero of dn(z), where some significance may be lost. The result 
d,(z) is stored in d[n]. If 2 = 0, the procedure immediately 
exits to the label error; 
begin 
integer 710, min, n, n1; 
Boolean booll, bool2; 
if x = 0 then go to error; 
xl := abs(x); nO := 21; 
d(0] := e/z; 
a := 1.1513 X (machacc—acc) — .3466; 
ifa < 2thena := 2; 
booll := a2 <0Va2l Sa; bool2 := nO < nmaz; 
min := if bool2 then 70 else nmaz; 
for n := 1 step 1 until if booll then nmaz else min do 
d[n] := (e-nXd[n—1])/2z; 
if (-booll) (A bool2 then 
begin 
nl := 2.71838 X zl X 
t((21+2.3026 Xacc-+ .6982) / (2.7183 21))—1; 


real 21, e, a, q; 


e := exp(x); 


if nl < nmaz then nl := nmax; 

gq := Ife; 

for n := 1 step 1 until nl + 1 dog := —n X q/z; 
for n := ni step —1 until n0 + 1 do 

begin 


g := (e—2Xq)/(n+)); 
if n S nmaz then d[n] := q 
end 
end 
end dsubn; 
procedure csubn (x, nmax, acc, machace, c, error); 
value x, nmaz, acc, machacc; integer nmaz, acc, machacc; 
real z; arrayc; label error; 
comment This procedure generates the derivatives 


ee 


re ee for = 0,1,2,--- amar), 
and stores them in the array c. The parameters acc, machacc 
have the same meaning as in the preceding procedure. There is 
an error exit if x = 0; 
begin 
integer 70, min, n, nl; 
Boolean booll, bool2; 
if x = 0 then go to error; 


real zl,a,q; array tau[(1:4]; 


vl := abs(z); nO := 21; 
tau[1] := —sin(x); tau[2] := —cos(x); 
tau[3] := —tau[l]; tau[4] := —tau[2}; 


c(0] := tau[4]/x; 
a@ := 2.3026 X (machacc—acc) — .69315; 
ifa < 8thena := 3; 


booll := zl Sa; bool2 := n0 < nmaz; 
min := if bool2 then n0 else nmaz; 
for n := 1 step 1 until if booll then nmax else min do 


c[n] := (tau[n—4X ((n—1) +4)]—nXc[n—1])/2; 
if (-booll) A bool2 then 


begin 
nl := 2.7183 & x1 X t((2.3026 Xacc+.6932) / (2.7183 21))—1; 
if nl < nmax then nl := nmaz; 
q := 1/2; 
for n := 1 step 1 until nl + 1 doq:= —n X @q/z; 
for n := nl step —1 until n0 + 1 do 
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begin 
q := (tau[n+1—4x (n+4)]—2XQ)/(n+1); 
if n S nmaz then c[n] := ¢ 
end 
end 
end csubn 
REFERENCES: 

1, GautscH!, WALTER, AND Kein, Bruce J. Recursive com- 
putation of certain derivatives—A study of error propagation. 
Comm. ACM 18 (Jan. 1970), 7-9. 

2. Gautscurt, Waurer. Algorithm 236, Bessel functions of the 
first kind [S17]. Comm. ACM 7 (Aug. 1964), 479-480. 
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ALGORITHM 283 

SIMULTANEOUS DISPLACEMENT OF POLYNO- 
MIAL ROOTS IF REAL AND SIMPLE [C2] 

Immo O. KERNER (Recd. 8 Sept. 1965 and 12 Nov. 1965) 

Rechenzentrum Universitaet Rostock 


procedure Prrs (A, X,n, eps); value n, eps; 
integer n; real eps; array A, X; 

comment Prrs (polynomial roots real simple) computes the n 
roots X of the polynomial equation 


Ana” + Anz + Pan oe + Ao = 0 


simultaneously. On entry the array X contains the vector of 
initial approximations to the roots and on exit it contains the 
vector of improved approximations to the roots. The initial 
approximations must be distinct. Accuracy is specified by means 
of a parameter eps. Iteration is continued until the Euclidean 
norm of the correction vector does not exceed eps. The con- 
vergence is quadratic; 
begin integer i,k; real z, P, Q; 
eps := eps tf 2; 
W: Q:= 0; 
for 7 := 1 step 1 until n do 
begin z:= P := A[n); 
fork := 1step1 until 2 do 
begin zs := 2 XK X(t] + A[n — ky]; 
ifk #ithen P := PX (X[i]—X[k]) 
end; 
X(t] := X[t]-—2/P; 
Q:=Q+ (a/P)T2 
end; 
if Q > eps then go to W 
end 
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ALGORITHM 284 

INTERCHANGE OF TWO BLOCKS OF DATA [K2} 

Witiiam Fiuercuer (Recd. 25 Oct. 1965 and 24 Nov. 
1965) 

Bolt, Beranek and Newman, Inc., Cambridge, Mass. 

and 

ROLAND SILVER 

The Mitre Corp., Bedford, Mass. 


procedure interchange (a, m,n); 
value m,n; integer m,n; array a; ie 

comment This procedure transfers the contents ‘of a[1] --- alm] 
into a[n+1] --- afn+m] while simultaneously transferring the 
contents of alm + 1] --- alm +n] into all] --- a[n] without using 
an appreciable amount of auxiliary memory. 

The nonlocal procedure gcd (x, y) has value the greatest 
common divisor of the integers x and y. The nonlocal procedure 
swap (x, y) interchanges the values of the variables z and y. 

Let G be the additive group of integers modulo m+n. The 
multiples 0, 7, 2n, --- of n form a cyclic subgroup C of G. The 


order of C is r = (m+ 7n)/d, where d is the greatest common - 
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divisor of m and n. The integers 1, --- , d belong to distinct 
cosets C; «+» Cz of C. These cosets form a disjoint covering of G. 


The interchange procedure is'based on the fact that if we start 
with a member. zx of the coset C, , and add n repeatedly modulo 
m + n, we will in r steps have generated each member of C; 
just once; . 

begin 
integer d, 1, 7, k, r; 
real ¢; 
d := gcd (m, n); 
r:= (m+n) +d; 
- for? := 1 step 1 until d do 
begin 

jgiety 

t := alt]; 

for k := 1 step 1 until r do 

begin 

ifj < mthenj :=j+nelsej :=j — m; 
swan (¢, alj]) 

end k 
end 2 

end interchange 


ACM Transactions on Mathematical Software, Vol. 2, No. 4, December 1976, Pages -- 392-303. 


REMARK ON ALGORITHM 284 


Interchange of Two Blocks of Data [K2] 


[W. Fletcher and R. Silver, Comm. ACM 9, 5 (May 1966), 326] 
M.R. Ito [Recd 25 July 1975 and 25 May 1976] ° 


Department of Electrical Engineering, University of British Columbia, Vancouver, 
B.C., Canada, V6T 1W5. 


The relocation of two contiguous blocks of data performed by Algorithm 284 can 
be regarded as a permutation problem. That is, the first m components and the last 
n components of an (m + n) dimensional vector, a, are interchanged by the trans- 
formation, b = Qa, where Q is a permutation matrix defined in partitioned form as 


and I; is the identity matrix of order k. 

Algorithm 284 is in fact equivalent to the representation [1] of the desired per- 
mutation as the product of r disjoint cycles, with cach cycle comprising d compo- 
nents, where 

d = greatest common denominator of m and n; 

r=(m+n) +d. 

A more efficient algorithm for performing the permutation is based on the fol- 
lowing decomposition of @. Let Pz be the permutation matrix of order k with ones 
along the minor diagonal (zcros elsewhere). Then, Q can be decomposed as 


Q = PirinkS, 
where 
In | O P,| O 
R=|—"1—-|,  g=|—"}— 
O | Pn O ! i 


The partial permutation associated with P, can be represented as a product of 
(k/2 — (k/2) mod 1) disjoint cycles; each cycle comprising only two components 
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with easily computed indices. This latter property, combined with the above de- 
composition of Q, leads to an algorithm which avoids the following features present 
in Algorithm 284: 

(i) computation of the greatest common denominator; 

(ii) conditional calculation of array element index in inner loop; 

(ili) extra storage and variable assignment. 


Geometrically, the matrix Q can be interpreted as a rotation matrix, and the 
matrices Pnin, R, and S can be interpreted as reflection matrices. 
The new algorithm is given below. 


procedure rotatecirclist (a, m, n); 
value m,n; integer m,n; array a; 
comment This procedure transfers the contents of afl]... a{m] into a[n + 1]...a[n + m] 
while simultaneously transferring the contents of a[m + 1]...a[m-+n] into a[i]...aln]. 
The nonlocal procedure swap (x, y) interchanges the values of the variables x and y. 
Fewer steps occur if the result of integer division is truncated rather than rounded, but 
the procedure also works in the latter case; 
begin 
ifm ~O0 An # 0 then 
begin 
integer 1, k, l; 
k:=m+1; l:=m + 2; 
for 7 := 1 step 1 until / do swap (a[t], a[k — 7]); 
k:=k+n; l:=n +2; 
for 1 = 1 step 1 until 1 do swap (a[m + 17], alk — 7]) 
L:= (m+n) + 2; 
for i := 1 step 1 until / do swap (a[t], a[k — 7]) 
end; 
end rotatecirclist; 


REFERENCES 


1. Knutu, D.E. The Art of Computer Programming, Vol. 1. Addison-Wesley, Reading, Mass., 
1969. 
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ALGORITHM 285 
THE MUTUAL PRIMAL—DUAL METHOD [H] 
Tuomas J. Arrp (Reed. 29 June 1964 and 5 Apr. 1965) 
Wolf Research and Development Corporation 
Manned Spacecraft Center 
Houston, Texas 
procedure Linearprogram (n, p, A, min, psol, dsol, bool); 
value p, n; integer p, n; array A, ere dsol; real min; 
Boolean bool; 
comment This procedure solves the linear programming prob- 


lem by the Mutual Primal-Dual Simplex Method. The problem 
is assumed to be in the following form: 


AX+B<0 
Xx >0 
d+ CTX 


where Aisp Xn, Bisp X1landCisn X 1. The dual problem 
is then, 


min u = 


Y>o0O 
ATY+ C20 
maxv =d-+ BTY. 


The matrix of coefficients, also called A is formed in the follow- 
ing way: 


d CG Co see Ch 

bh Au Ax Ain 

=}, 6b, An Az Aon 
5 Api Ap Aon , 


The input matrix A is declared [0: p, 0: n], min is the value of 
the objective function, psol is the solution vector for the primal 
problem, dsol is the solution vector for the dual problem, bool will 
be set to true if an optimal solution is found, otherwise bool will 
be set to false; 
begin integer array row [0:2Xp,0:p}, 

nocol [0:2Xp], index [0:n+p]; 

integer 2, 7, k, s, ¢; 

procedure subschema (k); intéger k; 

comment This procedure defines an admissible sequence of 

subschema Si41 Sez, -:* , assuming that S,, S2,--- S: , 
have already been defined; 

begin integer count; 


col [0:2Xp,0:n], norow, 


for 7 := 1 step 1 until p do if A[z,0] > 0 then go to 
WORK; 
for 7 := 1 step 1 until n do if A[0,j] < 0 then go to 
WORK; k:=0; goto RETURN; 
WORK: if 2X (k+2) = k then go to EVEN else go to ODD; 
EVEN: 
begin 
if k = 0 then 
begin 
for i := L step 1 until p do if A[z,0] > 0 then 
begin 
row[1,0] := 7; go to D3 


end; 


D1: 


D2: 


D3: 


row[1,0] := 0; go to D8 
end; 
for j := 1 step 1 until el do 


if A[row[k,0],col[k,j]] = 
go to RETURN; 
for 7 := 1 step 1 until norow|k] do 
if A[row[k,i],col[k,0]] > 0 then go to D2; 
go to RETURN; 
rowl(k+1,0] := row|k,t]; 
col{k+-1,0] := coll[k,0]; 
count := 0; 
for j := 1 step 1 until nocol[k] do 
if Alrow[k,0],col[k,j]] = 0 then 
begin 
count := count + 1; 
col[{k+1,count] := col[k,j} 
end; 
nocol{k+1] := count; 
count := 0; 
for i := 1 step 1 until norow[k] do 
if Afrow[k,i],col|k,0]] < 0 then 
begin 
count := count + 1; 
rowl[k+1,count] := row[k,2] 
end; 
norow|k+1] := count; 
=k+1; 
go to ODD 
end EVEN; 


0 then go to D1; 


ODD: 


Bi: 


B2: 


begin 

for 7 := 1 step 1 until norow|k] do 
if Afrow[k,z],col[k,0}] = 0 then go to Bl; 

go to RETURN; 

for j := 1 step 1 until nocol[k] do. 
if A[row[k,0],col[k,j]] < 0 then go to B2; 

go to RETURN; 

col[k+1,0] := coll{k,j]; 

row(k+1,0] := row[k,0]; 

count := 0; 

for 7 := 1 step 1 until norow[k] do 

if A[row[k,i],col{k,0]] = 0 then 

begin 
count := count + 1; 
row[|k+1,count] := row|k,t] 

end; 

norow|k+1] := count; 

count := 0; 

for j := 1 step 1 until nocol{k] do 

if A[row[k,0],col{k,j]] > 0 then 

begin 
count := count + 1; 
col{k+1,count] := col[k,j] 

end; 

nocol[{k+1] := count; 
=k+1; 

go to EVEN 

end ODD; 


RETURN: 


end subschema; 
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procedure pivot (s,t); value s,¢; integer s, t; 
comment The procedure pivot performs the usual pivot opera- 


tion on the matrix A, Als,¢| is the pivot element; 


begin integer (, j; 


Als,t] := 1/A[s,t]; 
for « := 0 step 1 until s — 1, s + 1 step 1 until p do 
begin 
Aft,t] := —Al[z,t] X Als,¢]; 
for j := 0 step 1 until t — 1,1 + 1 step 1 until n do 
if abs(Ali,j]+Ali,t]xXAls,j]) < abs(Ali,j]Xn—8) then 
A[i,j] := 0 
else Afi,j] := Alt,j] + Alt,t] x Als,J] 
end; 
for j := 0 step | until ¢ — 1, + 1 step 1 until n do 
Als,j] := Als,j] X Als,t]; | 


2 := indez[t]; 
index|t] := index[n+s}; 
andex{n+s] := 7 

end pivot; 


procedure pickapivot (k,s,t); 


integer k, s, ¢; 


comment The procedure pickapivot will choose a pivot ele- 


ment from S; or S;,-1 in a manner which will guarantee im- 
provement in the goal vector; 


begin real maz, test; 


if 2 X (k+2) = k then go to EVEN else go to ODD; 


ODD: 


Al: 


A2: 


A3: 


begin 

for j := 1 step 1 until nocol[k] do 

if Alrow[k,0],col[k,j}] < 0 then 

begin 
for 7 := 1 step 1 until norow[k] do 

if A[row|k,z],col[k,j]] > 0 then go to Al; 

$s := row|k,0]; 
t := collk,j]; 
k:=k—1; 
go to RETURN; 


end; 
for j := 1 step 1 until nocol{k] do 
if Afrow[k,0},col[k,j]| < 0 then 
begin 
for 1 := 1 step 1 until norow[k] do 
if Alrow|[k,t],col[{k,j]] > 0 then 
begin s := rowl|k,?]; 
t := collk,j]; 
max := Afrow|[k,i],col{k,0]|/A [row[k,z],colfk,7]}; 
go to A2 
end 
end; 
go to A3; 
fori := 171+ 1 step 1 until norow[k] do 
if A[row|k,7],col[k,j]] > 0 then 
begin 
test := A[row|k,i],col[k,0]|/A[row[k,2] ,col[k,7}]; 
if test > max then 
begin 
sc: 
max 
end 
end; 
k:=k—1,; 
go to RETURN; 


row|k,t]; 
:= test 


for j := 1 step 1 until nocol[k—1] do 
if Alrow[k,0],col[k—1,7]| < 0 then 
begin 
s := row[k,0]; 
t := col(k—1,7); 
max := Afrow(k—1,0],col(k—1,7]|/A[row[k,0],col[k—1,7]]; 


Ad: 
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go to A4 
end; 
3 := row[k,0|; 
t := col{k,0]; 
k:=k— 2; 
go to RETURN; 
for j := j + 1 step 1 until nocol[k—1] do 
if A[row|k,0],col[{k—1,7]] < 0 then 
begin . 
test := Aflrow[k—1,0],col[k—1,7]]/A[row|[k,0],col[k—1,7]] 
if test > maz then 
begin 
ti= 
max : 
end 
end; 
k:=k— 2; 
go to RETURN 
end ODD; 


col{k—1,j]; 
= test 


EVEN: 


Bl: 


B2: 


B3: 


begin 
for 1 := 1 step 1 until norow|k] do 
if Alrow[k,7],col{k,0]] > 0 then 
begin 
for 7 := 1 step 1 until nocol[k] do 
if Afrow|k,7],col[k,j]] < 0 then 
go to Bl; 
s := row[k,i]; 
t := coll|k,0]; 
k:=k—-1; 
go to RETURN; 


end; 
for 1 := 1 step t until norow|k] do 
if Afrow[k,i],col[k,0]] > 0 then 
begin 
for j := 1 step 1 until nocol{k] do 
if Alrow|k,z],col{k,j]] < 0 then 
begin 
s := row|k,t]; 
t := col[k,j); 
max := Afrow|k,0|,col{k,7]|/A[row|k,7],collk,7]]; 
go to B2 
end 
end; 
go to B3; 
for j := j +1 step 1 until nocol[k] do 
if A[row[k,zj,col[k,j]] < 0 then 
begin 
test := A[row[k,0],col[k,j]]/ A lrow|[k,2],col[k,7]]; 
if test > max then 
begin 
ti= 
max 
end 
end; 
k:=k—-1; 
go to RETURN; 
for 7 := 1 step 1 until norow[k—1] do 
if A[row|k—1,1],col[k,0]] > then 
begin 
sis 


col[k,j]; 
:= test 


row|k—1,2]; 
t := col[k,0]; 
max := Afrow(k—1,7],col[k—1,0]]/A[row[k—1,1],col(k,0}}; 
go to B4 

end; 


’ 
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s := row[k,0]; 
t := collk,0]; 
k:=k-—2; 


go to RETURN; 
B4: fori := 7+ 1 step 1 until norow|k—1] do 
if A[row[k—1,7],col[k,0]] > then 
- begin 
test := Afrow[k—1,2],col[k—1,0]]/A [row[k—1,7],col[k,0}]; 
if test > maz then 
begin 
s:= rowl[k—1,1]; 
max := test 
end 
end; 
k:=k — 2; 
go to RETURN 
end EVEN; 
RETURN: 
end pickapivot; 
for 7 := 1 step 1 until p + n do indez[i] := 7;. 
for 1 := 0 step 1 until p do row(0,i] := 7; 
for j := 0 step 1 until n do col[1,j] := 7; 
norow|0] := p; nocol{l] :=n; k:= 0; 
comment This is a check on the row constraints; 
NEXT PIVOT: 
for 1 := 1 step 1 until p do 
begin 
if A[z,0] < 0 then go to NEXTI; 
for j := 1 step 1 until n do 
if A[t,j] < O then go to NEXTI; 
comment Row constraints are incompatible; 


bool := false; 
go to FINISH; 
NEXTI: 
end; 


comment This is a check on the column constraints; 
for j := 1 step 1 until n do . 
begin 
if A[0,j] > 0 then go to NEXTJ; 
for 1 := 1 step 1 until p do 
if A[i,j] > O then go to NEXTJ; _ 
comment Column constraints are incompatible; 


bool := false; 
go to FINISH; 
NEXT: 
end; 
subschema(k) ; 
if k = 0 then 
begin 


comment. k = 0 indicates that the present solution is opti- 
mal. A[0,0] is value of the objective function; 
min := A[0,0}; 
for 7 := 1 step 1 until p + n do psol[i] := dsol[t] := 0; 
comment Find the primal solution vector; 
for 1 :=.1 step 1 until p do 
psollinder[n+-7]] := —Afz,0]; . 
comment Find the dual solution vector; 
for 7 := 1 step 1 until n do 
if indez|i] > n then 


dsoll{index|i]—n] := A[0,2] 
else 
dsol{index{i]+p] := A[0,]; 
bool := true; 
go to FINISH; 
end; 


pickapivot (k,s,t); 
ifs = 0 Vt =O0then 
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begin 
comment No feasible solution; 
bool := false; 
go to FINISH; 

end; 

pivot(s,t); 

go to NEXT PIVOT; 

FINISH: 


end Linearprogram 


CERTIFICATION OF ALGORITHM 285 [H] 

THE MUTUAL PRIMAL-DUAL METHOD 
[Thomas J. Aird, Comm. ACM 9 (May 1966), 326] 

H. Sparu (Recd. 13 Feb. 1967) 

Institut fiir Neutronenphysik und Reaktortechnik, 

Kernforschungszentrum, Karlsruhe, Germany 


The procedure Linearprogram has been translated into FORTRAN 
II and successfully run on the IBM 7074 Computer. The fol- 
lowing corrections had been made (the first two are merely 
typographical errors). 


1. P. 328, left column, 1 line after label B3: 
reads: 

if Alrow[k—1, ¢],col[k, 0]] > then 
should read: 

if Alrow[k—1, 7],col[k, 0]] > 0 then 


2. P. 328, left column, 1 line after label B4: 
reads: 

if Alrow[k—1, 7],col[k, 0]] > then 
should read: 

if A[row{k—1, 1],col{k, 0]] > 0 then 


3. P. 328, right column, after the end of the procedure pickapivot 
and before the label NEXTPIVOT there must be inserted the 
statement 
col[0, 0} := 0; 

Otherwise col[0, 0] has no assigned value when the procedure 
subschema is entered for the first time. 
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ALGORITHM 286 

EXAMINATION SCHEDULING [ZH] 

J. E. L. Peck anp M. R. Wiuiiams (Recd. 17 Mar. 1964, 
25 Jan. 1965 and 1 Mar. 1966) 

University of Alberta, Calgary, Alta., Canada 


procedure parittiion (incidence) graph of order : (m) into : (n) 
parts using weights : (w) bound : (maz) preassignment : 
(preassign) of number : (pren); 

Boolean array incidence; integer array w, preassign; 
integer m,n, maz, pren; 

comment This is an heuristic examination time-tabling pro- 
cedure for scheduling m courses in n time periods. It is essen- 
tially the problem of graph partitioning and map coloring. 

In the terminology of graph theory: Given a graph of m ver- 
texes with a positive integer weight w[z] at the ith vertex, 
partition this graph into no more than n disjoint sets such 
that each set contains no two vertexes joined by an edge, 
and such that the total weight of each set is less than the 
prescribed bound maz. 

We represent the graph as an mXm symmetric Boolean matrix 
incidence whose 7,jth element is true if and only if vertex 7 is 
joined to vertex j by an edge (if a student is taking both course i 
and course j), diagonal elements being assigned the value true. 
The weight assigned to the ith vertex (number of students in the 
ith course) is w[i]. We shall see below that preassignment is 
permitted. The number of courses to be preassigned is given in 
pren and the course preassign {t, 1] is to be placed at the time 
preassign [i, 2]. 

This procedure does not minimize the second order incidence 
i.e. a vertex 7 being assigned to the set k, where the set k—1 
contains a vertex 7 joined to7z (astudent writing two consecutive 
examinations), but this may be done by rearranging the sets 
after the partitioning is completed. The procedure contains its 
own output statements, but its driver should provide the input; 

begin integer array row [1:m], number [1:n]; 
integer 1, j, sum, course, time; 

Boolean preset, completed; 

INITIALIZE: preset:= false; 
for j := 1 step 1 until n do number [j] := 0; 
for 1 := 1 step 1 until m do 
begin sum := 0; 

for 7 := 1 step 1 until m do 

if incidence [1, 7] then sum := sum + 1; 

row [7] := sum 
end JNITIALIZE. Note that row [i] now contains the multi- 

plicity of, or number of edges at the vertex 7 (number 
of courses which conflict with the course 7). Of course since the 
incidence matrix is symmetric, less than half (¢ > j) need be 
stored. However, this procedure, for the sake of simplicity, 
is written for the whole matrix. Also note that row [2] will 
eventually contain the negative of the set number to which 
the 7th vertex is assigned (examination time for the 7th course) 
and number [7] will contain the weight of the jth set (number of 
candidates at time j). From here on we drop the allusions to 
graph theory in the comments; 

THE PREASSIGNMENT: forj := 1 step 1 until pren do 
begin comment preassignment of courses to times is now car- 
ried out. If pren = 0, then there are no preassignments; 

course := preassign [j,1]; time:= preassign [7,2]: 
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comment We now attempt to assign this course to the given 

lume; 
SCRUTINIZE: if row [course] < 0 then 

begin outstring (1, ‘This course’); outinteger (1, course) ; 
outstring (1, ‘is already scheduled at time’); 
outinteger (1, —row[course]); go to NEXT 

end; 

if number [time] + w[course] > max then 

begin outstring (1, ‘Space is not available for course’); 
outinteger (1, course); outstring (1, ‘at time’); 


outinteger (1, time); go to NEXT 
end; 
for 71 := 1 step 1 until m do 


if row [i] = — ttume then 
begin if incidence {1, course] then 
begin outstring (1, ‘course number’); 
oulinteger (1, course); outstring (1, ‘conflicts with’); 
outinteger (1,2); 
outsiring (1, ‘which is already scheduled at’); 
outinteger (1, teme), 
go to NEXT 
end if incidence 
end if row; 
SATISFACTORY: row[course] := —time; 
number [time] := number [time] + w [course]; 
preset := true; 
NEXT: 
end THE PREASSIGNMENT; 
MAIN PROGRAM: begin Boolean array avatlable [1:m]; 
integer next; 
procedure check (course); integer course; 
begin integer j: comment This procedure renders un- 
available those courses conflicting with the given course; 
for j := 1 step | until m do 
if incidence [course,j] then available {j] := false 
end of procedure check. 
For each of the » time periods we select a suitable set of non- 
conflicting courses whose students will fit the examination 


room; 
START OF MAIN PROGRAM: 
for time := 1 step | until n do 


if preset = number|time] > 0 then 

begin comment The preceding Boolean equivalence di- 
rects the attention of the program initially only to 
those times where prescheduling has occurred. We now 
determine the available courses (i.e. unscheduled and 
nonconflicting). If course 7 is already scheduled, then 
row(t] is negative; 


completed := true; 
for 7 := 1 step 1 until m do if row [i] > 0 then 
begin available [1] := true; completed := false end 


else available [1] := false; 
if completed then go to OUTPUT; 
if preset then 
begin comment Some courses were prescheduled at 
this time. It is necessary to render their conflicts un- 


available; 
for 7 := 1 step 1 until m do 
if row[i] = —time then check (2) 


end prescheduled courses. 


COLLECTED ALGORITHMS (cont.) 


We now select the available course with the most con- 
flicts. This is essentially the heuristic step and there- 
fore the place where variations on the method may be 


made; 
AGAIN: 
sum := 0; 
for 1 := 1 step 1 until m do 


if available [i] A row [1] > sum then 

begin next := 7; sum := row [t] end most conflicts; 
if sum > 0 then 
begin comment There exists an available course, so 

we test it (viz next) for size. If it does not fit we look 
for another; 

available [next] := false; 

if number [time] + w[next] > max then go to AGAIN; 

comment If we are here the course will fit so we use it; 

row [next] := —time; 

number [time] := number [time] + w[nezt]; 

check (next); go to AGAIN 
end sum > 0 

end of the time loop; 
if preset then : 
begin preset := false; go to START OF MAIN 
PROGRAM end 
In case of prescheduling this takes us back to try the re- 
maining time periods. 

If we have reached here with completed true then all 
courses are scheduled, but the converse may not be true, 
therefore; 

if — completed then 
begin completed := true; 
for 7 := 1 step 1 until m do 
if row [1] > 0 then completed := false 
end — completed and 
end of the main program; 
OUTPUT: if = completed then 
begin comment The following for statement outputs the 
courses that were not scheduled; 
outstring (1, ‘courses not scheduled’); 
for 7 := 1 step 1 until m do 
if row [2] > 0 then outinteger (1,2) 
end not scheduled. 
The following outputs the time period j, the number of stu- 
dents number([j] and the courses 7 written at time 7; 
TIMETABLE: outstring(1, ‘time enrolment courses’); 
for j := 1 step 1 until n do 
begin outinteger (1,7); outinteger (1, number[j]); 


for 1 := 1 step | until m do 
if row[t] = —j then outinteger (1,1) 
‘end j. 


The following outputs the courses, the times at which they are 
written, and their enrolment; 
outstring (1, ‘course time enrolment’); 
for 1 := 1 step 1 until m do 
if row [i]< 0 then outinteger (1, 7); outinteger' (1, row |t]); 
outinteger (1, wlt]) 
else 
begin outinteger (1,7); outstring(1, ‘unscheduled’); 
outinteger (1, w[z]) 
end 
end of the procedure 
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REMARK ON ALGORITHM 286 [H] 


EXAMINATION SCHEDULING [J. E. L. Peck and M. 
R. Williams, Comm. ACM 9 (June 1966), 433]. 


The 6th and 7th lines from the end of the procedure should be 
corrected by the insertion of a begin end pzir so that they read 
if row [7] < 0 then 

begin outinteger (1, 1); outinteger (1, row [t]); outinteger 

(1, w[z]) 


end 
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ALGORITHM 287 
MATRIX TRIANGULATION WITH 
INTEGER ARITHMETIC [F1] 
W. A. BLANKINSHIP 
(Recd. 19 May 1965 and 17 Sept. 1965) 
National Security Agency, Ft. Geo. G. Meade, Md. 


integer procedure INTRANK (mat, m, n, e); value m, n, e; 
integer m, n, e; integer array mat; 

comment This procedure operates on an m by n+e matrix whose 
name is mat and whose elements are integers. If mat is considered 
as composed of two submatrices U and V, where U comprises 
the first » columns of mat and V comprises the last e columns, 
then the effect of the procedure is as follows: 

(1) The rank of the submatrix U is returned as the value of 
INTRANK (designated by r in the following discussion). 

(2) mat is transformed by a sequence of elementary row opera- 
tions in such a manner that U is reduced to triangular form. 
Triangular form means that the leading, or first nonzero, ele- 
ment of each row appears to the right of the leading element 
of the preceding row. 

(3) It is easy to deduce from the proof in [1, p. 72, Th. 12] 
that for any set of k columns of mat, the greatest common divisor 
of all kth order minors selected from those columns is preserved. 
In particular, the product of all leading elements in U (final) 
(which are preserved as the first r elements of the local array a) 
will be equal to the ged of all nth order minors of U. 

(4) It is also easy to show, by the methods of [2] that if mat 
contains an m X m identity matrix, J, then J ends up as a record 
of the row operations actually performed, specifically: 


mat (final) = I (final) X mat (initial) 


(5) Since (3) implies that the rank of U is preserved, and the 
rank of U (final) is obviously equal to the number of nonzero 
rows that it contains, this number, r, is returned as the value of 
INTRANK. 

(6) Under the conditions of (4), it follows that the last 
m—r rows of I (final) comprise a complete, linearly independent 
set of left-annihilators (row-dependences) of the matrix U. 

The preceding properties are the basis of the claims for the 
procedure SOLVEINTEGER [Algorithm 288, Comm. ACM 9 
(July 1966), 514] which calls this procedure. 

INTRANE is designed to minimize the likelihood of overflow, 
the detection of which is left to the user. The best method is to 
include an identity matrix in mat and check the relation de- 
scribed in 4 (above). In many instances overflow doesn’t matter. 
In particular, if (a) the machine-compiler combination does 
integer addition, subtraction and multiplication modulo 27+1 
where 7 is the maximum integer representable in the machine, 
(b) division is done by the usual long-division algorithm, and 
(c) the answers sought are either known t« be less than 7 in 
absolute value, or only desired modulo 27-1, then, short of 
interference by an over-zealous monitor, the procedure will 
produce satisfactory results in spite of overflow. (Although the 
CDC 1604 does not satisfy (a), the same effect can be achieved 
by using a suitable subroutine in place of the multiplication 
sign in the procedure REDUCE.) 
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Overflow is generally dependent upon the magnitude of the 
greatest common divisor of all r X r minors contained in U, as 
this number, or a large divisor of it will appear in the rth row 
of mat (final) and as afr]. Thus if U is a square matrix whose 
determinant is a prime greater than the capacity of the machine, 
there is obviously no way to avoid overflow. Even if the deter- 
minant is composite, it is most likely that only small factors 
will be left on the diagonal and overflow will still occur. When 
elements of U are chosen from a flat-random population of 
integers in the closed interval [—138, +13] it has been found 
empirically that overflow almost never occurs for m=n=11 
when run on the CDC 1604. where i =, 2“—1. See also the dis- 
cussions on overflow in the procedure SOLVEINTEGER; 

begin integer 7, j,k, Q, T, topel, nextel, itop, inext; 
integer array a [1:m]; 
procedure FINDNEXT; 
begin nextel := 0; 
for k := 7 step 1 until m do 
if afk])>nextelA\k + ttop then 
begin nextel := alk] ; inert := k 
end | 
end; 
procedure SWAPROWS; 
begin for k := j step 1 until T do 
begin Q := — mat[t,k]; 
mat [1,k] := mat [ttop, k]; 
mat [itop, k] :=Q 

end; 

a{t] := a [ttop]; 

comment The last statement is a luxury which ensures that, 

at the end of the algorithm, a will contain the leading ele- 
ments of the first IVTRANK rows of mat; 
end; 
procedure REDUCE; 
begin Q := mat [itop,j] + mat [tnect, J]; 
for k := j step 1 until T do 
mat {itop,k] := mat [ilop,k] -Q X mat [inext,k]; 
a [itop] := if mat [ttop,j] < 0 then — mat [ttop,j] else 
mat |ttop,j}; 
end; 
4:= j:= tlop := 0; T := 2.+6; 
NEXTROW: if itop ~ 1 then SWAPROWS; 
1 := i+1;if 7 > m then go to OUT; 
NEXTCOL: j := j+1;if j > n then go to OUT; 
for k := 7 step 1 until m do 

alk] := if mat [k,j] < 0 then — mat [k,j] else mat [k,j]; 
comment Find the value and location of the largest element at 

or below position (i,j) of mat.; 

itop := i-1; FINDNEXT; 

if nextel = 0 then go to NEXTCOL; 
CONTINUE: itop := inext; topel := neztel; 

comment Find the value and location of the next largest 

element at or below position (1,7); 

FINDNEXT; 

if nextel = 0 then go to NEXTROW; 

comment Subtract row containing next highest element from 
that containing highest element. Repeat until highest element 
no longer ranks highest; 


COLLECTED ALGORITHMS (cont.) 


REDUCE; 
go to CONTINUE; 
OUT: INTRANK := i-1; 


end 


REFERENCES: 

1. Apert, A. A. Fundamental Concepts of Higher Algebra. U. of 
Chicago Press., Chicago, Ill., 1956. 

2. BLANKINSHIP, W. A. A new version of the Euclidean algorithm. 
Amer. Math. Month. 70 (1963), 742-745. 
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ALGORITHM 288 
SOLUTION OF SIMULTANEOUS LINEAR 
DIOPHANTINE EQUATIONS [F4] 
W. A. BLANKINSHIP 

(Recd. 19 May 1965 and 17 Sept. 1965) 
National Security Agency, Ft. Geo. G. Meade, Md. 


Boolean procedure SOLVEINTEGER (A) times: (x) equals 
the vector: (b) times a least integer: (d) where A is a matrix of 
dimension one to: (m) by one to: (n) Also find: (k) linearly 
independent auxiliary solutions and store in the matrix: (Y); 
value ™, n; 
integer m, n, d, k; 
integer array A, 7, b, y; 

comment Seeks the smallest positive integer, d, for which an 
integer solution to the equation Az = bd exists. 

If no: solution exists then SOLVEINTEGER is returned as 
false. Otherwise SOLVEINTEGER is returned as true and the 
values of d and the solution vector x are returned. 

If more than one solution exists then auxiliary solutions are 
returned in the matrix Y. The additional solutions are obtained 
by adding any linear combination of the first k rows of Y to the 
solution x. 

It is assumed that 

A is dimensioned [1:m,l1:7], 
x is dimensioned [1:7], 
b is dimensioned [1:m], 
Y is dimensioned [1:n,l:n]. 

Note that a diophantine solution exists if and only if d is 
returned as 1 and SOLVEINTEGER is returned as true. 

The procedure relies entirely on the action of the procedure 
INTRANK (Algorithm 287, Comm. ACM 9 (July 1966), 513). 
In particular, a matrix, mat, is formed by adjoining —} to the 
transpose of A, and then adjoining an (n + 1)th order identity 


matrix as follows: 
—b 
mat = Ar I 


INTRANK is then called upon to triangularize the first m+1 
columns of mat (reaching into the first column of J). The value of 
INTRANK willbe returned as an integer r which is 1 greater 
than the rank of A. Furthermore, as a consequence of properties 
(4) and (6) claimed under INTRANK, the last n—r+1 rows of I 
(final) will comprise a complete set of left annihilators of the 


matrix . Since only the first of these rows (if any) will have 


At 

a nonzero element in the first column, it follows that this first 
row expresses the value d and the desired solution (if d ¥ 0), 
and the succeeding n—r rows constitute solutions to the homo- 
geneous equation. If any linear combination of these last n—r+1 
rows were to yield a vector whose elements have a greatest 
common divisor not equal to 1, this would imply that 
det (I (final)) = det (I (initial)) ~ 1, which is false. This en- 
sures that d is the smallest value, as claimed. 

Overflow cannot occur in this procedure except as inherited 
from the procedure INTRANK. Overflow seems to be no problem 
when solutions (z,d) exist which are within the machine’s 
capacity to verify. Iam unable to fully explain this but nu- 
merous cases have been run on the CDC-1604 (47-bit integers plus 
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sign bit) with elements of A chosen randomly between —13 and 
+13 inclusive and for m=n=5 through 20 (10 or more cases 
each). Only a single failure (in the case m=n=20) occurred. 
These cases were devised by preassigning integer values to z, 
calculating 6 and then calling SOLVEINTEGER. It is difficult 
to devise significant test cases where dei(A) ~ d > 1 as this 
involves assigning values of x satisfying Ax=0 (mod d). This 
implies d must be a divisor of det (A) which must therefore be 
precalculated. But det (A) may overflow even though there may 
be ad for which solution is possible. When m=n the values of 
xz and d will usually be, according to Cramer’s rule, nth order 
determinants, or high divisors thereof, which may exceed 
machine capacity. When the elements of both b and A are chosen 
equiprobably between —a and +a, inclusive, it can be shown 
that the standard deviation of such a determinant is 
(n!a" (w+1)"/3)+. Since this is an upper bound for the expected 
absolute value of such a determinant, it may be used as a rule of 
thumb to predict overflow. If a=13, then for n=11 this value 
is 10" and for n=12 it is 10%, 1604 capacity is 10“1. In test 
cases, the procedure invariably succeeded for n=11 and in- 
variably failed for 2»=12. (Remember, we are referring to cases 
where 6 is chosen randomly so that an integer solution will 
hardly ever exist.) 
Note that if m=1, this algorithm solves the ged problem in 
much the same way as Algorithm 237 [J. E. L. Peck, Comm. 
ACM 8 (Aug. 1964), 481]; 


begin integer 7, j, rank, s; 


integer array mat [1:n+1, 1: m+n-+1]; 
for 7 := 1 step 1 until m do 
begin mat [1,7] := —6 [J]; 
for 7 := 1 step 1 until n do mat [i+], j] := A [7,7] 
end; 
for 7 := 1 step 1 until n+1 do 
for 7 := 1 step 1 until n+1 do 
mat [t, j-+m] := if i = 7 then 1 else 0; 
rank := INTRANK (mat, n+1, m+1, n); 
d := mat [rank, m+1]; 
if d = 0 then begin SOLVEINTEGER := false; go to OUT 
end; 
for 7 := rank step 1 until m do 
if mat frank, 1] 4 0 then 
begin SOLVEINTEGER := false: go to OUT 
end; 


SOLVEINTEGER := true; 


s:= ifd<Othen —lelsel; d:=s Xd; 
k:=n-— rank + 1; 
for 7 := 1 step 1 untilndo 
begin x[i] := mat [rank, m+i+1] X s; 
for j := 1 step 1 until k do 
Y [7,7] := mat [rank+j, m+i+1] 


end; 


OUT: 
end of procedure SOLVEINTEGER 
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ALGORITHM 289 

CONFIDENCE INTERVAL FOR A RATIO [G1] 
I. D. Hitz and M. C. Pixs (Recd. 8 Oct. 1965) 
Statistical Research Unit, Medical Research’ aaa 
London, England 


procedure Fieller (y, x, Vyy, Vey, Vaz, t, rl, 72, jullie: 
value y, 2x, Vyy, Vry, Vaz, t; 
real y, 2, Vyy, Vay, Vaz, t, rt, 12; 
Boolean inclusive; 

comment This procedure finds the (1—2Xa) confidence limits 
for 0/@ where y and x are estimates of @ and'¢ respectively, 
subject to random errors ‘normally’ distributed with zero means, 
variance estimates Vyy and Vz, and covariance estimate Vzy, 
each based on f degrees of freedom, and ?¢ is the upper (100Xa) 
percent point of the ¢ distribution on f degrees of freedom. 

At exit, if inclusive is true then the confidence interval in- 
cludes all values such that rl S$ value S r2, Otherwise the 
confidence interval includes all values such that — infinity < 
value S$ r2 and additionally all values such that rl S$ value S$ 
infinity. 

Where the interval is such that the value of 71 or r2 should be 
+ infinity, the procedure sets the value to + the largest available 
real number. 

Reference: E. C. Fretuer, A fundamental formula in the 
statistics of biological assay, and some applications, Quart. J. 
Pharm. Pharmacol. 17 (1944), 117-123; 

begin 

real c, r, infinity; 

inclusive := true; infinity := 10114; 

comment Set infinity to largest available positive real number; 
=if2; r:=2 T2—c¢%X Vaz; 

rki=axXy—-cX Vay; c:=y Tf 2—cX Vyy; 

ifr ~ Othen 

begin 

e:=rl fT 2—rXe; 

ifr>OAc<Othenc := 0; 

if c < 0 then go to unbounded; 

inclusive := r > 0; r:=1.0/r; ¢ := sqrt (¢); 

= (rilt+c) Xr; rl := (rl—c) Xr 
end else 
begin 
if rl ~ 0 then 
begin 
ce := ¢c/(2.0Xr1); 
if rl > 0 then 
begin 
rl :=c; 12 := infinity 
‘end else 
begin 
rl := —infinity; 12 :=c¢ 
end 
end else 
begin 
unbounded: rl := —infinity; 712 := infinity 

end 

end 
end Fieller 
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ALGORITHM 290 
LINEAR EQUATIONS, EXACT SOLUTIONS [F4] 
J. Booturoyp* (Recd. 7 Sept. 1965 and 21 Mar. 1966) 


U. of Tasmania, Hobart, Tas., Australia 
* Thanks are due to the referee for useful criticism and awkward 
test cases. 


procedure exacile(a, b, n, det); value n; integer n, det; 
integer array a, b; 
comment solves the matrix equation Ax = 6 for A = a [lin, 
1:n] and z, b[1:n] where the elements of A, b are small integers 
and the results are required as ratios of integers. The solution 
vector overwrites b and has values given by det A < x where det 
A is the determinant of A and z is the true solution vector. The 
user is warned that this procedure, of limited though useful 
application, is not a substitute for other well-established 
methods of solving general sets of linear equations owing to the 
inherent danger of integer overflow. This may occur in the 
reduction if the elements of the matrix are large or in the back 
substitution if the determinant and/or the elements of the right- 
hand side are large and may even occur with small elements and 
determinant if the order of the matrix and the nature of the 
equations combine to produce large solution values. Four 
devices intended to avoid integer overflow are incorporated. 
These are, (1) choice of column pivots having the smallest non- 
zero absolute value, (2) division by previous pivots (both after 
Fox, L., An Introduction to Numerical Linear Algebra, Oxford 
U. Press, New York, 1965, p. 82), and (3) the local procedures 
crossmpy and abdive which respectively evaluate integer expres- 
sions of the form (aXxb—cXd) + e anda X b + ¢ by performing 
the divisions before the multiplications. The output parameter 
det yields the determinant of A. If A is singular det := 0; 
begin integer piv, pivot, sum, arti, aki, i,j,k, pivi, ri, rk, m; 
integer array rf [l:n]; boolean zpiv; 
integer procedure tabs (it); value il; integer it; 
tabs := if tt < 0 then — it else it; 
integer procedure crossmpy(a)times:(6)minus:(c)times:(d)all 
over :(e); 
value a,b,c,d,e;_ integer a,b,c,d,e; 
begin integer qab,qcd,r,res; 
if iabs(a) > 1abs(b) then 


gab :=b+e; r:=b— gqabXe; 
gab := gab X a;res:=rXa 
end; 
if iabs(c) > tabs(d) then 
begin 
qed :=ec+e; ri=c— qd Xe; 
qed := ged Xd; res := res-rXxXd 
end 
else 
begin 
qed :=d+e, r:=d—qdXe; 
qed := qed Xe; res:=res—rXe 
end; 


290-P 1- 0 


crossmpy := gab — qed + res + € 
end crossmpy; 
integer procedure abdivc(a,b,c,sum); value a,b,c; integer 
a,b,c,sum; 
comment evaluates expressions of the form a X b + c by 
performing divisions before multiplications, assigning the 
quotient to abdive and accumulating the remainder in sum; 
begin integer q,r,temp; 
if iabs(a) > iabs(b) then 
begin g:=a+c; temp:=qXb; 
ri=a@—-cXq 
gq:=b+ oc; 
abdive := temp +q X17; 
sum := sum + (b—qXec) Xr 
end 
else 
begin g := 6 +c; temp := q XQ; 
r:=b—-—¢cXq; 
Q:=a+c; 
abdive := temp + q X71; 
sum := sum + (a—qXc) X r 
end 
end abdivc; 
procedure permb(b,r,n); value n; integer array b,r; inte- 
ger 7; 
comment rearranges the elements of b[1:n] so that b[¢] := 
b[r{z]], t= 1, ee n; 
begin integer 7,k,w; 
for 1 := n step —1 until 2 do 
begin k := r{z]; 


if k #7 then 
begin 
if k > 7 then begin k := r [k]; goto L end; 
w:= b[t]; oft] := b[k]; blk] :-= w 
end 
end 
end permb; 
m:=1; 
for i := 1 step 1 until n do r[i] := 7; 
for 7 := 1 step 1 until n do 
begin pivot := 0; zpiv := true; 
for k := 7 step 1 until n do 
begin aki := iabs(a[r{k],i]); 
if zpiv /\ aki > 0 V) aki # 0 A akt < iabs(pivol) then 
begin zpiv := false; pivt := k; pivot := a[r[k],7] end 


end; 

if pivot = 0 then begin det := 0; go to out end; 

rt := r[pivt]; rl[pivt] := r{t]; rlt] := ri; if pit ¥ 7 then 
m:=—-™; 


for k := 7+ 1 step 1 until n do 
begin rk := rik]; aki := alrk,z]; 

for) := 7+ 1 step 1 until n do 

al(rk,j] := if ¢ = 1 then a[rk,j] X pivot — aki X alrz,]] 
else crossmpy(a[rk,j],pivot,aki,al[rt,j],piv); 
b[rk] := if 7 = 1 then b[rk] & pivot — aki X blri] 
else crossmpy(b[rk] ,pivot,akt,b{rt] ,piv) 
end; 
piv := pivot 
end; 
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if m # 1 then 
begin det := aki := — alri,n); 
else det := aki := alrinl; 


for 7 := n — 1 step —1 until 1 do 
begin 77 := rt]; arit := alri,t]; 
sum := 0; piv := abdive(b[ri],akt,arii,sum); 
SUM := — SUM; 


b[ri] := — biri] end 


for j := 7 +1 step 1 until n do 


piv := piv — abdive(b[r[j]],alri,j],arti,sum); 
blr] := piv — sum + arit 
end; 
permd (b,r,n); 
oul: 
end exzactle 
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ALGORITHM 291 

LOGARITHM OF GAMMA FUNCTION [S14] 

ian PIKE AND I. D. Hitz (Recd. 8 Oct. 1965 and 12 Jan. 
Medical Research Council’s Statistical Research Unit, 
University College Hospital Medical School, London 
England 


? 


real procedure loggamma (z); 
value x; real z; 
comment This procedure evaluates the natural logarithm of 
gamma(x) for all z > 0, accurate to 10 decimal places. Stirling’s 
formula is used for the central polynomial part of the procedure.; 
begin 
real f, z; 
if z < 7.0 then 
begin f := 1.0; z:=a4— 1.0; 
for z := z+ 1.0 while z < 7.0 do 
begins := 2; f:=fXz 
end; 
e:=2+1.0; f := — In(f) 
end 
else f := 0; 
z2:= 1.0/r f 2; 
loggamma := f + (c—0.5) X In(z) — & ++ .91893 85332 04673 + 
(((— .00059 52380 952388 xz+ .00079 36507 93651) x 2 —.00277 
(7777 77778) Xz-++ 083338 33333 33333) /x 
end loggamma : 


REMARKS ON: 
ALGORITHM 34 [S14] 
AMMA FUNCTION 
[M. F. Lipp, Comm. ACM 4 (Feb. 1961), 106] 
ALGORITHM 54 [814] 
GAMMA FUNCTION FOR RANGE 1 TO 2 
[John R. Herndon, Comm. ACM 4 (Apr. 1961), 180] 
ALGORITHM 80 [814] 
RECIPROCAL GAMMA FUNCTION OF REAL 
ARGUMENT 
[William Holsten, Comm. ACM 6 (Mar. 1962), 166] 
ALGORITHM 221 [814] 
GAMMA FUNCTION 
[Walter Gautschi, Comm. ACM 7 (Mar. 1964), 143] 
ALGORITHM 291 [814] 
LOGARITHM OF GAMMA FUNCTION 
[M. C. Pike and I. D. Hill, Comm. ACM 9 (Sept. 1966), 
684] 
M. C. Pike anv I. D. Hitt (Recd. 12 Jan. 1966) 
Medical Research Council’s Statistical Research Unit, 
University College Hospital Medical School, 
London, England 
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Algorithms 34 and 54 both use the same Hastings approxima- 
tion, accurate to about 7 decimal places. Of these two, Algorithm 
54 is to be preferred on grounds of speed. 


Algorithm 80 has the following errors: 
(1) RGAM should be in the parameter list of RGR. 
(2) The lines 
if z = 0 then begin RGR := 0; go to EXIT end 
and 
if x = 1 then begin RGR :=1; goto EXIT end 
should each be followed either by a semicolon or preferably by an 
else. 
(8) The lines 
if x = 1 then begin RGR := 1/y; 
and 
ifz < — 1then beginy := y Xz; goto CC end 
should each be followed by a semicolon. 
(4) The lines 
BB: if x = —1 then begin RGR := 0; goto EXIT end 
and 
ifx > —1 then begin RGR := RGAM (x); goto EXIT end 
should be separated either by else or by a semicolon and this 
second line needs terminating with a semicolon. 
(5) The declarations of integer 7 and real array B[0:13] in RGAM 
are in the wrong place; they should come immediately after 
begin real z; 


go to EXIT end 


With these modifications (and the replacement of the array B 
in RGAM by the obvious nested multiplication) Algorithm 80 ran 
successfully on the ICT Atlas computer with the ICT Atlas 
ALGOL compiler and gave answers correct to 10 significant digits. 


Algorithms 80, 221 and 291 all work to an accuracy of about 10 
decimal places and to evaluate the gamma function it is therefore 
on grounds of speed that a choice should be made between them. 
Algorithms 80 and 221 take virtually the same amount of comput- 
ing time, being twice as fast as 291 at x = 1, but this advantage 
decreases steadily with increasing x so that at x = 7 the speeds are 
about equal and then from this point on 291 is faster—taking only 
about a third of the time at x = 25 and about a tenth of the time 
at x = 78. These timings include taking the exponential of log- 
gamma. 

For many applications a ratio of gamma functions is required 
(e.g. binomial coefficients, incomplete beta function ratio) and the 
use of algorithm 291 allows such a ratio to be calculated for much 
larger arguments without overflow difficulties. 


REMARK ON ALGORITHM 291 [S14] 
LOGARITHM OF GAMMA FUNCTION [M.C. Pike 
and I. D. Hill, Comm. ACM 9 (Sept. 1966), 684] 

Miss M. R. Hoare (Recd. 24 Aug. 1967) 
% C. Hoare and Co., 37 Fleet St., London, E.C.4. 


(1) if a2 < 7.0 then 
begin f := 1.0; z:=2— 1.0; 
for z2 := z+ 1.0 while z < 7.0 do 
would be better written as: 
if x < 7.0 then 
begin f := 1.0; 
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for z := 2, 2 + 1.0 while z < 7.0 do 
This avoids unnecessary operations. 


(2) In the final statement, loggamma should read loggamma 
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ALGORITHM 292 

REGULAR COULOMB WAVE FUNCTIONS 
Wa.terR Gautscut (Recd. 8 Oct. 1965) 

Purdue University, Lafayette, Indiana and Argonne 


National Laboratory, Argonne, Illinois 
Work performed under the auspices of the U. 8. Atomic Energy Commission. 
real procedure t(y); value y; real y; 
comment This procedure evaluates the inverse function t = t(y) 
of y = tlnt in the interval y 2 —1/e, to an accuracy of about 
4 percent, or better. Except for the addition of the case 
—l/e Sy S 0, and anerror exit in casey < —1/e, the procedure 
is identical with the real procedure ¢ of Algorithm 236; 
begin real p, z; 
if y < — .36788 then go to alarm 1; 
if y < 0 then ¢ := .36788 + 1.0422  sgrt(y + .36788) else 
if y S$ 10 then 
begin 
p := .000057941 & y — .00176148; p:= y X p+ .0208645; 
p:i=yX p— 129018; p:=y KX p+ 85777; 
t:=y X p+ 1.0125 
end 
else 
begin 
z:= In(y) — .775; p := (.775—In(z))/(1+2); 
p:=1/Q+p); t:=y X p/z 
end 
end ¢; 
procedure minimal (eta, omega, eps, lal, dm); 
value ela, omega, eps; real eta, omega, eps, lal, dm; 
comment This procedure assigns the value of 1’ to lal, accu- 
rately to within a relative error of eps, where {A,’} is the minimal 
solution (normalized by Ao’=1) of the difference equation 


2L+1 LP? + 7? 


Ra aga ee 


L+l L 
(For terminology, see [8].) If {Az} denotes the solution corre- 
sponding to initial values Xx = 1, A1 = w — 7, the procedure also 
assigns to dm the value \; — Ai’. The negative logarithm of 
|A1 — Ax‘| may be considered a measure of the “degree of mini- 
mality’” of the solution {z}; 

begin integer L, nu; real eta2, r, ra; 

eta2 := etaf 2; 


nu := 20; ra := 0; 
D1: r :=0; 
for L := nu step —1 until 1 do 
ris —(L 7 2+eta2)/(LX ((2XL-+1) xX omega— (L+1)Xr)); 
if abs(r—ra) > eps X abs(r) then 
begin 
ra: = 7; nu:s=nu+i10; goto Li 
end; 
lal := r; dm := omega — ela—r 


end minimal; 

procedure Coulomb (eta, ro, Lmaz, d, F); 
value eta, ro, Lmax,d; integer Lmaz, d; 
array F; 

comment This procedure generates to d significant digits the 


real eta, ro; 


regular Coulomb wave functions F,(y, p) for fixed = 0, p20, 


and for L = 0(1)Lmaz. (For notation, see [2, Ch. 14]). The 
results are put into the array F. Letting 
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2-7! 
~ (2L)'C1(n) 


the procedure first obtains f, as the minimal solution of the 
recurrence relation 


Mb+V +A, LED] | LD) 
(Generac 42 mer eee 


using for normalization the identity 


Qhe-m2 | (L + 1 + tn)| 


Cr(n) = (QL +! : 


tx F y(n, p); 


YL-1 = 0, 


per = So rafy dz = HEP YY? (—tw), 
L=d 


where P“(z) denotes the Jacobi polynomial of degree L. The 
parameter w is so chosen as to avoid undesirable cancellation 
effects. The final results F', are obtained recursively, by 


F x(n, p) = erfr, 


2L—1 2rn 4 
SS a 2 2 ;, — eee = ame Fe 
CL LOL + D [L? + p]icz (LZ = 1,2,3 ), © (= = :) 


A detailed justification of the process is to appear elsewhere 
({3]). For large positive y and p, the generation of the coefficients 
Az is subject to some loss of accuracy. If0 <= 7 S 20,0 Sp S 20, 
none, or only a few decimal digits will be lost, however. Writing 
the procedure minimal in double precision will resolve the 
problem for 7, p up to about 50, for normal accuracy require- 
ments. In any case, if higher precision is desirable, the procedure 
puts out a message to this effect. There is an error exit, if p < 0; 
begin integer L, nu, nul, mu, mul, 7, k; 
real epsilon, rol, eta2, omega, dl, sum, 7, r1, s, #1, (2; 
array lambda, lmin[0:1], Fapprox, Rr[{0:Lmaz]; 
switch coefficients := L2, L1, M1; 
if ro < 0 then go to alarm?2; 
if ro = 0 then 
begin 
for L := 0 step 1 until Lmaz do F[Z] := 0; 
go to L5 
end; 
epsilon := 5X 107 (—d); rol := 1/ro; 
tl := if eta > 0 then .5 X ro/eia else 0; 
omega := if eta < 1 then 0 else 
if 1 = 1 then 1.570796327/t1 else 
(1.570796327 — arctan(sqrt(1/tl—1)) + sqrt (t1X (1—21)))/t1; 
lambda [0] := Imin[0]} := 1; lambda[1] := omega — eta; 
sum := ro X exp(omegaXro); 
for L := 0 step 1 until Lmaz do Fapproz[L] := 0; 
dl := 2.3026 X d + 1.3863; 


ela2 := eta Tt 2; 


t1 := 1.3591 X ro; 
L := if Lmaz < tl then 1 + entter(tl) else Lmaz; 
tl := exp(1.5708Xela); s := sgrt(1+omega fT 2); 


tl := if omega = 0 then fl + 1/1 else 
exp (—etaXarctan(1/omega)); 
12 := omega + s; 
r := 1.3591 X ro X 12; 
s := (dl+In(tlX sqrt(i2/s))—omegaX ro)/r; 
nu := if s = —.36788 then entier(rXi(s)) else 1; 
nul := entier(LXt(.5Xd1/L)); 
nu := if nu < nul then nul else nu; 
nul :=1; 
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if omega = 0 then? := 1 else 7 := 2; 
LO: begin own array lambda[0 nu); 


comment Dynamic own array accineatinne are not per- 
mitted in most of the current ALGOL compilers. It can be 
avoided here, at the cost of extra storage, if lambda is de- 
clared as.an array of dimension [0:300] at the beginning of 
the procedure Coulomb. The same remark} applies to the 
array lmin declared later in the block labeldd M1; 

go to coefficients [1]; 

L1: minimal (eta, omega, 10—m, 11, d1); 

comment The letter m in 10—m is a place holder for a ma- 
chine-dependent integer, namely one less than the number of 
decimal digits carried in the precision mode (single, or 
double precision) of the procedure minimal. Similarly for 
the letter n in the next statement, which is a place holder 
for the integer m-+ 1. Both m and n are to be properly 
substituted by the user; 

if abs(d1 Xepstlon) = 1o—n then begin? := 1; go to L2end; 

outstring (1, ‘The requested accuracy cannot’ ‘be guaranteed. 

Use of the procedure minimal in a higher precision mode 

appears indicated’); 

1:= 38; mul := 0; 

M1: begin array Rra, lam[0:nu]; 


own array lmin[0:nu]; 
mu := entier (1.25Xnu); . 


for L := mul step 1 until nu do lam[L] :== 0; 
M2: r:=0; 
for L := mu step —1 until mul + 1 do 
begin 
ris —(L Tf 2+eta2)/(LX ((2XL+1) xX omega— (L+1)Xr)); 
if L S$ nu then Rra[L—1] := 
end; 


for L := mul +1 step 1 until nu do 
lmin[L] := Rra{L—1] X lmin[L—1]; 
for L := mui step 1 until nu do 
if abs(lmin[L]—lam[L]) > epsilon X abs(lmin[L]) then 
begin 
for k := mul step 1 until nu do lam[k] := 
mu := mu + 5; 
if mu <5 X nu then go to M2 else 
begin 
outstring (1, ‘convergence difficulty in the generation of 
the coefficients lambda sub L’); 


lmin[k]; 


go to L5 
end — 
end; 
lam[0] := —rl; lam[1]) := 1; 1 := d1/(1 + rl 2); 
for L := 2 step 1 until nu do 


begin 
lam[L] := ((2XL—1)XomegaXlam{L—1]-- 
((L—1) f 2+-eta2) x lam[L—2]/(L—1))/L; 


lambda[L] := Imin[L] + t1 & (lam[L]+r1xImin[L]}) 
end 
end; 
go to L3; 
L2: for L := nul step 1 until nu — 1 do 
lambda[L+1] := ((2XL-+1) x omegaX lambda[L}+ 
(L fT 2-Feta2)Xlambda[L—1]/L)/(L+1); — 
B38: r:=s:= 0; 
for L := nu step —1 until 1 do 
begin 


= eta/(L+1); 
= 1/(@XL—1)X (1/L+rol— (1+41 f 2)X1r/(2XL+3))); 
s:= 7 X (lambda(L]+s); 
if LZ S Lmaz then Rr{[L—1] := 
end; 
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F{0] := sum/(1+s); 

for L := 1 step 1 until Lmaz do F[L] := Rr{L—-1] K F[L—1]; 

comment The for-statement which follows is of purely 
precautionary nature, making sure that the results have the 
required accuracy. If speed is important, the statement 
may be omitted: 


for L := 0 step 1 until Lmaz do 
if abs(F(L]—Fapprox{L]) > epsilon X abs(F[L]) then 
begin 
for k := 0 step 1 until Lmaz do Fapproz{k] := F[k]; 
nul := mul := nu; nu := nu + 10; 
if nu < 300 then go to LO else 
begin 
outstring (1, ‘convergence difficulty in Coulomb’); 
go to L5 
end 
end 
end; 
tl := 6.2831853072 X eta; 
comment — The constant 27 in the preceding statement must be 
supplied more accurately if more than 11 significant digits are 
desired in the final results; 
if abs(i1) < 1 then 


begin 
2:=s:=1; L:=1; 
44;:L:=L+4+1; 


2 := t1 & 12/L;\ s := 8 + 12; 

if abs(t2) > epsilon X abs(s) then go to L4; 

s := sgrt(1/s) 

end 
else 
8 := sqgrt(tl/(exp(tl)—1)); 
F(0] := s X F{0]; 
for L := 1 step 1 until Lmaz do 
begin 
= (L—.5) X sqri(L fT 2+eta2) X s/(LX (£4 .5)); 
FIL) := s X FIL] 
end; 
L5: end Coulomb; 
comment The procedure Coulomb was tested on the CDC 3600 
computer, with the procedure minimal in single precision (un- 
less stated otherwise). The tests included the following: 

(i) Generation of @z(n, p) = [(Cr(n)e“" JF 1, p), L = 0(1)21, 
to 8 significant digits (d=8) for » = 0, —5(2)5, p = .2, 
1(1)5. The results were in complete agreement with values 

- tabulated in [4]. 

(ii) Computation of Fo(n, p), Fo’(n, ») = (d/dp)Fo(n, p) to 6 
significant digits for 7» = 0(2)12, p = 0(5)40, using 

= (p7+7)Fo — (1+n?)*f1. Comparison with [5] 
revealed frequent discrepancies of one unit in the last 
digit. In addition, beginning with y = 8, the results became 
progressively worse for p = 30, 35, 40, being correct to 
only 2-3 digits when 7 = 12, p = 40. With the procedure 
minimal in double precision, however, these errors dis- 
appeared, 

(iii) Computation to 8 significant digits of Fo(n, p), Fo’ (n, p) for 
= 2n, p = .5(.5)20(2)50. The results agreed with those 
published in [1] for p S 16, but became increasingly in- 
accurate for larger values of p. Complete agreement was 
observed, however, when the procedure minimal was 
operating in the double-precision mode; 
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The following changes are suggested to eliminate the need for 
multiple-precision arithmetic. The underlying theory will be 
published in Aequationes Math. 


1. Remove the procedure minimal. 


2. Change the statement (near the bottom of page 794) 
nu := if s > —.36788 then entier (rXt(s)) else 1 
to read: 
nu := if s > —.36788 then entier (rXt(s)) else r/2.7183 


3. Change the statement labeled Z1 to read 
Tl: dl := 2 X eta/(exp(2XetaXarctan(1/omega))—1) 
and rephrase the comment following this statement to read: 


comment The letter n in the following statement is a place 
holder for a machine-dependent integer, namely, the number 
of (equivalent) decimal digits carried in the mantissa of 
floating-point numbers. This integer must be properly sub- 
stituted by the user; 
4. Omit the output statement 


outstring (1, ‘The requested accuracy cannot be guaranteed. 
Use of the procedure minimal in a higher precision mode ap- 
pears indicated’); 
5. Insert the statement 
rl := Ilmin{l); 
between the two lines 
end; 
and 
lam[0] := —r1; lam[1]:=1; 1 := dl/(1+r1T 2); 
6. Change the line (near the middle of page 795) 
s := sgrt(tl/(exp(tl)—1)); 


to read 
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s := exp(—tl/4)/sqrt((exp(tl/2) — exrp(—tl/2))/t1); 
(These statements are mathematically equivalent, but the lat- 


ter delays overflow as the value of t] hecomes large.) 


7. If large values of |x| and/or p, say exceeding 100, are contem- 
plated, it may be necessary to increase the dimension of the ar- 
rays lambda and lmin (if they are declared at the beginning of 
the procedure Coulomb) and to correspondingly increase the 
upper limit for nu in the conditional clause 

if nu < 300 
near the top of page 795. The user, in this case, should also be 
prepared to encounter overflow difficulties, especially in the 
later entries of the array lam. . 


With these revisions the algorithm produced correct results on 
the CDC 3600 for the three tests described at the end of Algorithm 
292. It was also used (with input parameter d = 10) to compute 
miscellaneous values of Fo(n, p) and $o(7, p) published in a paper 
by C. E. Fréberg (Numerical treatment of Coulomb wave func- 
tions. Rev. Mod. Phys. 27 (1955), 399-411). The results are sum- 
marized in the table below. 


Algorithm 292 (revised) 


9 .357085680i9 — 1 
120366249119 — 3 
50 120 Fo = 2.0025993491) — 1 
100 4 ®y = 5.7229851541921 
200 1 @) = 7.2366047321914 


Fréberg 


9 .357085519 — 1 
1.203665: — 3 
2.002551) — 1 

5.7229851551921 
7. 236604731 1014 


n p 


9 50 Fo 
50 80 Fo 


ll 


In addition, the algorithm was run (with d = 6, and lambda, 
lmin being declared as arrays of dimension [0 : 600]) for » = 
—200(20)200, p = 20(20)200, Imax = 0(50)100. Apparently valid 
results were obtained as long as 7 < 100, though no tables seem to 
exist to check these results against. Overflow was observed in some 
of the entries of the array lam, for 7 = 120, p > 120; 7» = 140, 
p = 60; » = 160, p > 40; and » = 200, p > 20. (For the pur- 
pose of this test, a number is considered to overflow if its modulus 
exceeds 10300.) 
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Both the original and the revised version of the procedure 
Coulomb have been translated into Fortran and tested on a Con- 
trol Data 6600 computer. It became apparent that the following 
changes in the original version are necessary: 


1. The second sentence in the comment following the statement 
labeled L1 in procedure Coulomb should be replaced by: 
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Similarly for the letter n in the next statement, which is a place 
holder for the number of digits carried in the main program. 


2. The second statement after this comment (beginning “out- 
string .. .’’) should be changed to 


if abs(dl X epsilon) < 10-m-1 then 

outstring (1, ‘The requested accuracy cannot be guaranteed. 
Use of the procedure minimal in a higher precision mode ap- 
pears indicated.’); 


Since the original version of Coulomb is to be superseded by 
the revised one (see Remark), detailed test results are given here 
only for the latter. Most of the tests have already been described 
in the Algorithm itself or in the Remark. Those presented here 
are obtained on a different machine, and the results differ slightly 
in some cases from the previous ones. The tests included the 
following: 


(i) Generation of @z(n,e) = [Cx(n)p™*1]-} Fula), L = 0(1)21, 
to 8 significant digits (d = 8) for » = —5(1)5, p = .2(.2)5. The 
results were in complete agreement with the values tabulated in 
[4] of Algorithm 292. In the cases where more than 8 significant 
digits are tabulated, the highest discrepancy was one unit in the 
last digit; e.g. for L = 0, 7 = 5, p > 3.4, 10 to 11 correct signifi- 
cant digits have been found. 


(ii) Computation of Fo(n,p), Fo'(n,p) = (d/dp)-Fo(n,p) to 5 sig- 
nificant digits for » = 0(2)12, p = 0(5)40, using F’5 = (p™ + »)Fo — 
(1+7*)+ Fi. Comparison with [5] of Algorithm 292 revealed 
frequent discrepancies of one unit in the fifth digit. For » = 2, 

= 40 the discrepancy in Fp is 80 units of the fifth digit. This is 
probably an error in the table. 


(iii) Computation to 8 significant digits of Fo(y,), Fo'(y,p) 
for p = 24, p = .5(.5)20(2)50. The results agreed completely with 
those published in [1] of Algorithm 292. 


(iv) Computation (with d = 10) of the miscellaneous values of 
Fo(y,p) and ®o(y,9) given in the Remark on Algorithm 292. The 
results obtained differ slightly from those given' in the Remark. 
In the worst case, 7 = 50, p = 120, the pune paney is 16 units 
in the tenth digit. 


(v) After changing the dimensions of the arrays lambda, lmin 
into {0:600] and adjusting the upper limit for nu to 600 (see Re- 
mark on Algorithm 292), Fx(y,o) has been calculated with d = 
for n = —200(20) 200, p = 20(20) 200, Lmax = 0(50)100 merely to 
test whether overflow occurs or not. The following table indi- 
cates where overflow, indefinite results, or convergence difficulties 
in the generation of \, (see Algorithm 292) have been observed. 


n pa 
20 200 
40 200 
60 180 
80 100 

100 80 
120 60 
140 60 
160 60 
180 40 
200 40 


(vi) Calculation of F'z(n,) for L = 0(50)100 with d = 7 for y = 1, 
e = 10%,» = —20(1)—1. Underflow occurred for,L = 50, n < 5; 
L = 100, n < 2. The valid results have been compared with those 
obtained by summation of the power series for z(n,p) (see [4, 
(1.3) and (4.4)] of Algorithm 292). Agreement has been found 
to 7 significant digits. 


(vii) Calculation of z(y,9) to 13 significant digits (d=13) for 
p = 5, 7 = 0(1)5, LZ = 0(10)100. The results have been compared 
with those obtained by summation in double-precision mode 
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(27 digits) of the power series mentioned in (vi). Agreement was 
found to at least 12 significant digits. The constant 27 in the 
statement él := . on page 795 of Algorithm 292 was supplied 
here with 14 significant digits, as required by the comment. 


Acknowledgment. I wish to thank Professor Gautschi for use- 
ful remarks and comments. 
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The revised version of the procedure Coulomb was translated 
into IBM System/360 Algol and tested on an IBM 8/360 Model 75 
Computer. When 7 > 12 overflow problems were encountered in 
the generation of intermediate arrays. These were due to the 


‘smaller exponent range of the 8/360, —64 < exp < 63. The follow- 


ing changes, while not completely eliminating the overflow prob- 
ems, greatly alleviate them. 


Insert real scale; 
after begin integer L, nu, nul, mu, mul, 1, k; 


‘Insert scale := 16 ft (—57); 


comment This value of scale is appropriate for the IBM 8/360. 
On a machine with a different base and a different exponent 
range, say a < exp < B, the value of scale should be base T 
(6-8); 

between end; 
and epsilon := 5% 10 7 (—d); 


Change lambda [0] := lmin [0] := 1; lambda [1] := omega-eta; 
sum := ro X exp (omega X ro); 
to lambda [0] := scale; Imin [0] := 
lambda [1] := (omega-eta) X scale; 
sum := ro X exp(omega X ro) X scale; 


Change lImin [L] := Rra[(L — 1] X lmin [L — J; 
to begin 
= Rra [L — 1] X lmin [L — 1); 
comment The following constant 5 fT (—10) is approximately 
2X base T o/scale, where base is the base of the floating- 
point number system and a < exp < 8B; 
lmin[L] := if abs(t1) > 5 T (—10) then 


t1 else 0 
end; 
Change lam [0] := —r1; lam [1] := 1; 
to lam [0] := —rl X scale; lam [1] := scale; 


Change lambda [L] := lmin [L] + #1 X (lam [L] +11 X lmin [L}) 
to lambda {L] := Imin [L] X scale + tl X 
(lam [L] + rl X scale X lmin [LZ]) 
Change F[0] := sum/(1+s); 
to F[0] := sum/(scale+s); 
The authors gratefully acknowledge the referee’s helpful sug- 
gestions. 
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TRANSPORTATION PROBLEM [H] 

G. Bayer (Recd. 9 July 1965 and 22 Aug. 1966) 
Technische Hochschule, Braunschweig, Germany 


procedure transp1 (m, n, inf, c, a, b, x, kw); value m, n, int; 
integer m,n, inf, kw; integer array c, a,b, 2; 

comment transpl is derived from Algorithm 258, transport, 
[Comm. ACM 8 (June 1964), 381] in order to reduce running time 
by about 50 percent. The following notation is used. 
c m, n-matrix of unit costs, 
a array of quantities available, 
b array of quantities required, following the usual descrip- 

tion of the transportation problem, 


inf greatest positive integer within machine capacity, 
x  m,n-matrix of flows, 
kw optimal total costs (computed by procedure). 


c, a, 6 are disturbed by the procedure. Sum of a[t] = sum of b[7]. 

Multiple. solutions are left out of account. [Ref.: G. Hadley, 

Linear Programming, Reading, London, 1962, p. 351]; 

begin integer 7, j, u, v, k, [, s, t, gd, h, p, ctj, xij, at, bj, lsvj, nlvi; 

Boolean: zg; 

integer array g, listu, nlv[l:m], 7, Ustvil:n], Is[0:m+n—l], 
nl{limxXn], lsv[0:n]; 

comment in the for-statement u := --- after s33, operate on 
all pairs 7, 7 with clz,j] = 0. To win time the array nl supervises 
those zeros; the j-indices of zeros in row 7 are kept in 
nli{(t—1)Xn+1] --- nl[nlv[z]]. In the for-statement v := --- 
after s33, operate on all pairs 7, 7 with x[7,7] ~ 0 (and c[z,j]=0). 
ls supervises those essential zeros, the 7-indices of essential 
zeros in column j are kept in Is{lsv[7j—1]+1] --- Is[lsv[j] 
Procedure in adds to list ls, procedure out takes out from list 
ls an essential zero in position 1, 7; 

procedure in; 

begin 
lsvj := lso[j]; 
for ¢ := lsv[n] step —1 until /sv7 do Is[t-+1] := Is{t]; 
for ¢ := j step 1 until n do Isv[t] := lsv[t] + 1; 
Is[lsuj1] := 7 

end ; 

procedure out ; 

begin 
lsuj := [soy]; 
for t := Isv[7—1]+1 step 1 until lsvj do 


begin 
if ls[t] + 7 then go to next; 
S$ := ¢t; go to ex; 
next: 
end =; 
ex: 
for { := j step 1 until n do Isv[t] := lsv[t]—1; 
Isoj := Isv[n]; 
for { := s step 1 until I[svj do Is{t] := [s[t+1] 
end ; 
for 1 := 1 step 1 until m do 


for j := 1 step 1 until n do 2[7,j] := 0; 
for < := 1 step 1 until m do nlv{i] := —1)Xn; 
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lsv{O] : 
for j := 
begin 
listo[z] := 1; 
lsv[7] := 0 
end ; 


0; 
1 step 1 until n do 


sl: 


kw := gd := 0; 
comment gd is the defect, i.e., the sum of quantities not yet 
transported; 


for 1 := 1 step 1 until m do 
begin 
h := inf; 


for 7 := 1 step 1 until n do 

if c(t, j] < hthenh := cft, J]; 
for j := 1 step 1 until n do 
begin 

cj = c[t, J) := elt, J = h; 

if cyj = 0 then 


begin 
listv[7] := 0; 
nlot := nlo[t] := nlv[t] + 1; 
ni[nlot] := 9 
end 
end; 


kw := h X aft] + kw 
end see next comment; 
for j := 1 step 1 until n do 
begin 
if listv[j] = 0 then go to neztj1; 
h := inf; 
for 71 := 1 step 1 until m do 
if c{t, j] < then h := c[t, 9]; 
for 7 := 1 step 1 until m do 
begin 
cij := elt, J] := eft, 7] — h; 
if cij = 0 then 
begin 
nlvi := nlo[i] := 
ni[nlvi] := 7 
end 


nlofi] + 1; 


end; 
kw :=h X bj] + kw; 


nextj1: 


end; 
comment in step 1 the usual reduction of the matrix of costs 
is achieved (dual problem), zeros are listed in nl; 


82: 


for 7 := 1 step 1 until m do 
begin 
at := alt]; nlot := nlvfi]; 
for u := (@—-1) X n+ 1 step 1 until nlvi do 


begin 
if at = 0 then go to nexti2; 
j i= nl[ul; 
bj := 619]; 


if bj = 0 then go to nextj4; 
h := a{t, j] := if at < bj then ai else bj; 
at := at — h; b[7] := b7 — h; in; 
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nextj4: Z:= rij]; J := gle; 
end; if j = inf then 
nexti2: begin 
alt] := at; gd := gd + at if a[t] < hthenh := ali]; go to re 
end; end; 
comment applying a usual rule to all zeros we get an initial if x[i, 7] < Athen h := z{t, J]; 
tlow (restricted primal problem) in step 2; go to mark; 
sbl: re: 3 
if gd = 0 then go to s6; comment flow h along the labeled path thus reduces defect 
comment problem is solved if defect has become zero; without changing total costs. Correct list of essential zeros 
832: if necessary. Start labeling anew, optimizing the restricted 
for j := 1 step 1 until n do 7[j] := 0; primal problem; 
k := 0; j= p; bj] := dL] — hs alt] := alt] — A; 
for 7 := 1 step 1 until m do gd := gd —h, 
begin rel: 
if a[i] ~ 0 then i:= rij); vif := aft, 9]; cli, j] = cj +h; 
begin = 4% if x17 = 0 then in; 
k :=k +1; listulk]) := 7; gle! := inf j= gltl; 
end if 7 = inf then go to s31; 
else g[i] := 0 vig = alt, j] = aft, 7) — A; 
end; if 277 = 0 then oul; 
comment 7[{j] = 0 if column j is unlabeled, = 7 if labeled go to rel; 
from row 7. g[¢] = 0 if row 7 is unlabeled, = inf if a[d] ~ 0, sd: 5 
i.e., a[z] is a possible source of flow. The indices 7 of labeled comment step 5. Flow is maximal. To find a new solution to 
rows are kept in listu[1] --+ listu[k]. In step 3, consisting of the dual, take the part of matrix ¢ which is the intersection 
step 32 and step 33, the maximal flow is found by the la- of labeled: rows and unlabeled columns, reduce matrix in a 
beling process. Labeling ends in only two ways: (a) a columnj certain way; 
with b[j] > 0 has been labeled: go to step 4, (b) all labeling is k:=0; li=n-+); 
done, but a positive flow has not been found: go to s5; for j := 1 step 1 until n do 
833: begin 
L:= 0; if r{j] = 0 then 
for u := 1 step 1 until k do begin 
begin k:=k+41; listo[k] = 7 
4 := listulu]; nlvt := nlv[e]; end 
begin else 
j := nifs); ; begin. 
if r[j] ~ 0 then go to nextj5; . L:=1t—1; lswfl] = 9 
rij] := 4; Li= tL +1; Ustofi] := 3; end 
if b[j] > 0 then go to s4; end list all labeled resp. unlabeled columns in lisiv; 
nexty5 : h := inf; 
end for 7 := 1 step 1 until m do 
end in each newly labeled row, see listu, look for zeros in begin 
unlabeled columns, list them in listv; if g[i] = 0 then go to nexli6; 


for s := 1 step 1 until k do 


if | = 0 then go to $5; 
begin 


k := 0; 
= u j := listv[s]; 
ne eR ene nn if clé, j] < A then h := cli, jl 
egin 
. : ; ; end; 
j := listv[v]; lsv7 := lsofj]; ei 
a i fevlj—1-+1 step 1 until leyj do end find minimum h in partial matrix; 
besin for 7 := 1 step 1 until m do 
a := Is{s]; begin 
if g[¢] = 0 then zg := glt] ¥ 0; nlvi := @—-1) Xn; 
begin for s := 1 step 1 until n do 
git] := 9; k:=k+1; begin 
listu[k] := 7 7 := listo[s]; 
end if zg then cij := c[t, j] 
end else 
end_ in each newly labeled column, see listv, look for essential cij := eft, J] := eft, J] +h; 
zeros in unlabeled rows, label these rows, list them in lstu; if cij = 0 then 
if k = 0 then go to 85; begin 
go to 833; nlot = mlvt + 1; 
comment step 4. A column j with b[j] has been labeled, b[,] io = J 
is the sink of a possible positive flow, the path of which is ae 


indicated by labels. Find the minimum flow h along the path; for s := 1 step 1 until k do 
n:= fj]; pi= 7; begin 
mark: 
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j= lisiv{s]; 
if zg then cij := ct, j] := ct, 7] —h 
else cij := c[t, j]; 
if cij = 0 then 
begin 
nlvi := nlvi +1; 
nl[nlvi] = 9 
end 
end; 
nlofi] := nlor 
end reduction, add h to labeled columns, subtract h from 
labeled rows. Construct new list of zeros; 
kw :=h X gd + kw; 
comment total costs for new solution of dual; 
go to 832; 
s6: ; 
comment solution, defect has become zero; 
end 


CERTIFICATION OF: 


ALGORITHM 258 [H] 
TRANSPORT 

[G. Bayer, Comm. ACM 8 (June 1965), 381] 
ALGORITHM 293 [H] 
TRANSPORTATION PROBLEM 

[G. Bayer, Comm. ACM 9 (Dec. 1966), 869] 


Leg 8S. Sims (Recd. 21 Feb. 1967 and 17 Mar. 1967) 
Kates, Peat, Marwick & Co., Toronto, Ont., Canada 


Both of these algorithms were coded in Extended Atego. 60 
and tested on a Burroughs B5500. Three problems were solved 
correctly, one of them being of medium size (55 X 167). On this 
larger problem transpl was found to be about twice as fast as 
transport. 

In coding and debugging transpl three apparent errors were 
found. In the right-hand column on page 870), after line 27 which is 

t:= listulu]; nlvt := nld[z]; 

a line is missing. This line should read 

for s := (i—1) X n+ 1 step 1 until nlvi do 
Also in the right-hand column, the line 

s4: ; 
should be inserted ahead of line —12, which begins 

comment Step 4. A column j with b[j] has been labeled, b[j] 
On page 871, in the left-hand column, line --22 which reads 

for s := 1 step 1 until n do 
should read 

for s := | step 1 until n do 
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TRANSPORTATION PROBLEM [G. Bayer, Comm. 
ACM 9 (Dec. 1966), 869] 

G. Bayer (Reed. 24 Aug. 1967, 30 Oct. 1967 and 8 Jan. 
1968) 

Technische Hochschule Braunschweig, Germany 


KEY WORDS AND PHRASES: 
programming 
CR CATEGORIES: 5.41 


transportation problem, linear 


There is an error in the algorithm concerning the number of 
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essential zeros which can be greater than m + n — 1. An example is: 


cs kb 1-21 
21141 
12 2 2 

a: 4 6 1 
24 2 8 


The difficulty may be overcome in two ways. 
1. Declare array ls by: 
integer array [s[0 :mXn] 
instead of: 
integer array l|s[0 :m-+n—1] 


2. As the case of more than m + n — 1 essential zeros will seldom 
arise in practical problems, it may be enough to have 
Is[0 : 2Xm+n—1]; 

(It is assumed that m<n). To make sure that list Js does not over- 
flow, add a statement to procedure in and remove inf from the 
value part. 

procedure in; 

begin if Isv[n] = 2 X m+n — 1 then 

begin inf := 0; go to s6 end; 


Thus in the.case of overflow of ls, the procedure is left with inf = 0 
signalling that the optimum has not been reached and that the 
solution is possibly incomplete. (One would wish then to run the 
procedure anew with more space for /s and using the solution ob- 
tained as an initial flow. This would only be possible by partly re- 
writing the algorithm.) 
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ALGORITHM 294 

UNIFORM RANDOM [G5] 

W. Murray SrromMe (Recd. 26 May 1966) 
Carnegie Institute of Technology, Pittsburgh, Pa. 


real procedure UNIFORM (A, B, X0, C, M); 
value A, B, X0,C,M; real A, B; integer X0, M,C; 
comment This procedure generates the next uniformly distrib- 
uted pseudorandom number on (A, B). The ‘‘multiplicative 
congruential’’ method is used, namely 
; Znu = CX Z,(mod M) 
M and C are chosen to maximize the period and minimize the 
correlation of the sequence generated. To accomplish this, M 
should be as large as possible subject to the following conditions 
fi]: 
(i) C~-<M and suitably chosen. 
(ii) The expression X := X0/M followed by X := X XC 
within the procedure must be evaluated with no roundoff 
or truncation error for every positive integer XO < M. 
For most applications, M and C' may be chosen as follows. Let D 
denote the number base of the machine (e.g., D = 10 for a decimal 
machine) and n the number of significant D-digits of a real vari- 
able of the ALGOL implementation. Then let M = D* and 
C = Dv — gwhere k = entier ((2n + 1)/3). For D = 2, 4, 5, 8, 10 
or 16 and D* > 100, q = 3 is suitable. In general, choosing M@ 
and C in the above fashion will guarantee that condition (ii) be 
met, but this should be verified for the particular implementation. 
See [1] for a more detailed discussion on the choice of C and M. 


The first time UNIFORM is used in a program, X0 should be a 
positive integer less than and relatively prime to M. Subsequently, 


use X0 = 0. 

UNIFORM was translated into C.I.T’s ALGOL-20 and run on 
a CDC G-20 computer with M = 2% and C = 2" — 3. Some scaling 
was required to prevent roundoff in the multiplications since the 
G-20 is a 14-octal digit machine rather than a 42-bit binary one 
(the scaling would have been unnecessary had we used M = 83, 
C = 8 — 38, but the period of the sequence would have been 
shorter). In order to test the algorithm, the following statistical 
tests were performed for sequences of pseudorandom numbers 
generated on (—1, 1). 

1. Distribution. We divided (—1, 1) into 10 equal subintervals. 
Denote by f; the number of numbers of a sequence of length 1000 
in the 7th interval. The statistic 


10 
x? = 01 >> (f; — 100)? 
i=l 


was computed for each of 62 different such sequences. For numbers 
drawn from a uniform distribution, this statistic has a x?-dis- 
tribution, with 9 degrees of freedom [2]. The results obtained were 
entirely consistent with the hypothesis that the numbers were 
distributed uniformly. 

2. Independence. Define the serial correlation (lag 7) by 


Lae pet ; 
N Dd, XiXns —( 2, Xs 
i=1 t=1 


Mn Ai ONT — fap NE 
> xe (> x 
N j=1 


N — 171 
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If X;, Xi; are independent, then for large N, p; is distributed 
normally with mean —1/N and standard deviation 1/VWN [3]. 
pi was estimated for 16 different sequences each of length 5000. 
The average, —0.004, and the standard deviation, 0.011, are con- 
sistent with the hypothesis of independence. p; was estimated for 
8 different sequences each of length 9900 for j = 1,2, --- , 49. 
These results were consistent with the hypothesis that X; , Xi4; 
are independent for these values of 7. 
The Von Neumann ratio test [4] for 16 sequences of length 1000 
also yielded results consistent with the hypothesis of independ- 
ence. The results of other tests for many values of C and M using 
this method are outlined in [1]; 
begin own real X; 

if XO ~ Othen X := X0/M; 

X:=XXC; X := X — entter (X); 
UNIFORM := X X (B— A) +4 
end procedure UNIFORM 


REFERENCES: ; 

1. Huu, T. E., anp Dospeui, A. R. Random number generators. 
SIAM Rev. 4 (July 1962), 230-254. 

2. YAMANE, T. Statistics, An Introductory Analysis. Harper «& 
Row, New York, 1964, pp. 584-593. 

3. AnpEeRsoN, R. L. Distribution of the serial correlation coeffi- 
cient, Ann. Math. Stat. 18 (1942), 1-18. 

4, Hart, B. I. Tabulation of the probability for the ratio of the 
mean square successive difference to the variance. Ann. 
Math. Stat. 13 (1942), 207-214. 


COLLECTED ALGORITHMS FROM CACM 


ALGORITHM 295 

EXPONENTIAL CURVE FIT [E2] 

H. Sparx (Reed. 29 Apr. 1966) 

Institut fiir Neutronenphysik und Reaktortechnik, 

Kernforschungszentrum Karlsruhe, Germany 

procedure expfit (x,y, p, n, ca, ce, eps, a,b,c, 8, fx, extt); 
value n, ca, ce, eps; integer n; real ca, ce, eps, a, b,c, 8; 
label exit; array z, y, ‘D, fz; 

comment If the method of least squares is used to determine 
the parameters a, b, c of a curve f(x) = a + be~* which approxi- 
mates n data points (2; , y:) with associated weights p; , then 


s(a, b, ¢) = Dy pilys—Ses))? (I) 
tml 
must be a minimum. A necessary condition for this is that 
Os Os os 
—=-=-—=0. II 
0a ob oc TD) 


Usually (see [1]) it is attempted to solve this system of nonlinear 
equations by an iterative method which is based upon the 
linearization of f in (II). and the convergence of which depends 
on the given starting values for a, 6, c. 

A simpler and more effective way which can always be chosen 
if there is only one nonlinear parameter in f is the following: 
It is always possible to eliminate a = a(c) and b = b(c) from the 
equations ds/da = 0 and ds/ab = 0 and to put these expressions 
into s/dc = 0. This gives only one equation in one variable 


WS GOtOO 20. 
oc 


If a value c’ is calculated with F(c’) = 0 then the corresponding 
values of a and 6 are obtained from a’ = a(c’) and b’ = b(c’). 
The following procedure is based upon this idea which is fully 
treated: in [2]. It allows to find a triple (a, 6, c) which solves (IT) 
if you make available a nonlocal procedure Rootfinder which is 
able to get a zero c of a function F(c) in the interval [ca, ce] with 
the relative accuracy eps, if sign (F (ca)) # sign (F (ce)) otherwise 
leaving to the global label exit. As the above F(c) is discontinu- 
ous at c = 0, [ca, ce] must not contain 0. [The speed and efficiency 
of the algorithm depend on the choice of the procedure Root- 
finder -~-REF.| 
Most. of the symbols are self-explanatory. The array fz finally 
contains the values a + be~*:; 
begin integer7; reali, u,v, w, fc, h0, hl, h2,h3, h4, h5, h6, h7; 
procedure fronc (c, fe); valuec; reall c, fe; 
comment computes for a given c the value fe = 
a = a(c), b = b(c); 
begin AO := Al := h2 := h3 := h4 := 5 := h6 := h7 := 0; 
for 7 := 1 step 1 until n do 


F(c) and 


begin 

t:= afi]; w:= exp(—cXt); v:= pit]; w:= ye); 
hO:=hO+v; Al:=Al+uXv; h2r=h2+uxKuxX v; 
AR3:= AB +ox& w; h4s= h4+uxro XK w; 


AD :=hb +t xKuxX v; 
h6 :=h6-+tX uu XK v; 


end 7; 


AT :=h? ~-uxXvxKXwxrxt 
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t:= 1.0/(hOXh2—hAl Xh1); 
6b := tX& (AOKA4—h1Xh3); 
end fronc; 
Rootfinder (frone, ca, ce, eps, c, exit); t:= 0; 
for 1 := 1 step 1 until n do 
begin 
v:= frit] := a+b X exp(—cXa2[t}); 
t:=¢+ pli] XvXv 
end 7; 
s:= t 
end expfit 


a:= 1X (h2Xh3—h1Xh4): 
fo := h7 + (h5Xa+h6Xb) 


piso yi); 


REFERENCES: 
1. Deity, G. R. Algorithm 275, Exponential curve fit. Comm. 
ACM 9 (Feb. 1966), 85. 
2. OBERLANDER, S. Die Methode der kleinsten Quadrate bei 
einem dreiparametrigen Exponentialansatz. ZAMM 48 
(1963), 493-506. . 
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GENERALIZED LEAST SQUARES FIT BY 

ORTHOGONAL POLYNOMIALS [E2] 

G. J. Makinson (Reed. 30 Sept. 1965 and 29 Aug. 1966) 

University of Liverpool, Liverpool 8, England 

procedure LSFITUW (f, x, w, m,k, st, p, l,al, be, s); value m,k; 
integer m,k; array f, w, st, p, z, al, be, s; Boolean 1; 

comment LSFITUW accepts m observations 27], f[i],7 = 1, 2, 

,m each with its associated weight w[z]. The weights should 

be eaaviied inversely proportional to the standard error of the 
observations. 

x[{1] should be algebraically the smallest abscissa and xz[m] the 
largest. 

The coefficients of the best fitting polynomial of degree k or 
less, where k < m — 1, are obtained in p[0:k], with p[0] the 
independent term. s7[0:k] contains the measures of the goodness 
of fit of each polynomial tested. The sz[t] are examined suc- 
cessively and the best polynomial is chosen of ‘degree h if h is 
the first value of ¢ found such that sz[h] < s¢[h+1] provided 
that s2[j] > 0.6 X st{h] fork 2 7 > A+ 1. If h is the first value 
of ¢ found such that sz[h] < s¢{h+1] but then aj is found that 
satisfies sz[7] < 0.6 X sz(h] forj >h-+ 1 the procure will choose 
the polynomial of degree j as best fit. 

If an h such that s¢[h] < s¢[h+1] is not found then the poly- 
nomial is chosen of degree k. LSFITUW uses the procedure 
POLYX (a, b,c, d, n) [Algorithm 29, Comm. ACM 3 (Nov. 1960), 
#04] to transform its results from the interval (—2,2) to the 
interval (x[1], z[m]). 

Normally ! should be false but if the choice made is to be 
overruled after consideration of the sz and the best fitting 
polynomial is required to be strictly of degree k, then / should be 
true. 

The programming is as outlined by G. E. Forsythe, [J. Soc. 
Indust. Appl. Math. 6 (1957), 74-88] and originally programmed 
by J. G. Mackinney [Algorithm 28/29, Comm. ACM 8 (Nov. 
1960), 604]. LSFITUW incorporates remarks made by D. B. 
MacMillan [Comm. ACM 4 (Dec. 1961), 544]. 

The variables in the paper of Forsythe have been abbreviated 
as follows. 

allt] is alpha], be{z] is beta|z], sz{z] is (stgmalt]) Tf 2, s{z] is 
the same, om is omega, lw is wlt, 7], tw is wi[z+1, i+1], 
ctp|j| is the coefficient of x fT j in This (the current) 
orthogonal polynomial, clp[j] is the coefficient of x fT 7 in 
the Last (previous) orthogonal polynomial, cp[j] is the co- 
efficient of z 7 j in the most recently calculated polynomial 
of best fit, tp[¢] is the value at z[7] of the present orthogonal 
polynomial, lp[i] is the value at [2] of the last orthogo- 
nal polynomial, simin is the least value of (stgmafi]) T 2 
found so far, swx becomes false as soon as (sigmafi+1]) T 
2 > (sigmafi]) f 2 one time, comp becomes true if swx 
is false and some (sigma[t]) T 2 < 0.6 X stmin; 
begin integer 7, j; real du, delsq, om, lw, tw, simin, a, b; 
array ctp, cpsave, cpl0:k], clp|—1:k], lp, tpll:m], 

Boolean swz, comp, 

comment initialization; 


Ll: 


296-P 1- 0 


for 7 := Ostep 1 until k docp{t] :=0; simin := 0; 
swxz := true; be[0] := clpl0] := clp[—1] := delsq := om := 0; 
clp[0] := 1; tw:=0; comp := false; 
for 7 := 1 step 1 until m do 
begin 
delsq := delsq + wit] X fit] T 2; tplz] := 1; 
lpi] := 0; om := om + vit] x fli); twii= 
end; 
s[0] := cp[0] := om/tw; delsq := delsq — s[0] X om; 
st[0] := delsq/(m—1); 
comment transformation of abscissa; 
a := 4/(almj—az[l]); b:= —2—-—aX afl]; 
for 7 := 1 step 1 until m do z[t] := a X 2[t] +6 
comment main computation loop; 
fori := Ostep 1 untilk — 1do 
begin 
du := 0; 
for j := 1 step 1 until m do du := du + wy] X aly] X tplz] T 2; 
alfi + 1] := du/tw; lw:= tw; tw:=o0m:=0; 
for j := 1 step 1 until m do 
sat ies 


tw + wit] 


= belt] X lplj]; 
tel := tplyl; 

ip{j] := (elj]—al{é+1]) X tly] — du; 

= tw + wij] X tply] T 2; 

om := om + wlj] X fly] X tly] 
end; 
be[i+1] := tw/lw;  s[t+1] := om/tw; 
delsq := delsq — s[i+1] X om; si[t+1] := delsq/(m—1—2); 
if 1 then go to LI; 
if — comp then 


begin 
if swx then 
begin 
if sz[¢+1]= sz[z] then 
begin 
comment higher power appears not to improve fit; 
swx := false; 


simin := si[t]; 
for j := 0 step 1 until k do 
epsavelj] := eply] 
end; 
go to L1 
end; 
if si{¢+1] < 0.6 X simin then comp := true; 
comment termination of main loop at superior fit to first. 
one found; 
comment recursion to obtain the coefficients cp of the 
polynomial of best fit of degree 7 + 1; 


for j := 0 step 1 until 7 do 
begin 
du := clp[j] X be[z]; 


elp[j] := ctply]; 
etp{j] := elp[j—1] — al[t+1] X clp[j] — du; 
ep(j] := eplj] + slt+1] X ctply] 
end; 
ep[t+1] := s[¢+1]; ctp[t+1] := 1; clp[¢+1] :=0; 
if = (comp VV swx) then 
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begin 
ifi = k - 1 then 
for j := 0 step 1 until k do 
cplj] := cpsavely] 
end 
end 
end end. of main computation loop. Transformation of poly- 
nomial follows; 
POLY X (a, b, cp, p, k) 
end LSFITUW 


REMARK ON ALGORITHM 296 [E2] 
GENERALIZED LEAST SQUARES FIT BY 
ORTHOGONAL POLYNOMIALS 
[G. J. Makinson, Comm. ACM 10 (Feb. 1967), 87] 
G. J. Makinson (Recd. 21 Mar. 1967) 
University of Liverpool, Liverpool 3, England 


The second sentence of. the first comment should read ‘‘The 
weights should be provided inversely proportional to the square 
of the standard error of the observations.’’ 


instead of 


“The weights should be provided inversely proportional to the 
standard error of the observations.”’ 


CERTIFICATION OF ALGORITHM 296 [E2] 

GENERALIZED LEAST SQUARES FIT BY 

ORTHOGONAL POLYNOMIALS [G. J. Makinson, 
Comm. ACM 10 (Feb. 1967), 87] 

WayneE T. Watson (Recd. 11 Feb. 1969 and 21 Mar. 1969) 

Service Bureau Corp., Development Laboratory, 111 West 
St. John Street, San Jose, CA 95113 


KEY WORDS AND PHRASES: least squares, curve fitting, 
orthogonal polynomials, three-term recurrence, polynomial re- 
gression, approximation, Forsythe’s method 

CR CATEGORIES: 5.18, 5.5 


LSFITUW was compiled and tested in CALL/360:PL/I. No 
modifications were made to the algorithm, and the computations 
were made in long precision (about 15 significant floating point 
digits). In. addition, POLY.X {2} was used to transform the results 
of LSFITUW from the interval (—2,2) to the interval (a ,vm). 

To generally test the algorithm, several small sets of data were 
used with LSFITUW and the results were compared with those 
obtained from an-independently written polynomial curve fitting 
algorithm which does not use the method of orthogonal poly- 
nomials. Only polynomials of degree less than 5 were used to fit 
the data. Agreement between coefficients and standard errors was 
good. 

As a more comprehensive test of the algorithm, all experiments 
that could: be duplicated from the article by Ascher and Forsythe 
[1] were performed; a slight modification to LSFITUW was re- 
quired to transform the data to the interval (—1,1) instead of 
(--2,2). Briefly, the experiments included: 

(1) For certain equally spaced data, a cornparison of the a; and 
6; calculated by the program against those values of a; and 8; 
obtained from known formulas (@;=0 for equally spaced data). 

(2) A fit of the function f(z) = | x | over the interval (—1,1) 
for equally spaced data for polynomials of degree as high as 30. 

(3) A fit of the function f(z) = e* for unequally spaced data 
inside the interval (—1,1) for polynomials of degree as high as 32. 


296-P 2- R3 


The results of experiment (1) showed that LSFITUW produced 
values of 8; differing only in the last significant digit (15) from 
those calculated by the known formula. The values of a; produced 
were in the range of the floating point round-off error (107). 
The results of duplicating experiments (2) and (3) were better 
than those reported in {1] because of the greater precision used in 
the calculations (about 10.8 versus: about 15 significant floating 
digits). While conducting the last two experiments, it was noted 
that for data values of x symmetric about the origin, the value of 
b in the transformation equation x = at + 6 may be computed to 
be a number in the floating point round-off range instead of exactly 
zero. When fitting polynomials of a sufficiently high degree, this 
may cause an underflow at line 4 of POLYX, the transformation 
routine. The user may find it desirable to branch on an underflow 
in POLYX and resct 6 to zero. 

To check the computations of the ox? obtained by the recursive 
definition of o.? used in the algorithm, the o.? were compared 
with results computed directly from the equation 


ant = 2 (Ss uales))*/(m—b-1) (*) 


where yz is the best fitting polynomial of degree & for the data 

x; , f;. Experience with the algorithm indicates that a loss of 

accuracy in computing o,? occurs at smaller values of k when using 

the recursive definition than when using (*). If the values of_o,? 
are of importance to the user, he may find it useful to compute 
them using (*) instead. 

A comprehensive test of the algorithm’s feature which uses the 
ox? to automatically select the best fitting polynomial was not 
made, but the feature did work properly for the polynomials used. 
In connection with this feature, the user should be aware, though, 
of the possible difficulty mentioned above in computing o,? ac- 
curately using the recursive definition. In this case, the user 
should not expect the algorithm to select the best fitting poly- 
nomial. This difficulty was experienced several times while testing 
the algorithm, but was circumvented by using (*) to calculate 
ox. In order to detect a possible loss in accuracy, the oz? should 
be examined carefully or compared with those obtained by (*). 

Comprehensive tests were not made using weights; however, 

no problems were encountered with a moderate usage of this 

feature. 

REFERENCES: 

1, AScHER, M., aND Forsyture, G. E. SWAC experiments on the 
use of orthogonal polynomials for data fitting. J. ACM 6 
(Jan. 1958), 9-21. 

2. MacKinney, Joun G. Algorithm 29, Polynomial transformer. 
Comm. ACM 8 (Nov. 1960), 604. 
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ALGORITHM 297 

EIGENVALUES AND EIGENVECTORS OF THE 

SYMMETRIC SYSTEM (A—AB)X = 0 [F2] 

J. BootHroyp (Recd. 19 Aug. 1965, 7 Feb. 1966, 1 Aug. 
1966, and 14 Nov. 1966) 

University of Tasmania, Hobart, Tas., Australia 


procedure eigensolve(a, b, x, n, nondef); 
label nondef; array a, b, 2; 

comment solves the equation (A — \B)X = 0 for symmetric 
A, B ina, b[1:n, 1:n] provided one of these is either positive or 
negative-definite. B is decomposed symmetrically so that B 
= DLL’ and the equation transformed to (C—dAI)Y = 0 where 
C = (L)"A(L’)" is symmetric and Y = L’X. If B is negative- 
definite (A—(—d)(—B))X = 0 is solved. If B is neither positive 
nor negative-definite the original equation is rearranged as 
(B—(1/A)A) X = 0 and solved as such for positive-definite A or 
as (B—(—1/d)(—A))X = 0 for negative-definite A. Failure to 
achieve one useful transformation from the four possibilities 
leads to exit via the label nondef. 

The procedure calls procedure symmetric QR 2 [P. A. Businger, 
Algorithm 254, Eigenvalues and eigenvectors of a real 
symmetric matrix by theQR method. Comm. ACM 8 (April, 
1965), 218-219] to evaluate the roots and vectors of 
(C—rI) Y = 0. That procedure leaves untouched the 
strictly upper triangle of C. In conformity with this, 
eigensolve preserves the strictly upper triangles of A and B. 
If, before entry to eigensolve, the user saves the diagonals 
of A, B, both these arrays may, if necessary, be fully 
restored after exit. 

On exit from the procedure the eigenvalues occupy the diago- 
nal elements a[z, 7] with the eigenvectors in corresponding 
columns of z[l:n, 1:n]; 

begin integer 1, j, k, jless1, iless1, adi, bdi; 
real t, sum, xij, length; 
Boolean recip; 


value n; integer n; 


procedure LCHOLESKI(a, n, fail); value n; integer n; 
label fail; array a; 
comment performs the symmetric decomposition A = LL’ 


for positive definite A in a[l:n, 1:n]. The lower triangle of A 
is overwritten by L. The strictly upper triangle of A is intact. 
For nonpositive-definite A the procedure exits via label pa- 
rameter fail; 
begin integer 7, j, k, jless1; 
real ajj, ajk, aij; 
jlessl := 0; 
for j := 1 step 1 until n do 
begin ajj := alj, jl; 
for k := 1 step 1 until jless1 do 
begin ajk := alj, kl]; 
ajj := ajj — ajk X ajk 
‘ end; 
if ajj < 0.0 then go to fail; 
ajj := alj, j] := sgrt(aji); 
for 1 := 7 +1 step 1 until x do 
begin aij := alt, 7]; 
for k := 1 step 1 until jlessl do 
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aij := atj — alt, k] X alj, kl; 
alt, j] := atj/ajj 
end; 
jlessl := 7 
end j 
end LCHOLESKIT, 
comment scan diagonals of A, B setting adi, bdi respectively 
to +1, —1, 0 if the diagonal elements are all positive and non- 
zero, all negative or neither. Save the diagonal of B in X; 
adi := sign(a[l, 1); 
afl, 1] := ¢ := Ofi, 1]; 
bdit := sign(t); 
for 7 := 2 step 1 until n do 
begin ¢ := alt, 1]; 
if ¢ = 0.0 V (t>0=adi<0) then adi := 0; 
aft, 7] := t := bt, a]; 
if t = 0.0 V (t>0=bdi <0) then bdi := 0 
end; 
rectp := false; comment prepare to solve (A—AB)X = 0; 
if bdi = 0 then go to swap; comment B is nondefinite; 
if bdi < 0 then 
begin comment prepare to solve (A—(—A)(—B))X = 0; 
for 7 := 1 step 1 until n do 
for j := 1 step 1 until 7 do b{z, 7] := — bdfz, J] 
end; 
newtry: LCHOLESKI(b, n, swap); 
go to ok; 
swap: if recip then go to nondef; 
recip := true; 
comment prepare to solve (B—(1/A)A)X = 0; 
if adi = 0 then go to swap; comment to escape, since A is 
also nondefinite; 
if adi < 0 then 
begin comment prepare to solve (B—(-—1/A)(—A))X = 0; 
for 7 := 1 step 1 until n do 
begin b[7, 7] := aft, 7]; aft, 7] := az, 2]; 
for 7 := 7 +1 step 1 until n do 
begin b[j, i] := —alz, 73 alj, a] := bfz, jl end 
end 
end 
else 
begin comment prepare to solve (B—(1/A)A)X = 0; 
for 7 := 1 step 1 until n do 
begin b[i, i] := afi, 7]; alt, 7] := x [%, 2]; 
for j := 7+ 1 step 1 until n do 
begin b[j, 7] := aft, 7]; alj, 7] := Blt, j] end 
end 
end; 
go to newtry; 
comment form C=(L)"A(L’)"! by LX = A, CL’ = X. C re- 
places A; 
ok: jlessl := 0; 
for j := 1 step 1 until n do 
begin iless1 := 0; 
for 7 := 1 step 1 until j do 
begin sum := a[j, 7]; 
for k := 1 step 1 until zless1 do 
sum := sum— ak, j] X bft, kl]; 
sum := 2[t, j] := sum/b{t, 2]; 
for k := 1 step 1 until jless1 do 


ll 
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sum := sum — (if k<i then ali, k] else alk, 7]) X bl, k]; 
alj, 7] := sum/blj, J]; 
tless1 := 1 
end; 
jlessl := j 
end; 
comment global call of symmetric QR 2 to solve (C—AI)NY = 0. 
symmetric QR 2 includes a built-in precision tolerance. For 
use with ezgensolve this constant should be changed to the 
value appropriate to whatever computer is used. Those in- 
terested in using JACOBI [Thomas G. Evans, Algorithm 85, 
JACOBI, Comm ACM 6 (April 1962), 208] in place of symmetric 
QR 2 may do so by copying the lower triangle of A to the upper 
triangle and making suitable changes to accommodate the 
parameter rho of that procedure before it is called. In this case 
the strictly upper triangle of A will not be preserved on exit 
from ezgensolve; 
symmetric QR 2 (n, a, x); 
comment change the Y vectors, now in z by L’ X = Y and 
normalize to unit length; 
for j := 1 step 1 until x do 
begin length := 0.0; 
for 7 := 1 step —1 until 1 do 
begin sum := x{t, J]; 
fork :=i-+1 step I until n do 
sum := sum —bd[k, 7] X a[k, 9); 


xij := alt, J] := sum/bl2, 2); 
length := length + xt7 X xij 
end; 


length := sqrt(length); 
for i := 1 step 1 until n do 2[1, j] := 2[2, j]/length 

end; 

comment take the reciprocals and/or change the signs of the 
roots if necessary; 

for 7 := 1 step | until n do 

if recip then 


begin 
if adi <0 then afi, 7] := —1.0/a[z, 2] 
else alt, 7] := 1.0/a[t, 7] end 

else if bdt <0 then ali, 7] := —afz, 7] 


end eigensolve 
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ALGORITHM 298 

DETERMINATION OF THE SQUARE-ROOT OF A 
POSITIVE DEFINITE MATRIX [F1] 

H. SpAtu (Recd. 20 Sept. 1966) 

Institut fiir Neutronenphysik und Reaktortechnik 
Kernforschungszentrum Karlsruhe, Germany 


procedure WURZEL(A, B, N, theta, eps); 
value N, theta, eps; integer N; real theta eps; array A, B; 
comment Let A be a symmetric positive-definite matrix of the 
order N. Further let Amin be the smallest and Amax be the greatest 
eigenvalue of A. 
It is known [1] that for all 6 with 0 < @ < 1 the sequence 
Bret = Be + c(A—B;,2), Bo = 2cA (1) 
with 
0 
= 
24/ Kinax 
converges to V/ A. The rate of convergence of the above se- 


quence is given by the rate of convergence to zero of the se- 
quence 


te = (1 — 0Wdunin/max)*. (2) 
As || A || = oadAmax with a > 1, we set 
_ 
a = oJ Tal 
(In the program we choose || A || = max; ||>0x | au |). Then 


the sequence (1) with c = c; converges for all 6; with 


0<a <Va= VI All max 


and therefore in any case for 6, with 0 < 6: < 1. Because of (2) 
it is favorable to choose @ close to 1 and 6; close to Wa, respec- 
tively. If nothing at all is known about a, the optimum is to 
choose 6; close to 1. The computing time is proportional to 
f(@i)N*, where f(@:) decreases as 6, increases. 
Meaning of symbols in the formal parameter list: 
A = A[1:N,1:N] must be symmetric and positive-definite. 

A is not destroyed after leaving WURZEL 
B = B{l:N, 1:N] contains ~/A when WURZEL is left 
N is the order of A and B 
theta = @, is an input parameter as described above 
eps is an accuracy parameter. The iteration is stopped when 


ket k 
max | bf? — of | < eps; 
cy 


real delia, s,c; array bb[1:N]; 
c := 0; 


begin integer 7, 7, k; 
comment determination of c¢; 
for 1 := 1 step 1 until N do 
begin s := 0; 
for j := 1 step 1 until N dos := s + abs(A[z, j)); 
c:=ifc <sthenselsec 

end; 

c:= 6 X theta/sgqrt(c); 

comment now Bo is set; 

for 7 := 1 step 1 until N do 

for j := 1 step 1 until N do 
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Bii, j] := BU, i] := 2.00 ¢ X A [t, J]; 
comment start of iteration; 
REPEAT: delta := 0; 
. for 7 := 1 step 1 until N do 
begin for j := 7 step 1 until N do 
begin s := 0; 
for k := 1 step 1 until N dos := s — Blt, k] X Bik, 9); 
bbij] := Blt, j] +e X (Alt, J] +8) 
end; 
for j := 7 step 1 until N do 
begin s := abs(B[i, 7] — bb[y)); 
if s > delta then delia := s; 


Bt, j] := bb[y] 
end 
end; 
for 7 := 1 step 1 until N — 1 do 


for j := 7+ 1'step 1 until N do Blj,] := Ble, 7); 


if delta > eps then go to REPEAT 
end WURZEL 


REFERENCE: 

1. Basusxa, I., PrAgER, M., AND VitAseK, E. Numerical Processes 
in Differential Equations. John Wiley & Sons, Ltd., London, 1966, 
p. 31 ff. 
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DETERMINATION OF THE SQUARE ROOT OF A 

POSITIVE DEFINITE MATRIX [H. Spath, Comm. 
ACM 10 (Mar. 1967), 182] 

B. J. Duke (Recd..26 Apr. 1967, 16 July 1968 and 10 Oct. 
1968) 

Department of Chemistry, University of Lancaster, Bail- 
rigg, Lancaster, England 

KEY WORDS AND PHRASES: matrix, symmetric matrix, 


positive definite matrix, matrix square root 
CR CATEGORIES: 5.14 


Algorithm 298 has been tested in ICT Atco. and used suc- 
cessfully on a number of matrices. One minor modification seems 
advisable. To avoid the procedure looping if an error occurs in 
its call, a maximum number of iterations should be set, with the 
procedure exiting through a label if this number is reached. The 
modifications to the procedure are obvious. 

Comparisons with an alternative method using a binomial 
series are interesting. If 


aie 
Al} 


fiat? ve Feeney ee 
{en {r-5e~ §e = 5" oh, 


For convergence, 


Cz=TI A, 


At 


6< 2 || Al] /Amax , 
and thus a sufficient condition is @ < 2. 
Optimum convergence is for 
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Pont cat 


2 |All 
max + Amin ; 


Thus 
1<a < Oop < 2a 


where a = |{A||/Amax . The choice of 0 is difficult, as the method 
is particularly slow for values of @ not close to Oop: . Unless other 
information is available, it seems preferable to choose @ in the 
range 1.4-1.8. 

Both methods have been tested on over 30 positive definite 
matrices of order 2 to 12 arising from physical problems. In about 
half the cases studied all diagonal elements of A were equal; two 
typical examples are illustrated below. There was no significant 
difference between the behavior of these matrices and matrices 
with diagonal elements differing in magnitude. 


(a) 

A = 1.0 0.259952 0.08886876 0.01772265 0.03886876 
0.259952 1.0 0.259952 0.03886876 0.01772265 
0.08886876 0.259952 1.0 0.259952 0.03886876 
0.01772265 0.03886876 0.259952 1.0 0.259952 
0.03886876 0.01772265 0.03886876 0.259952 1.0 


Ad = /0.9911413 0.1809132 0.0104918 0.0063647 0.0187119 
0.1309132 0.9826457 0.1308604 0.0102163 0.0063647 
0.0104918 0.1308604 0.9826144 0.1308604 0.0104918 
0.0063647 0.0102163 0.1308604 0.9826457 0.1309132 
0.0187119 0.0063647 0.0104918 0.13091382 0.9911413 


(b) 
A= 1.0' 0.74917 0.48985 
(oa 1.0 on 
0.48985 0.74917 1.0 


A} 


0.3893683 0.8347366 0.3893683 
0.1875400 0.3893683 0.9017878 


In both methods iteration was continued until, at iteration k, 
the estimate of A1D™ changed by less than 107, ice. 


| pb’; — D¥y" | < 1077 for all ¢ and j. 
Algorithm 298—No. of Iterations 


(ss 0.3893683 sn 


N a 0 
0.8 0.9 0.95 0.999 1.05 
(a) 5 1.054 22 18 17 16 14 
(b) 3 1.071 60. 52 49 47 44 
Series Method--No. of Iterations 
N Bont 0 


1.0 1.11.2 1.31.4 1.5 1.6 1.7 1.8 1.9 2.0 
(a)5 1.59 21 19 17 16 14 18 15 19 26 37 67 
(b)3 2.006 118 108 100 98 87 81 77 72 69 65 63 


The behavior of Algorithm 298 was found to be similar in all 
cases studied. The best choice of 6 is as close to a as possible. 
Normally, 0.999 must be chosen. The performance of the series 
method is well illustrated by the two examples chosen. It is dif- 
ficult to determine a good value of 6, and even if a value very close 
to Oopt is accidentally used, the performance of the series method 
can be inferior to the method used in Algorithm 298. 

The series method has one other disadvantage. For an efficient 
algorithm, several extra arrays are required as intermediate 
storage. The only clear advantage is that the series method can be 
readily modified for powers other than square root. Algorithm 298 
is the most efficient method of the two. 
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ALGORITHM. 299 

CHI-SQUARED INTEGRAL [S15] ) 

I. D. Hitt anp M. C. Pike (Recd. 9 Sept. 1965 and 3 
Oct. 1966) 

Medical Research Council, Statistical Research Unit, 
115 Gower St., London W. C.1., England 


real procedure chiprob (x, f, bigx, normal, wrong); 
value x, f, bigx; realxz; integer f; Boolean bigz; 
real procedure normal; label wrong; 

comment Finds the probability that x?, on f degrees of freedom 
exceeds 2, i.e€., 


1 ] 


eee 4f-1 phe 
ECAP) girl e- dz 


(¢ 20, f 21) 


The algorithm is based upon the recurrence formula 


L ho. 420 
2X é 


rGf) 


[Handbook of Mathematical Functions, National Bureau of Stand- 
ards, Appl. Math. Series 55 (1964), formula 26.4;8] by means of 
which any x?-integral can be reduced to the sur of 
(i) a series of terms that can be directly evaluated, and 
(ii) a x?-integral on 2 degrees of freedom (if f:is even), or on 
1 degree of freedom (if f is odd). 
To evaluate (ii) we have either 


P (xf? > #) = P (xj > 2) + 


P(x? > x) = 


or 


P (x2 > x) = (2/+/2r) i e-#” dz. 


The evaluation of the latter expression is performed by the 
formal real procedure normal which must evaluate the lower 
tail area of the standardized normal curve (real procedure 
Gauss [D. Ibbetson, Alg. 209, Comm. ACM 6 (Oct. 1963), 616] 
may be used as the actual parameter). 


The parameter bigz should be set to true if the value of x is - 


too big for exp (—0.5X2) to be accurately represented by the 
machine, or false otherwise. 

‘For even degrees of freedom the method is exact, and the 
algorithm is essentially accurate to the accuracy of the machine. 
For odd degrees of freedom the accuracy will be dictated by the 
accuracy of the real procedure normal. 

For large degrees of freedom, if speed is more, important than 
great accuracy, it may be found preferable to use an approxima- 
tion, e.g., the Wilson-Hilferty cubic formula [Wilson, E. B., and 
Hilferty, M. M., Proc. Nat. Acad. Sct. 17 (1931), 684] which may 
be expressed as 


chiprob := normal (—sgrt (4. BXF)X ((e/f) (1/3) +2/OXf)— -1)), 


This is accurate to 3 decimal places for f > 40. 
The authors thank the referee and the editor for helpful 
criticisms and suggestions; 
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begin. 
ifx<O0Vf < 1 then go to wrong else 
begin 
real a, y, 8; 
Boolean even; 
a:=05X «2; even := 2X (f+2) =f; 
if even \/V f > 2 A abigz then y := exp(—a); 
s := if even then y else 2.0 X normal (—sgqrt (x)); 
if f > 2 then 
begin 
real ¢, c, z 
= 0.5 X (f—1.0); 2 := if even then 1.0 else 0.5; 
if bigx then 
ee 
:= if even then 0 else 0. 572364942925; ¢ := 
comment 0.572364942925 = In iG: ; 
for z := z step 1.0 until z do 
ssl 
= In (2) + @; 
= exp(eXz— a +s 
aa 
chiprob := 
end else 
begin 
é := if even then 1.0 else 0.564189583548/sqri(a); ¢ := 0; 
comment 0,.564189583548 = 1/sqrt(r); 


In (a); 


for z := z step 1.0 until x do 
begin 
eé:= € X a/z; 
c:=c+e 
end; 
chiprob :=cXyt+s 
‘end 
end else chiprob := 


end — 
end chiprob 
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CHI-SQUARED INTEGRAL [I. D. Hill and M. C. 
Pike, Comm. ACM 10 (Apr. 1967), 243] 

Witiram M. O’Brien AND Joan Woop (Recd. 17 Oct. 
1967 and 1 Dec. 1967) 

Department of Preventive Medicine, University of Vir- 
ginia School of Medicine, Charlottesville, Virginia 


KEY WORDS AND PHRASES: 
bility, special functions 
CR CATEGORIES: 5.5, 5.12 


chi-squared integral, proba- 


Chi-Squared Integral compiled and ran in Burroughs B5500 
AuGou with the following revisions: 
(i) wrong was removed from the formal parameter list; 
(ii) label wrong; was removed from the specification part 
(iii) the last two lines were modified to read: 


end, 
wrong: end chiprob 
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These modifications were necessary since the heading of a typed 
procedure may not contain a label in Burroughs Extended ALGOL. 
{[Editor’s note: The question of whether a function procedure in 
ALGOL 60 may have a label as a formal parameter providing an exit 
from the procedure via a go to statement is not completely an- 
swered in the ALGou 60 report. See D. E. Knuth, The remaining 
trouble spots in ALGou 60, Comm. ACM 10 (Oct. 1967), 611-618 
(614). The use of wrong as a formal parameter in chiprob may cause 
trouble in many compilers. Perhaps the best way to handle the 
problem of error exits is to provide a formal parameter, error, 
which is a: procedure name and let the user provide his own pro- 
cedure for error recovery.—JGH]. 

bigx was set to true if erp(—0.5X2z) < 1-10 and Algorithm 209 
[D. Ibbetson, Gauss, Comm. ACM 6 (Oct. 1963), 616] was used for 
the formal real procedure normal. 

The following were calculated: 
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Chi squared Degrees of freedom 


0.001 1 to 3 

2.2 1 to 17 
8.2 1 to 32 
82.0 even values 34 to 70 


The results were checked against E. 8. Pearson and H. O. Hartley, 
Biometrika Tables for Statisticians, vol. 1, 2nd ed., Cambridge, 
1962, pp. 122-129, which gives values of chi squared to five decimal 
places. The computer calculations, which were carried to nine 
places, gave identical results except in three instances, which were 
x? = 2.2 with df = 10, x? = 8.2 withdf = 24, and x? = 82 with df = 
38. In all three cases the sixth figure would have rounded to a 5 
and the discrepancies appear to be due to inconsistencies in the 
rounding of the original Biometrika Tables, rather than errors in 
the procedure. 
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REMARK ON ALGORITHM 299 
Chi-Squared Integral [S15] 


{I.D. Hill and M.C. Pike, Comm. ACM 10, 4 (April 1967), 243] 
Mohamed el Lozy, M.D. [Recd 20 May 1976 and 15 July 1976] 


Department of Nutrition, Harvard School of Public Health, 665 Huntington Ave., 
Boston, MA 02115. 


This work was supported in part by the Agency for International Development under Con- 
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tion, Harvard School of Public Health. 


This algorithm suggests the usc of the Wilson-Hilferty formula [3] if an approxi- 
mation is desired for large degrees of freedom. Peizer and Pratt [2] have since then 
described a family of normal approximations far supcrior to the cube-root family, 
their formula for the chi-square distribution being [eqs. (2.24b) to (2.27) of their 
paper]: 

2 = d{(1 + g(s/x)) /2a]"” 


where z represents the corresponding normal deviate, x represents the chi-squared 
value, x represents the degrees of freedom, and 


s=n—l 

d =x —n-+ 344 — 0.08/n 
gf) = (1 —?@+2thd)/1—)*, t>0,t#1 
g(0) = 1, g(1) = 0. 


The two approximations were compared for degrccs of freedom n = 1 (1) 20 (5) 
100 (2) 200 using, for each value of n, a grid of 500 chi-squared valucs uniformly 
distributed over the interval from P = 0.00001 to P = 0.99999. The “‘truc” values 
of P were calculated using an IMSL subroutine, MDCHDI [1] which is essentially 
a double precision Fortran version of Algorithm 299 not using any approximation. 
Table I shows the maximum difference between the “true” results and those ob- 
tained with both approximations; the superiority of the Peizer and Pratt approxi- 
mation is clear. For'only 4 degrees of freedom it will give 3 correct decimals; for 11 
degrees of freedom it will give 4 correct decimals; for 31 degrees of freedom it will 
give 5 correct decimals; and for 120 degrces of freedom it will give 6 correct deci- 
mals. In contrast, the Wilson-Hilferty approximation will give 3 correct decimals 
for 25 or more degrees of freedom, and calculations with a coarse grid show that 4 
correct decimals are achicved somewhere between 200 and 300 degrees of freedom. 


COLLECTED ALGORITHMS (cont.) 


Since full word length accuracy is rarcly, if ever, necded in the evaluation of the 
integrals of probability functions, it is suggested that for more than 30 degrees of 
freedom the Peizer and Pratt approximation be used in place of the iterative al- 
gorithm. There would appear to be no justification for using the Wilson and Hil- 
ferty approximation. 

The calculations were done on an IBM 370/168 using double precision through- 


Table I. Maximum Absolute Errors for the Wilson-Hilferty and 
Peizer-Pratt Approximations to the Chi-Squared Integral 


Maximum error using approximation of 
Degrees of freedom 


Wilson-Hilferty Peizer-Pratt 

5 26E—2 33E—3 
10 13E—2 58H —4 
15 82h —3 22Te—4 
20 61-3 12K—4 
25 48E—3 73E—5 
30 389E—3 50E—5 
50 23E—3 19f—5 
100 liie—3 55E—6 
120 92In—4 411i—6 
200 54 —4 18E—6 


out, as it was desired to test the accuracy of the approximation without having to 
worry about inaccuracies due to the short word length of the machine. Single 
precision calculations gave almost identical results for the Wilson-Hilferty approxi- 
mation. In the case of the Peizer and Pratt approximation very similar results were 
obtained up to about 30 degrees of freedom, after which the maximum error ob- 
tained with single precision was greater than that obtained with double precision, 
and never fell below .2E — ‘5. However, like the double precision version, the single 
precision routine gave 5 correct decimal places for 31 or more degrees of freedom. 

The use of the g function avoids inaccuracies that would arise if the simpler 
equation (2.24a) of [2] were used. In evaluating it, care must be taken near the two 
singularities. For 1 degree of freedom s = 0; so the argument to g(t) will be zcro 
and g(t) must be set to 1.iIn the testing done, the smallest nonzero valuc of the 
argument to g(t) was 0.04, which did not lead to any numerical problems. On the 
other hand, values very close to 1 were obtained, the smallest absolute difference 
from 1 being .14E — 4. In single precision at least such arguments can lead to great 
loss of accuracy; so for values of the argument close to 1 the power expansion given 
by Peizer and Pratt [2, eq. (10.3)] should be used: 


g(t) = EC ~ N/G+ YG +2). 


It is not clear what the optimal value of the crossover point from the logarithmic 
to the power series form of g(t) is, but in the single precision version we have used 
an absolute value of (1 — #) less than 0.1 as the crossover criterion, taking the first 
5 terms of the series. 
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COLLECTED ALGORITHMS FROM CACM 


ALGORITHM 300 

COULOMB WAVE FUNCTIONS [822] 

J. H. Gunn (Recd. 19 Feb. 1965) 

Nordisk Institut for Teoretisk Atomfysik 
Blegdamsvej 15, Copenhagen, Denmark 


procedure Coulomb(F, Fd, G, Gd, sig, rho, eta, lmaz, exit); 
value rho, eta, lmaz; 
real rho, eta; integer lmar; array F, Fd, G, Gd, sig; label 
exit; 
comment The Coulomb wave functions Ff, and Gz are defined 
as the two independent solutions of the differential equation 


@ 
qa t l-2n/e-LIL+D)/o%y = 0 
having the asymptotic behavior for large p 
ee L 
Fy~sin (o-» In 20-5 tes) : 


L 
Gr ~ cos (o- In 2-3 +o) 


where o, = arg’ (in +Z+1). The procedure calculates for a 
given p = rho and » = eta, the functions F, and G_, , their de- 
rivatives F,’ andG_’, and cz for all L from 0 up to Imax (>0) and 
places the results in the arrays F, G, Fd, Gd, sig respectively, 
which must have bounds 0:lmaz. rho must lie in the range 5-30 
and eta in the range 0.1-30: values outside this range cause the 
procedure to leave via the label exit. This range is one that is 
often used in scattering and reaction problems in physics. De- 
tails of the methods used are to be found in: C. E. Froéberg, 
‘“‘Numerical treatment of Coulomb wave functions,” Rev. Mod. 
Phys. 27 (1955), 399-411, and in: H. F. Lutz and M. D. Karvelis, 
“Numerical calculation of Coulomb wave functions for repulsive 
Coulomb fields,” Nucl. Phys. 48 (1963), 31-44. The author grate- 
fully acknowledges the extensive help of Miss Margaret Wirt 
in the preparation of this procedure; 
begin 
integer n; real rhom; 
comment jump to label ezit if rho and eta lie outside range of 
procedure; 
if rho < 5 V/V rho > 30 V eta < 0.1 V eta > 30 then 
go to exit; 
begin real sto; integer 1; 
comment phase shifts o, are calculated using formulae 
44-45 of Lutz and Karvelis; 
sto := 16 + eta T 2; 
sig[0] := —eta + eta/2 X In(sto) + 3.5 X arctan(eta/4) — 
(arctan (eta) +arctan (eta/2)-+arctan (eta/3)) — eta/(12X sto) X 
(1+1/30 * (eta f2—48)/sto T2 + 1/105 
X (eta tT 4—160X eta fT 2+1280)/sto T 4); 
for 7 := 1 step 1 until lmaz do 
sig{t] := stg[¢—1] + arctan(eta/t) 
end; 
if rho S (5Xeta—15)/3 \/ rho S eta then 
begin comment G/[0] and Gd[(0] are calculated using the Riccati 
method (p<2n) ref. formulae 9.1-9.4, Froberg; 
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integer 7; real q, psi, psid, f; array g, gd[0:7], t, s[1:10]; 
t{1] := rho/(2Xeta); s{1] := 1 — t[1]; q := sert(t{i]x<sfi]); 
for 7 := 2 step 1 until 10 do 
begin ?¢[t] := ¢{1] * é[¢—1]; 

s[t] := sf] X s[i—1] 


end; 

g(0] := q+ arctan(t{1]/q) — 1.5707963; 

g{1] := 0.25 & In(@t[1]/s[1]); 

g[2] := —(8Xt(2]—12X¢[1]+9)/(48XqXs[1]); 

g[3] = (8X¢t[1]—3)/(64xX¢[1] Xs[8]}); 

gl4] := (2048 X2[6] —9216 X¢[5]-+16128 Xt[4] — 13440 x t[3] — 12240 


¢t[2]+-7560 X ¢[1] — 1890) / (92160 X gX¢[1] Xs[4]) ; 
3X (1024 X [3] — 448 X ¢[2] 4-208 X ¢[1] — 39) / (8192 Xt[2] x 


g{6] = — (262144 1[10] — 1966080 X [9] -+6389760 x t(8] — 11714560 
Xt{7]-+13178880 X t[6] — 9225216 X t[5] +13520640x t[4] — 
3588480 X ¢[3] +-2487240 X t[2] — 873180 X ¢[1] +130977) /(10321920 
x<qXt[2] Xs[7]); 

gl7] := (1105920 t[5] — 55296 X 1[4] +-314624 x ¢[3] — 159552 x ¢[2] 
+45576 X t[1] — 5697) / (893216 X ¢[3] Xs[9]); 


g@[0] := g/t{1]; 

ga(1] := 0.25/(é{l]xs[1]); 

ga[2] := —(8Xit{1]—3)/(82xXqXi[1] Xs[2]); 

gd[3] = 3X (8Xz[2]—4X¢t[1]+1)/ (64 X¢[2] Xs[4]) ; 

gd[4] := — (1536 Xi[3]—704 Xt[2]+336 X¢[1] —63) / (2048 X q¢Xt[2] 
Xs[5]); 


ga[5] := 3X (2560 Xt[4] —832 <t[3] +728 X t[2] — 260 ¢[1]-++39)/ 
(4096 < ¢[3] Xs[7]) ; 

ga[6] := (—368640 X¢[5] —30720X ¢[4]+114944 x t[3] — 57792 X ¢[2} 
-+16632 X ¢[1] — 2079) / (65536 X qX ¢[3]X s[8]) ; 

ga{7] := 3X (860160 ¢[6]-++196608 X ¢[5]-+308480 x t[4]— 177280 
t[3]-+-73432 X t[2] — 17724 X t[1] +1899) / (131072 x ¢[4] X s[10]) ; 

f :=2*X eta; psi := psid := 0; g:= —1; 

for 7 := 0 step 1 until 7 do 

begin psi := psi +q Xf X gle]; 
psid := psid + qa Xf X galt; 


f := f/@xXea); q:= —¥ 
end; 
G[0] := exp(psi); Ga[0] := G0] x psid/(2X eta); rhom := 
rho 
end else 


if rho = (30Xela+75)/13 A rho < 2X eta 7 2 then 


begin comment G/[0] and Gd[0] are calculated using the second 

Riccati method (2n< p) ref. formulae 9.6-9.8, Froberg; 

integer 7; real A, B, psi, phi, M,q; array x,y, e[1:10]; 

z[l] := 2X eta/rho; y{l] :=1 —all]; q:= sqart(yfl]); ef] 
:= 2 X ela; 

for i := 2 step 1 until 10 do 

begin x(7] := 2{1] X 2[t—1]; elt] := e{1] & e[¢—1]; 
y(t] := yftl x ylt—1] 

end; 

psi := — (8Xx[3]—3X2[4])/ (64Xe[2]Xy[B]) + 3 X x5] Xx 
(1024 — 448 Xa[1]+208 x 2[2] 39 X2[3])/ (8192 x e[4] x<y[6]) — 
z[7] X (1105920— 55296 X2[1]4-314624 X 2[2] — 159552 Xz[3]+ 
45576 X2[4] — 5697 X 2[5]) / (893216 X e[6] x y[9]) ; 

phi := e[l] X (g/x[1] + 0.5 X In((l—q)/(l+q))) + 0.7853982 
— (9X2[2]—12X2[1]+8)/48Xe(1] X¢xXyl[1)) 
— (2048—9216X2[1]+16128 x x[2]— 13440 X 2[3]— 12240 


COLLECTED ALGORITHMS (cont.) 


X 2[4]+-7560X x[5]— 1890X x[6])/(92160X e[3] x gy[4]) 
— (130977 X x[10]— 873180 X x[9]-+2487240X 2x[8] — 8588480 
X x[7]+13520640 x x[6] — 9225216 X x[5]+15178880  2[4] 
— 11714560 2[3]+-6389760 x[2] — 1966080 [1] 
+262144)/ (10821920 e[5]X gx y[7]) ; 
= q/z[2] + (8X2[1]—3X-2[2])/ (2X e[2]X qx y[2]) 

— x[3] X (1536—704X2[1]+336X 2[2]—63X2[3])/ 
(2048 x e[4]X qXy[5]) + 2[5] X (368640—30720X2[1] 
+114944x 2[2] —57792 2[3]-+16632X [4] — 2079X2[5])/ - 
(65536 X e[6]X qXy[8]) ; 
= 1/(4Xe(1]xXy[1]) — 3 X [2] & ([2]—4x2[1]+8)/ 
(64x e[3]Xy[4]) + 3 & 2[4] X (2560—832X2[1]+728 
X #[2]— 260 x[3] +39 Xx[4])/ (4096 X e[5]Xy[7]) -- 3 
X 2[6] X (1899 X2[6]— 17724 2[5]+-73432 Xx 2[4]--177280 
X 2[3]+308480X x[2]+ 196608 X x[1]-+860160) / (131072 
Xe[7]Xy{10]) ; 

M := sqrt(1/q) X exp(pst); 

G[0] := M X cos(phi); 

Gd[0] := —2[2] Kk (AXMXsin(phi)+BXG[0]); rhom :=rho 

end else 
if ela < 4then 


begin comment G/(0] and Gd[0] are calculated using an asymp- 

totic expansion, ref. formulae 12.3-12.7, Fréberg; 

real ss, sl, tt, t1, SS, S1, TT, T1, sn, in, Sn, Tn, a; Bn, theta, 
cth, sth; integer 7; 

rhom := if rho 2 2 x eta T 2 then rho else 2 X eta fT 2; 

comment a suitable value of rhom is chosen for which the 
expansion is valid; 

ss:=sn:=1; th:= in :=0; 

SS := Sn :=0; TT := Tn := 1 — eta/rhom; 

for 7 := 0 step 1 until 10, 11, 7 + 1 while (abs(sn)>10—7 
Xabs(ss)\V abs (tn) >10—7 Xabs (it) Vabs (Sn) >10--7 
Xabs(SS)\Vabs(Tn)>10—7 Xabs(TT)) A (abs(sn) 
<abs(s1) abs (in) <abs (#1) abs (Sn) <abs(S1)A\ abs(Tn) 
<abs(T1)) do 


begin An := (2Xi+1) X ela/(2X (¢+1) Xrhom); ' 
Bn := (eta t 2-71 @4+1))/(2X& (+1) Xrhom); 
sl:= sn; tl:=in; Sl:= Sn; T1l:= Tn; 
sn := An X sl — BnX tl; 
in := An X tl + Bn X s1; 
Sn := An X 81 — Bn X T1 — sn/rhom; 
Tn := An X T1+ Bn X S1 — tn/rhom; - 
ssi=ss-+sn; th := tt + in; 
SS := S8+S8n; TT := TT + Tn 

end; 

theta := —eta X In(2Xrhom) + rhom + szg[0]; 


cth := cos(theta); sth := sin(theta); 
G{0] := ss X cth — tt X sth; Gd[0] := SS X cth + TT X sth 
end else | 
begin comment G[0] and Gd[0] are calculated on the transition 
line for rhom = 2 X eta, ref. formulae 10.38-10.4, Fréberg; 

G0] := 1.22340416 X eta T (1/6) X (1+0.0495957017/eta T (4/3) 
— 0,0088888889/eta T 2+-0.00245519918/eta tT (10/3) 
—0.000910895806/eta f 4+-0.000253468412/eta fT (16/3)) ; 

Gd[0] := —.707881773 X eta f (—1/6) X (1--0.172826037/ 
eta t (2/3)-+0.000317460317 /eta T 2—0.00358121485/eta T (8/3) 
+0. 000311782468/eta t 4— 0.000907396643/eta T (14/3) ; ‘ 

rhom := 2X eta 

end; 

if rhom # rho then 

begin comment Integrate the solutions G[0] and Gd[0] from 
the value of rhom at which they were evaluated to the value 
of rho required using Runge-Kutta formula; 

integer nh, i; real kl, k2, k3, k4, kip, k2p, k3p, kp, y, yp, 
x, h; 
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nh := entter(abs(rhom— rho) X10+1); 
= (rho—rhom)/nh; 

z:=rhom; y:=G[0]; yp := Gd(0]; 

for 7 := 1 step 1 until nh do 

begin kl :=h X yp; kip := 
k2 :=h X (ypt+klp/2); k2p := 
xX (y+kl/2); 


—hX (—-2xXeta/xr) X y; 
—h X (1—2xXeta/ (4+h/2)) 


k3 :=hX (yptk2p/2); k8p := —h X (1—2Xeta/ (x+h/2)) 
X (y+k2/2) ; 
k4 = h X (yptk8p); k4p = —h X (1—2Xela/(z+h)) X 


(y-+k3); y = y + (KI42Kk24+2%k3-+h4)/6; 
yp := yp + (klp+2Xk2p+2Xk3p+k4p)/6; 


ai=ath 
end; 
G[0] := y; Gd[0] := yp 
end; 


n := if rho > lmaz then entier(rho+10) else Imax + 10; 


begin comment Use downward recurrence relation (Millers 
method) and normalisation condition to obtain solutions 
F(Z]; 
array f[0:n]; 
f{n] := 0; 
fin—1} = 1; 
for L := n — 1 step —1 until 1 do 
f(L—-1) := L/sqrt(etat 2+L72) XK ((2XL-+1) Xeta/ 
(LX (L+1))+(2%L+1)/rho) X f[L]— sqrt(eta ft 2 
+(L+1) f 2)/(2+1)xf[L+1)); 
fdO := (eta+1/rho) X f[0] — sqrt(eta T2+1) X F[L]; 
G[1] := (—Gd[0]+ (1/rho+eta) XG[0])/sqrt(I+eta fT 2); 
alpha := 1/(sgrt(l+eta f 2) (f[0]XG[1]—fIL]XGI[0])); 
F[0] := alpha X f{0]; 
Fd(0| := alpha X fd0; 
comment Upward recurrence 
solutions; 
for L := 0 step 1 until lmaz—1 do 
begin F[L+1] := alpha X f[Z+1]; 
sto := sqrt(eta T2+(L£+1) f 2)/(Z£+1); 
Fd[L+1] := sto X F[L] — (eta/(L+1)+ (L+1)/rho) X 
F([L+1]; G[Z+1] := 1/sto X ((eta/(L-+1)+ (£+1)/rho) 
XG[L]—Ga[L]) ; Gd[L+1] := sto X G[L] — (eta/(Z+1)+ 
(L+1)/rho) X G[L-+1] 
end 
end 


real fd0, alpha, sto; integer L; 


relations for remaining 


end Coulomb 


CERTIFICATION OF ALGORITHM 300 [822] 
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Applied Mathematics Group, Data Handling Division, 
European Organization for Nuclear Research (CERN), 
1211 Geneva 23, Switzerland 
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CR CATEGORIES: 5.12 


The procedure Coulomb was checked for a few parameter 
values using the ALGoL compiler of the CDC 3800 computer at 
CERN. It was found that for p = 7 better results were obtained if 
the first line of the second if statement was altered to read: 


COLLECTED ALGORITHMS (cont.) 


if rho < (5 X eta — 15)/3 VV rho < eta then 

It was also necessary to correct a misprint in the first constant 
following the comment “G[0] and Gd[0] are calculated on the 
transition line for rhom = 2 X ela, ref. formulas 10.3-10.4, Fré- 
berg.’’ The line following this comment should read: 


G[0] := 1.223404016 eta T (46) X (1 + 0.0495957017/ela Tf (4) 


The procedure was then translated into ForTRAN and tested in 
more detail on a CDC 6600 computer. The tests included the fol- 
lowing: 

(i) Generation of ®z(n,0) = [(Cu(m)o"* 1 Fila), L = 0(1)21 
for » = 1(1)5, p = 5. The results were compared with values 
tabulated in [1]. In most cases, 6 to 7 significant digits agreed, 
except for 7 = 1, where agreement was found to 8 to 4 significant 
digits. It is interesting to compare some results for p = 7 = 5 
obtained with and without the first of the above corrections: 


Table [1 d 

L\@1r Without correction With correction nee 2] 
0 6.554097 103 6.552297 193 6.552292 103 
5 1. 865738191 1.865226191 1.865225 101 
10 5.354953 100 5.358482199 5.858478100 
20 2 .440859:100 2.440188 190 2.440187100 


(ii) Computation of Fo(n,e), Fo’(n,e) = (d/dp)Fo(n,e) for » = 
2(2)12, p = 5(5)30. Comparison with the table of Tubis {8] re- 
vealed frequent discrepancies of | (occasionally 2) units of the fifth 
significant digit. However, disagreement was observed in many 
fewer cases when comparing the calculated results with those 
obtained by Gautschi’s algorithm [2]. 

(iii) Computation of Po(n,p), Fo’(n,p), Go(n,p), and Go’(n,p) for 
p = 2n, p = 5(.5)20(2)30. Comparing the results with the table of 
Abramowitz and Rabinowitz [4] or with the values obtained with 
Gautschi’s algorithm, the following discrepancies were found in 
units of the seventh decimal place: 


Fy) —frequently 1, occasionally 2, units for p < 10; 
F,’—frequently 1 unit for p < 8.5; 
Go —for p < 8 up to 40 units, for 8 < p < 14.5 up to 2 or 3 units; 

Go’—for p < 7.5 up to 13 units. 

(iv) Calculation of Go(n,p), Go’(n,o) for » = .5(.5)20, p = 5(1)20. 
The results have been compared with the tables given by Abramo- 
witz [5]. Agreement was found in most cases to 5 significant 
digits. Discrepancies of 1, occasionally more, units of the fifth 
significant digit were found, mainly for arguments near a line 
separating two methods used in the algorithm. In some cases (in 
the immediate neighborhood of a zero of G or Go’) there was 
agreement to only 2 or 3 significant digits. 

(v) Generation of Fr(n,o), Fx'(n,p), Gr(n,e), Gr'(n,e), e1(n) 
for L = 0(1)10, p = 5,10, y = 1(1)5,10,25. As a first step, the results 
were compared with values given in a table by Lutz and Karvelis 
[6]. Since important discrepancies were noted for 7 = 1, p = 5 
and » > 4, the values for F, and F,’ were also calculated by 
Gautschi’s algorithm, known to be correct by checking it against 
the table [1]. Lutz and Karvelis give 6 significant digits, but 
without commenting on a possible error tolerance. They state, 
“we test [the generated functions] to see how closely the 
Wronskian relation F/G, — F,G1,’ = 1 is obeyed.’’ Comparison 
of their values with those obtained from Gautschi’s algorithm 
shows, for 7 < 4, occasional discrepancies of 1 unit in the sixth 
significant digit. For 7 > 4 [disregarding some obvious misprints, 
e.g. for Gi(2,10) and Gio(10,10)] there are discrepancies which in 
a few cases exceed a 100 units in the sixth significant digit. Be- 
cause of this, the table of Lutz and Karvelis was used for check- 
ing the procedure Coulomb only for 7 < 4. For » => 4 check values 
were obtained from Gautschi’s algorithm (F, and F,’ only). 
The following discrepancies were found in units of the sixth 
significant digit: 
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n=1,e = 5: F,—uptol119 units (ZL = 8). 
F,’—up to 87 units (ZL = 0). 
G,.—up to 350 units (Z = 2). 
G,,’—up to 247 units (LD = 0). 

n=1,p= 10; 

n = 2,3 : 1 or 2 units in several cases, exceptionally more; 
one isolated case G's(3,10) with 23 units. Compari- 
son with Gautschi’s values (where possible) 
gives better agreement. 

n> 4 Occasionally 1 unit for F,; and F;’. 


o7,(n) nearly always agreed to 6 significant digits for all tested ». 

To complete the check, values of the functions at 7» = 1, p = 5, 
and » = p = 5 were calculated using the Aucou procedure. The 
results agreed with those calculated by the FortRaN program to 
the 6 significant digits which were compared. 
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Recently, Isacson [1] pointed out that the coefficient of n71? in 
the known asymptotic expansion for the irregular Coulomb wave 
function Go(y, p) on the transition line p = 2y was erroneous. 

In addition, he gave the expansions for F'9 , Go , Fo’ and Go’ up 
to order 778, whereas the old expansions were given to, order 
7768 only. 

Therefore, and for reasons of speed, the relevant part of Algo- 
rithm 300 should be changed as follows: 


begin comment G(0] and Gd[0] are calculated on the transition 
line for rhom = 2 X eta, ref. Isacson in remark; 
array ei[1:12]; real etl; 
el{l] := eta | (—%); 


COLLECTED ALGORITHMS (cont.) 


for 1 := 2 step 1 until 12 do eé[i] := et[1] : et[i—1]; 
etl := ela T (36); 
GO] := 1.223404016 x etl & (1 + 0.04959570165 X et [2] 
—0.008888888889 x et [3] + 0.002455199181 x et [5] 
—0.0009108958061 x e¢ [6]. + 0.0008453619999 x et [8] 
—0.0004096926351 x et [9] + 0.0007116506205 x et [11] 
—0.00002439615603 x et [12]); 
Gd{0] := (—0.7078817734/et1) X (1 — 0.1728260369 x et [1] 
+ 0.0003174603174 X et [3] — 0.003581214850 x et [4] 
-+-0.0003117824680 x et [6] — 0.0009073966427 x et [7] 
+0.0002128570749 x et [9] — 0.0006215584171 xX et [10] 
+0.00003685244766 XX et [12]); 
rhom := 2 X eta 

end; 


Furthermore, it was found in this connection that replacing the 
first line of the fourth 7f statement of the algorithm by 
if ela < 4 A eta < rho/2 then 
gives, together with the above expansions, better results for 
p = 2n in test (ili) and for p = 3,7 = 5 in test (i) of the Certification. 
The relevant statements in test (ili) of the Certification should 
therefore be replaced by the following ones: 
Fo — 1 unit for p = 5, p = 6, and p = 8.5. 
Fo’ — 1 unit for p = 6 
Go — 1 unit for p = 5.5, 
Go’ — 1 unit for p = 
REFERENCE: 
1. Isacson, T. Asymptotic expansion of Coulomb wave functions 
on the transition line. BIT 8 (1968), 248-245. 
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300 [K.S. Kélbig, Comm. ACM 12 (May 1969), 

279)|; Remark on Algorithm 300 [K.S. Kolbig, 
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Key Words and Phrases: Coulomb wave functions, wave 
functions, special functions, function evaluation 
CR Categories: 5.12 


The procedure Coulomb can be used very well to generate the 
Coulomb wave functions F,, and G,, and their derivatives, needed 
in elastic scattering calculations in nuclear physics. When the pro- 
cedure is used many times for many values of rho and efa, it is 
not only very useful but also necessary to have in each instance an 
indication about the accuracy of the results. It is obvious to use 
the Wronskian relations F,'G, — F,G,' = 1 for the purpose of 
checking the results, as Froberg [1] states after formula (3.4). 
However, one has to be very careful in using these relations. The 
most significant check is given later on, but first it is shown what 
can go wrong. 


This investigation was part of the research program of the 
“Stichting voor Fundamental Onderzoek der Materie (F.O.M.),” 
which is financially supported by the ‘Nederlandse Organisatie 
voor Zuiver Wetenschappelijk Onderzoek (Z.W.O.)”. 
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Kolbig pointed out already in the certification that Lutz and 
Karvelis [2] failed to notice discrepancies exceeding 100 units in 
the sixth significant digit in their tables although they state “when 
all the functions are generated we test to see how closely the Wron- 
skian relation F,’G, — Fx,G,' = 1 is obeyed.” The way Lutz 
and Karvelis generate the functions goes as follows. First they 
calculate Go and Gy’; then they use recurrence relations to get 
G, and G_' for L > 0; and lastly them use backward recurrence 
relations together with the relation FoG; — Gof; = (n? + 1)7} 
to get F, and F,’ for all L. This last relation is in fact a different 
form of the Wronskian relation, see e.g. Fréberg [1] formula 
(3.5). The use of the Wronskian relations to check the results now 
gives information only about the stability in the use of the recur- 
rence relations, not about the accuracy of the Coulomb wave func- 
tions. 

As an independent check on the function values, the following 
procedure can be used. It is easy to calculate Fo and F,’ directly, 
that is in the same way as Go and Gy’ are calculated (see Froberg 
[1] and Isacson [3]). We call the results Fy (dir) and Fy’ (dir). 
These values can be compared with the Fy (rec) and Fy’ (rec) cal- 
culated via the recurrence relations, Wronskian relation, and G5 
and G; as in the procedure Coulomb. This direct test has to be 
preferred above a test via the Wronskian relation for the direct 
results GoFo’ (dir) — Go'Fo (dir) = 1 because errors in Fo (dir) 
and F)’ (dir) sometimes cancel in the Wronskian. The other Wron- 
skian relations (i.e. for Z > O and Fo (rec) and Fy’ (rec)) are hardly 
needed as a test because they only check the recurrence relations 
used. The experience is that errors herein are completely negligible 
(always less than one unit in the tenth digit for all values of L 
for the 12-digit EL-X8 computer of the Mathematisch Centrum in 
Amsterdam). 

To include this check, Algorithm 300 should be changed as 
follows: 

1. The line following the first begin should read 


integer #7; real rhom, q; 


2. The line following the fourth comment (G/0] and Gd|0] are 
calculated using the Riccati method (p <2y) formulas 9.1-9.4, 

Froberg;) should be altered, according to Froéberg (1] formulas 
(9.1) and (9.2), to read: 


integer 7; real g, psi, psid, phi, phid, f; array x, gd|0:7}, ¢, 
s{1:10]; 


3. The relevant lines after the statement starting with gd|7] := ... 
should read: 
f:= 2X eta; psi := psid:= phi:= phid:= 0;q:= —1; 
for i := Ostep 1 until 7 do 
begin psi := psit+qxX fx gi{il; 

psid := psid+q Xf X gdil; 

phi:= phi+f X gli]; 

phid := phid + f X gd{i]; 

f:= f/2/eta,q:= -q 
end, 
G[0O] := exp(psi); Gd|O] := GO] * psid/2/eta; 
F[O] := 0.5 X exp(phi); Fd(0) := F[0] & phid/2/eta; 
rhom := rho; 


4. The line just before the fourth if statement (if eta <4 / eta 
< rho/2 then), i.e. end else, should according to Froberg formula 
(9.8) be replaced by: 


;F(0] := M &X sin(phi); 
Fd(0] := —x(2] * (BXF(0]—AXG|0]) 
end else 


5. Insert after the last line of the calculation using an asymptotic 
expansion, just before the third end else, according to Fréberg 
formula (12.7) , the following lines: 


F(O] := tt X cth + ss X sth, 
Fd(O] := TT X cth + SS X sth; 


COLLECTED ALGORITHMS (cont.) 


6. The two statements after the line 
etl := eta T (1/6); 


i.e. G(0] ae , and Gd[0] := ..., should be replaced by: 


q:= 1; 


here 1: 

G(0] := 1.223 404 016 X etl & (1+ q XK 0.049 595 701 65 X er[2] 
—0.008 888 888 889xer[3]+¢X0.002 455 199 181 
Xer[5]—0.000 910 895 806 1 Xer{6]-+g 
X0.000 845 361 999 9xer[8]—0.000 409 692 635 | 
Xet(9]+¢qX0.000 711 650 620 5Xer(11] 

—0.000 024 396 156 03Xer(12]); 

Gad{0] := (—qX0.707 881 773 4/ertl) K (l—@ 

X0.172 826 036 9Xer{1]+0.000 317 460 317 4 
Xet(3]—qX0.003 581 214 850XKer[4] 
+0.000 311 782 468 Oxet[6]—qX0.000 907 396 642 7 
Xer|7]+0.000 212 857 074 9xXer[9}--¢ 
0.000 621 558 417 1 xXer(10]+4+0.000 036 852 447 66 
Xer[12)); 
if g < 0 then begin 

q:= +1; 

F{0] : = G{0] X 0.706 332 637 3 / 1.223 404 016; 

Fd(0] := Gd[0] XK 0.408 695-732 3 / 0.707 881 773 4; 

go to here 1 

end; 


7. Replace the line 
x := rhom; y := G[0]; yp := Gd[0}; 
after comment Integrate the solutions G[0] . . . by the lines 


x := rhom; y := G[0]; yp := Gal0];q := +1; 
here 2. 


8. Replace the line following the next for statement; i.e. 
G[0] := y; Ga[0] := yp 


by the lines 
if g > Othen 
begin G[0] := »; Gd[0] := yp;q:= —1, 
y := F[O]; yp := Fad{Q); x := rhom; go to here 2 
end else 
begin F[O] := y; Fd[0] :== yp 
end; 


-_ 


Insert after the next end; before the line 


iy 


:= if rho > Imax then. . . the following lines: 


outreal (F[O]); outreal (Fa[0]) ; outreal 
(Fd(0)X G [0] — F[0] x Ga[0}) ; 

comment fF) (direct), Fo’ (direct) and the Wronskian for the 
direct results W(direct) are printed; 


10. Insert just before the comment (Upward recurrence 
relations for remaining solutions) the lines: 


outreal (F(0}) ; outreal (Fd[0}); 
comment Fo (rec) and Fy’ (rec) are printed; 


The tests of the procedure Coulomb with these changes in- 
cluded all the computations mentioned in the Certification except 
those under (il), and those in the Remark. The tests gave the 
same results as in the Certification and in the Remark. Moreover 
the following results were obtained: 

The maximum M of the absolute differences 


M = max (| [Fo(dir) —Fo(rec)] / Fo(rec) | , 

| (Fo' (dir) -- Fo'(rec)] / Fo'(rec) | ) 
was always greater than the absolute difference between the Wron- 
skian for the direct results W(dir) and 1; i.e. 
M > AW = | 1-—Wadir) |. 
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In some cases W(dir) differed not significantly from 1, while the 
test with M indicated considerable discrepancies (see Table [, 
pin = 6,1.5; 7,3.5 and 19,5.5). It was found that for all discrep- 
ancies stated by K6lbig in the Certification and in the Remark, 
the relative error was smaller than or of the same order as M, 
so M gives a good indication about the accuracy of the results 
(see Table p,y = 7,3.5(cert) and 7,3.5(remark)). So discrepancies 
of several units in the fourth or fifth significant digit were found 
near some lines in the (p,7) plane separating two methods used in 
the Algorithm: namely, the lines p = » for 5 < p < 7.5, Syn = 
3p + 15, 307 = 13p — 75 and» = 4, where integration of the 
Coulomb wave functions from the transition line to the desired 
arguments turned out to be the best method (see e.g. Table p, 
n = 5,5.5). In some cases in the neighborhood of a zero of Gy or 
G,', the check with M indicated discrepancies in the third or fourth 
significant digit (see e.g. Table p, 7 = 19,5.5). 

These examples show that when the procedure Coulomb is 
used as a standard procedure in calculations where an accuracy 
of three or more digits is required, it is necessary to have in each 
instance an indication about the accuracy of the results. The quan- 
tity M introduced above can be used very well for such a check. 

Acknowledgment. We would like to thank Prof. Dr. C.C. Jonker 
for valuable discussions and comments. 


Table I. 

a) AWxX108 MxX105 Tabulated 
5 5.5 128 200 Go = .38701 (+2) .38704(+2) 
6 1.5 1.3 50 Go = —.60187 — .60177 

6 2 14.6 20 Go = .57306(—1)  .57313(—1) 
7 3.5* 4 5 Go = 1.520489 1.520492 

7 3.5+ .007 .05 Gy = 1.5204917 1.520492 
19 5.5 5 2000 Gy = —.16442 — .16427 

* Certification. 

t+ Remark. 
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ALGORITHM 301 Here re 
AIRY FUNCTION [820] Sao hiding 
GILLIAN Bonp anp M. L. V. Pirreway go to zerostep 
(Recd. 7 Apr. 1966 and 19 Oct. 1966) end shoricut 
Cripps Computing Centre, University of Nottingham, eae 
egin 
England tor{0] = 4; 
procedure Airy (Ai, Bt, Aid, Bid, x, xia, control); tor[1] := h X derivy; 
value x, xia, control; real Ai, Aid, Bi, Bid, x, xia; square :=h Xh; 
integer control; tor[2] := 0.5 & square X x X tor[0); 
comment This procedure evaluates the real Airy functions and yl := tor[0] + tor[1] + tor[2]; 
their derivatives by solution of the differential equation y” = zy. derivyl := tor[1] + 2 X tor[2]; 
The solutions Az and Bi satisfy the Wronskian relation Ai Bi’ — for n := 3 step 1 until 10 do 
Bi At’ = 1/x. Ai decreases exponentially for large positive begin 
values of x. For large negative values of x, Az and Bi have simi- tor[n] := square X (xXtor[n—2]+hX tor[n—3])/ 
lar amplitudes but differ by 7/2 in phase. ((n—1)Xn); 


The solution is tabulated in the interval —6.6 < 2 < 6.6 by 
Taylor integration of the differential equation in the stable 
directions (towards negative x for Az and away from the origin 
for Bz) with step size 0.1. Alternate values are stored using 268 
locations so that any point is within Taylor range for subse- 
quent interpolation in the table. Asymptotic series are used 
outside this range. The solutions are accurate to eight decimal 
figures. 

For extensive use, computation times can be reduced by ex: 
tending the tabular range to —10 < x < 10 and changing the 
step size to 0.05, using 804 locations. The coefficients A[7] to 
A[10] may then be dropped from the asymptotic series, and 
tor [9] and tor [10]from the Taylor series (J. C. P. Miller, The 
Airy Integral, British Association Mathematical Tables, part- 
volume B, Cambridge, 1946). 

The operation of the procedure is controlled by the integer 
code. A negative value should be assigned to code to set up the 
Airy function tables on the first call for the procedure, or when- 
ever the tables have been disturbed. A subsequent entry with 
code greater than 0 will form: 


At = exp(zia) X Ai(zx) Aid = exp(xia) X At’(x) 
Bi = exp(—zxia) X Bi(z) Bid = exp(—zia) X Bi'(x) 


I 
I] 


If the derivatives are not required, code should be set to zero. 
This will avoid asymptotic series calculations, but Aid and Bid 
are set if |z| < 6.6 even if code = 0; 


begin 


real rimdx, x1, factor, p, q, scale, s, c, xtab, h, pi; 
integer n, 7, j; 
array A[0:10]; 
own real array Aitab, Bitab, Aidtab, Bidiab{[—33:33]; 
procedure Taylor (y1, derivy1, x, h, y, derivy) ; 
value z,h, y, derivy; real y1, derivy1, x, h, y, derivy; 
comment Calculates y(z+h) from y(«) by series expansion of 
dy?/da*? = xy; 
begin 
real square; 
array tor[0:10]; 
integer 7; 
if h = 0 then 


yl := yl + tor[n]); 
derivyl := derivyl + n X tor[n] 
end; 
derivyl := derivyl/h 
end calculation of coefficients in series expansion; 


zerostep: 
end Taylor; 


pt := 3.14159 26536; 

if control < 0 then 

begin 
Bitab[0] := 0.61492 66274; 
Bidtab[0] := 0.44828 83574; 
Attab[33] := 2.15659 9952519 — 6; 


Aidtab[33] := —5.61931 944219 — 6; 
ztab := 0; 

for n := 0 step 1 until 32 do 
begin 


Taylor (Bi, Bid, xtab, 0.1, Bitab[n], Bidiab(n)) ; 
Taylor (Bitab[n-+1], Bidtab[n+1], xtab+0.1, 0.1, Br, Bid); 
Taylor (Bi, Bid, —xtab, —0.1, Bitab[—n], Bidtab|—n)); 
Taylor (Bitab[—n—1], Bidtab[—n—1], —xtab—0.1, —0.1, Bi, 
Bid); 

stab := xtab + 0.2 

end setting up Bz tables; 

for n := 33 step —1 until —32 do 

begin 
Taylor(Ai, Aid, xtab, —0.1, Aitab[n], Atdiab[n]); 
Taylor (Attab[n—1], Aidtab[n—1], xtab—0.1, —0.1, At, Aid); 
ztab := xtab — 0.2 

end setting Az tables 


end; 
if abs(z) S 6.6 then 
begin 
J:= 5X2; 
stab := 9/5; 
h := x2 — stab; 
scale := exp(—wta); 


Taylor(At, Aid, xtab,h, Attablj], Atdtab[j]); 
Taylor(Bi, Bid, xtab, h, Bitab[j], Bidtab[j]); 
Ai := Ait/scale; 

Aid := Aid/scale; 

Bi := Bi X scale; 

Bid := Bid X scale; 

go to finish 
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end interpolation in previously established table; 
rimdx := sgrt(abs(z)); 
ai := rtmdx fT 3/1.5; 
factor := 1/(12Xz72); 
A[0] := 1/sgrt(ptxXrimdz) ; 
r i= 6; 
for n := Ostep 1 until 9 do 
begin 
A[n+1] := (r—1) X (r—5) X factor X A[n]/r; 
ri=r+6 
end calculation of asymptotic series coefficients; 
if x < 0 then go to neg; 
p := A[0}] + A[2] + A[4] + A[6] + A[8] + A[10]; 
q:= A[l] + A[3] + A[5] + A[7] + A[9]; 
scale := exp(xi—zia); 
At := (p—q)/(2Xscale) ; 
Bi := (p+q) X scale; 
go to continue; 
neg: 
p := A[0] — A[2]+ A[4] — A[6] + A[8] — A[10]; 


q:= A[l] — A[3] + A[5] — A[7] + A[9]; 
8 := sin (ai+pi/4); 
€ := cos(xi+pi/4); 
scale := exp(—xia); 
At := (pXs—qxXc)/scale; 
Bi := (pXe+qXs) X scale; 
continue: 


if control = 0 then go to finish 
else if x < 0 then 
begin 
p := —(rtmdx/xi) X 
(—-2X A[2]+4x A[4]—6x A[6]+8x A[8]—10X A[10]); 
g:= —(rimdz/zt) X 
(A{1]—3X A[3]+5xX A[5]—7X A[7]4+9X A[9]); 
Aid := —(rimdzxX Bt)/(scaleX scale) — Ai/(4Xz) 
— (pXs—gqXe)/scale; 
Bid := rimdz X At X scale X scale — Bi/(4Xzx) 
— (pXc+qXs) X scale; 
go to finish 
end calculation of derivatives; 
p := (rimdx/xzt) X 
(2X A[2]+4x A[4]+6x A[6]+8xX A[8]+10X A[10)) ; 
q := —(rimdz/zt) X 
(A[1]+3X A[3]+5x A[5]+7X A[7]+9X A[9)) ; 
Aid := (p—q)/(2Xscale) — At X (rimdx-+1/(4Xz)); 
Bid := (p+q) X scale + Bi & (rtmdx—1/(4Xz2)); 
jinish: 
end Airy 


REMARK ON ALGORITHM 301 [820] 

AIRY FUNCTION [Gillian Bond and M.L.V. Pitteway, 
Comm. ACM 10 (May 1967), 291] 

M.L.V. Pitteway (Recd. 19 May 1967) 

Brunel University, ACTON, W.3., England 


The initial minus sign has been omitted from the line immedi 
ately following the line 
end calculation of derivatives; 
The statement should read 
p:= — (rimdz/xi) X (2X Al2]} + 4X Al4] +6 X AI6] 
+8 X A[8] + 10 x A[10)); 
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ALGORITHM 302 

TRANSPOSE VECTOR STORED ARRAY [K2] 

J. Boornroyp (Recd. 12 Sept. 1966, 28 Nov. 1966, and 
6 Feb. 1967) 

U. of Tasmania, Hobart, Tas., Australia 


procedure transpose (a,m,n); valuem,n; integerm,n; array 

a, comment performs an in-situ transposition of an m X n array 
A[i:m, 1:n] stored by rows in the vector a[1l:m X n]. The method 
is essentially that of Windley [1], modified for use with vectors 
having unit lower subscript bounds. 

The algorithm processes only elements A[1, 2] through 
Alm, n—1] since A[l, 1] and A[m, n] retain their original posi- 
tions. Elements A[g, p] of the transposed matrix are placed in 
alt], in the order 7 = 2,3, --- , mn — 2, by an exchanging proc- 
ess. At the last step two elements are correctly placed which 
accounts for the value mn — 2 as the upper bound on 7. Valid 
subscripts of the vector a[l:mXn] are elements in the 1-origin 
index set [1, 2, --- , mn]. Computationally, however, it is more 
convenient to use the zero-origin set [0, 1, --- , mn—1]. Denot- 
ing by % (to=7—1) the corresponding zero-origin index of 
ali], to be occupied by Alg, p], we have = m(q~—1) + (p—1). 

The corresponding zero-origin index jo of the A[p, g] element 

now in a[j], which must be transferred to a/[z], is: 


jo=7—1 = n(p—-1) 4+ (q-1) = 2 X  mod(mn—1). 

For each value of 7 = 2, 38, «-+, mn — 2 (or t= 
1, 2, --+ , mm — 3) we compute the index 7 of a[j] and exchange 
a(t] and a[j] provided j= 2 (i.e., jo = to). The case j < 7 indicates 
that the element originally in a[j] is now elsewhere following 
previous exchanges. Its present position is given by the first 
jr 2 % in the series of zero-origin indices: 


jo. jr = m X-Jr mod(mn—1). | 


The two sequences modulo(mn—1) are generated by different 
methods. An additive process generates the first, using k to 
duplicate the function of j, in case this is adjusted in the second 
recurrence-generated sequence if j < 7. 

Unlike the similar problem [3], transposition he not appear 
to be completely soluble on wholly group-theoretic lines. A 
general discussion of transposition and a reference to its formu- 
lation as a problem in Abelian-Groups is given in [2]. * 

[1] P. F. Windley, Transposing matrices in a digital computer. 

Comp. J. 2 (1959), 47-48. [2] G. A. Heuer,, Control Data 

Technical Report T.R.53, pp. 3-5. [8] Fletcher, W., and 

Silver, R. Algorithm 284. Comm. ACM 9 (May 1966), 326; 

begin integer #, j, k, ilessl, mnless1, done, jn, modlessn; 


real {; 
mnless1:=mXn—1;. modlessn := mnlessl —.n; 
done := mnlessl —1; k:=0; itlessl := 1; 


for 7 := 2 step 1 until done do ae 
begin comment computes j = k = n X % mod(mn—1); 
j:=k:=ifk S modlessn then k + n else k — modlessn; 
test: if j < tless1 then 
begin comment computes jru = n X jr mod(mn—1); 


jni=jJXn; 
j= jn — jn + molessl X mnlessi; 
go to lest 


end; 


302-P 1- RI 


comment avoid unnecessary exchanges; 
if j ~ tless1 then 


beginj :=j +1; 
¢:= alt]; aft] := aj]; aly] := ¢ 
end; 
tlessl := 
end 


end transpose 


CERTIFICATION OF ALGORITHM 302 [K2] 

TRANSPOSE VECTOR STORED ARRAY [J. Booth- 
royd, Comm. ACM 10 (May 1967), 292] 

I, D. G. MacLerop (Reed. 8 Jan. 1968) 

Department of Engineering Physics, Australian National 
University, Canberra 2600, Australia 


KEY WORDS AND PHRASES: matrix transposition, array 
transposition, vector stored array 
CR CATEGORIES: 5.39 


Algorithm 302 has been tested using both Fortran IV and 
AL@oL on A.N.U’s IBM System 360 model 50, with satisfactory 
results in each case. 

- There is a misprint in line 2 of the procedure: the comma be- 
tween a and comment should be replaced by a semicolon. 

This compact algorithm can be written even more briefly and 
with improved efficiency by making the following changes: 

1. Delete jn from the list of declared integers. 

2. Replace lines 8 through 13 of the procedure body by 

if j < tlessl then 

begin comment computes j;41 = n X jr mod (mn—1); 

newj: J:=jJXn-J+mxX mniessi; 

if j < ilessl then go to new] 
end; . 


In-situ transposition of a vector stored array may be considered 
as a permutation which decomposes into a set of unique cycles. 
Accessing arrays may be a relatively slow process (as in ALGOL 
with subscript-bound checks) and, in general, unnecessary ac- 
cesses should be avoided. The test in Algorithm 302 for unneces- 
sary exchanges has been inserted for this purpose but it should 
be pointed out that only one exchange is saved in each cycle. 
The inclusion of this test yields a useful gain in efficiency only for 
those situations in which: (i) the implementation is such that 
array access time is dominant; and (ii) the required transposition 
decomposes into a high proportion of short cycles, e.g. transposi- 
tion of a square matrix of order n decomposes into n cycles of 
length 1 and n(n—1)/2 cycles of length 2. 

If the implementation is such that accessing arrays is efficient, 
and the algorithm is to be used for rectangular as well as square 
matrices, replacement of lines 14 through 18 of the procedure 
body by 

sel be oe 

t:= aft]; alt] := alg]; aly] := 


may make the algorithm more efficient and even more compact. 
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ALGORITHM 303 | 

AN ADAPTIVE QUADRATURE PROCEDURE 

WITH RANDOM PANEL SIZES [D1] 

L. J. GALLAHER (Recd. 8 Nov. 1966 and 1 Feb. 1967) 

Georgia Institute of Technology, Engineering Experi- 
ment Station, Atlanta, Ga. 


real procedure Iniegral(a, x, 6, fx, random number, error); 
value a, b, error; 
real a, z, b, fx, error; 
real procedure random number ; 
comment This procedure approximates the quadrature of the 
function fz on the interval a < x < 6 to an estimated accuracy 
of error. It does this by sampling the function fz at appropriate 
points until the estimated error is less than error. The points to 
be sampled are determined by a combination of random sam- 
pling and of estimating what regions are more in need of sam- 
pling, this need being determined by the samples already taken. 
This process goes under the name “importance sampling” in nu- 
clear reactor literature [for example, see J. M. Hammersley and 
D. C. Handscomb, Monte Carlo Methods, John Wiley, Inc., 
1964, p. 57]. The form of importance sampling used here is based 
on estimates of the error contributed to the quadrature by the 
second derivative. That is, random samples of the average value 
of the second derivative of fz in a region are taken and used to 
decide if more samples are needed in that region. 

Randomness here is achieved through the real procedure 
random number. This procedure is not given explicitly here but 
can be any random number generator available, provided only 
that the numbers given are distributed on the interval 0 to 1. 
The random numbers given need not be of particularly high 
quality (i.e., need not: have low correlation). Further the ran- 
dom number generator need not be passed as a parameter but 
could be either global or local to the procedure Integral. 

This procedure is meant to be used for low-accuracy estimates 
of quadratures, especially large dimensional multiple integrals 
for which the high-accuracy methods would be too time consum- 
ing and expensive. It'can achieve high accuracies but not as 
efficiently as algorithms already in the literature. The general 
form of this algorithm is similar to Algorithm 145 [W. M. 
McKeeman, Adaptive Numerical Integration by Simpson’s 
Rule, Comm. ACM 6 (Dec. 1962), 604] (and others) except that 
in subdividing the region of integration the panel sizes are de- 
termined in part by the random-number generator. 

This quadrature procedure has been found particularly effec- 
tive in integrating ill-behaved functions of the following type. 

A. Functions having singularities on the boundary of the re- 

gion of integration. Such integrals as 


1 

vom] 
[ eA da, 
0 


a 


2 
daz 
0 0 


—~(k—2)2 
| Me dy(+y)™, 
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and 
W122 
[ dx | : dy(a?-+ yr), 
0 0 


have been successfully integrated with this procedure to 
1% accuracy. 

B. Functions having an infinite number of zeros in the interval 
of integration. Such integrals as 


1 
, dy +/y sin (1.51n y), 
Jo 


1 
i dy y~? sin (0.5 In y), 
0 


and 
2 1 
dx a dy xy sin (x In y), 
1 0 


have been successfully integrated with this procedure to 
1% accuracy. 

C. Functions having high-frequency oscillations or a large 
number of discontinuities. The function 


om 2 if the least significant bit of z is 1 
0 otherwise 


is almost as discontinuous as can be represented in a binary 
number computer. One hundred attempts at integrating this 
function on the interval 0 to 1 gave an average of the abso- 
lute value of the error ~ 0.13. 
The main limitation in integrating anomalous functions of the 
above type is in the hardware or software of the particular ma- 
chine being used. The procedure will fail when the interval is 
subdivided to a point where it is smaller than the smallest in 
magnitude nonzero number representable in the machine. 
A histogram is given below of the errors in the evaluation of 
the integrals 


1 
/ dy xy® sin (x In y) 
0 


and 


1 
| dy xy* cos (z In y) 
0 
for z = 1.04(0.04)2.00, with error tolerances 107? and 10-‘. 


Number of occurrences 0 0 1 1 | 53] 411 4 0 0 


| 


—§ —4 —-3 —2 -1 0 1 2 3 4 «5 


logio(e/€0) 


Here ¢o is the error requested, e is the error obtained. 

The formal parameter fz is an arithmetic expression dependent 
on z. In translating to another language it may be desirable to 
make this parameter a procedure identifier with appropriate 
modifications in the body of the program; 

if a = b then Integral := 0 
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else 
begin real fl, fr, c; 
real procedure Ini(a, x, b, fx, fc2, error); 
value a, b, fe2, error; 
real a, x, b, fx, fc2, error; 
begin real dz, dxc, fcl, fc3; 

error := error X 0.577; 

comment The factor 0.577 is an approximation to 1/+/3. 
The assumption here is that error contributed by the indi- 
vidual panels is random and not additive, thus the error 
from three panels is assumed to be 1/3 (not 3) times the 
error of one panel; 

dxe := (random number+0.5) X (b—a)/3; 

dz := (b—a—dxc)/2; 

x:=a+dz/2; fel := fa; 

x:= b — da/2; fed := fa; 

Int := 

if abs(dxX (fel—2Xfe2+fc3)) < error then 
dz X (fcl+fc3) + dace X fc2 

else 
Int (a, x, at+dzx, fx, fel, error) 
+Int (at+dz, x, b—dzx, fz, fc2, error) 
+Int (b—dx, x, b, fx, fc3, error) 


end; 

c:=a-+ (random number+0.5) * (b—a)/2; 
zw := (atec)/2; fl := fa; 

xz := (e+b)/2; fr := fz; 


error := abs(error) X 14.6; 

comment The factor 14.6 can be thought of as an empirical 
constant. There is some theoretical justification for calculat- 
ing an optimum value for this factor, but in practice it was 
determined empirically; 

Integral := 
Int(a, x, ¢, fx, fl, error) 
+Int(c, x, b, fx, fr, error) 


end 
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ALGORITHM 304 

NORMAL CURVE INTEGRAL [815] 

I. D. Hitt ann §. A. Joyce (Recd. 21 Nov. 1966) 

Medical Research Council, Statistical Research Unit, 
115 Gower Street, London W.C.1., England 


real procedure normal (x, upper); 
value x, upper; realz; Boolean upper; 

comment calculates the tail area of the standardized normal 
curve, 1.€., 


If upper is true the limits of integration are x and ©. 
If upper is false the limits are — © and zx. 
If x lies in the central area of the curve the method used is the 
convergent series 
nd xy xz 


oy) = 9 x 
eUaae [ en Giaye® di = 7 + + - — 
Jo 3x 5 e 3X 5 X7 


Ae Both 


(See [1, 26.2.11].) 
If x lies in one of the tails the method used is the continued 
fraction 


plan | eran dy =x be 8 _ 
“2 Se a as a ae eae Eo 


(See [1, 26.2.14].) 
The changeover point between the two methods is at abs(z) = 


3.5 if the required area is greater than 0.5. This value is chosen 
on grounds of speed. If, however, the required area is less than 


0.5, a changeover as far out as 3.5 will lead to the loss of three 
significant decimal figures due to cancellation error upon making 
a subtraction. In this case speed is sacrificed to accuracy and 
the changeover point is at abs(z) = 2.32, chosen as the point at 
which the area is 0.01. The value 2.32 may be changed to 1.28 
(the point at which the area is 0.1) if the full accuracy of the 
machine is desired over the range 1.28 < abs(x) < 2.32, but this 
leads to a considerable loss of speed and the accuracy lost by 
using 2.32 is only one decimal place. 

Except for this subtraction error, the procedure works vir- 
tually to the accuracy of themachine (provided that the constant 
1/sqrt(2r) is given to this accuracy) forz < 7 but to 1 decimal 
place less than the accuracy of the machine for xz > 7. 

REFERENCE: [1]. ABRaAMOvITZ, M.anp Stsecun, I. A. Handbook 
of Mathematical Functions, National Bureau of Standards, 
Appl. Math. Ser. 55, US Government Printing Office, Wash- 
ington, D.C., 1964; 

ifz = 0 then normal := 0.5 else 
begin 
real n, x2, y; 
upper := upper =x > 0; 
gi= abs(x); 22:= 24 X a; 
y := 0.3989422804014 X exp (—0.5X22); 
comment 0.3989422804014 = 1/sgri(2X:) ; 


n= y/x; 

if — upper A 1.0 — n = 1.0 then normal := 1.0 else 
if upper /\ n = 0 then normal := 0 else 

begin 
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real s, t; 
if x > (if upper then 2.32 else 3.5) then 
begin 
real pl, p2, ql, q2, m; 
ql := 2; p2:=y Xa; 
n:= 10; pl :=y; 
q2 := 22 + 1.0; 
if upper then 
begin 
s:= m:= pl/ql; 
t:= p2/q2 
end else 
begin 
s:=m:= 1.0 — pl/ql; 
t:= 1.0 — p2/q2 
end; 
for n := "+10 while m #t/As #tdo 
begin 
s:= xX p2+7nX pl; 
pl := p2; p2:= 8; 
s:= 2X q2tnxX ql; 
q2 := 8; 


t := if upper then p2/q2 else 1.0 — p2/q2 
end; 
normal := t 
end else 
begin 
si=2i=yXax;, n:= 10; ¢:=0; 
for n := n + 2.0 while s + t do 


begin 
L:= 8s; w:= a XK x2/n; 
s:=s+a2 
end; 
normal := if upper then 0.5 — s else 0.5 + s 
end 
end 


end normal 


REMARKS ON: 

ALGORITHM 123 [S15] 

REAL ERROR FUNCTION, ERF (2) 
[Martin Crawford and Robert Techo Comm. ACM 6 
(Sept. 1962), 483] 


ALGORITHM 180 [815] 
ERROR FUNCTION—LARGE X 
[Henry C. Thacher Jr. Comm. ACM 6 (June 1963), 314] 


ALGORITHM 181 [S15] 
COMPLEMENTARY ERROR FUNCTION— 
LARGE X 

{Henry C. Thacher Jr. Comm. ACM 6 (June 1963), 315] 


ALGORITHM 209 [S15] 
GAUSS 
[D. Ibbetson. Comm. ACM 6 (Oct. 1963), 616] 
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ALGORITHM 226 [S15] 
NORMAL DISTRIBUTION FUNCTION 
[S. J. Cyvin. Comm. ACM 7 (May 1964), 295] 


ALGORITHM 272 [815] 
PROCEDURE FOR THE NORMAL DISTRIBUTION 
FUNCTIONS | 

[M. D. MacLaren. Comm. ACM 8 (Dec. 1965), 789] 


ALGORITHM 304 [815] 

NORMAL CURVE INTEGRAL 
(I. D. Hill and S. A. Joyce. Comm. ACM 10 (June 
1967), 374] 


I. D. Hitz ano 8. A. Joyce (Recd. 21 Nov. 1966) 

Medical Research Council, 

Statistical Research Unit, 115 Gower Street, London 
W.C.1., England 


These algorithms were tested on the ICT Atlas computer using 
the Atlas ALGoL compiler. The following amendments were made 
and results found: 


ALGORITHM 123 

(i) value x; was inserted. 

(ii) abs(T) < »:—10 was changed to Y— T= Y 
both these amendments being as suggested in [1]. 

(iii) The labels 1 and 2 were changed to Z1 and £2, the go to 
statements being similarly amended. 

(iv) The constant was lengthened to 1. 12837916710. 

(v) The extra statement x := 0.707106781187 X x was made 
the first statement of the algorithm, so as 'to derive the 
normal integral instead of the error function. 

The results were accurate to 10 decimal places at all points 
tested except x = 1.0 where only 2 decimal accuracy was found, as 
noted in [2]. There seems to be no simple way of overcoming the 
difficulty [8], and any search for a method of doing so would 
hardly be worthwhile, as the algorithm is slower than Algorithm 
304 without being any more accurate. 


ALGORITIIM 180 
Gi) T := —0,56418958/zr/erp(v) was changed to 
T := —0.564189583548 < exp(—v)/xz. This is faster and also 
has the advantage, when »v is very large, of merely giving 0 
as the answer instead of causing overflow. _ 
(ii) The extra statement « := 0.707106781187 x was made 
as in (v) of Algorithm 123. 
(iii) form :=m-+41= waschanged to form :=m+2. m+l1 
is a misprint, and gives incorrect answers. _ 
The greatest error observed was 2 in the 11th decimal place. 


ALGORITHM 181 
(i) Similar to (i) of Algorithm 180 (except for minus sign). 
(ii) Similar to (ii) of Algorithm 180. 
(iii) m was declared as real instead of integer, as an alternative 
to the amendment suggested in [4]. . 
The results were accurate to 9 significant figures for « < 8, 
but to only 8 significant figures for x = 10 and x = 20. 


ALGORITHM 209 
No modification was made. The results were accurate to 7 decimal 
places. 


ALGORITHM 226 
(i) 10 f m/(480Xsqrt(2X3.14159265)) was changed to 
10 f m X 0.000831129750836. 
(ii) for 7 := 1 step 1 until 2 X n do was changed to 
=2xXn; for? := 1 step 1 until m do. 
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(iii) —(ixb/n) fT 2/8 was changed to —(tXb/n) T 2X 0.125. 
(iv) iff =2xXn—1 waschanged to ifi =m—1 
(v) b/(6XnXsqrt(2X3.14159265)) was changed to 


b/(15.0397696478X n). 
Tests were made with m = 7 and m = 11 with the following 
results: 
Number of significant Number of decimal 
. figures correct places correct 
m=7 m= il m=7 m= il 
—0.5 7 11 7 11 
—1.0 7 10 7 10 
—1.5 7 10 8 10 
—2.0 7 9 8 10 
—2.5 6 9 8 11 
—3.0 6 7 8 9 
—4.0 5 7 10 11 
—6.0 2 1 12 10 
—8.0 0 0 11 9 


Perhaps the comment with this algorithm should have referred 
to decimal places and not significant figures. To ask for 11 sig- 
nificant figures is stretching the machine’s ability to the limit, 
and where 10 significant figures are correct, this may be regarded 
as acceptable. 


ALGORITHM 272 
The constant .99999999 was lengthened to .9999999999. 

The accuracy was 8 decimal places at most of the points tested, 
but was only 5 decimal places atz = 0.8. 


ALGORITHM 304 
No modification was made. The errors in the 11th significant figure 
were: 


abs (x) x > 0 = upper x > 0 # upper 
0.5 1 1 
1.0 | 2 
1.5 219(5) 2 
2.0 - 259(0) 4 
3.0 0 0 
4.0 2 3 
6.0 6 0 
8.0 14 0 
10.0 23 0 
20.0 35 0 


* Due to the subtraction error mentioned in the comment section 
of the algorithm. Changing the constant 2.32 to 1.28 resulted in 
the figures shown in brackets. 

To test the claim that the algorithm works virtually to the 
accuracy of the machine, it was translated into double-length 
instructions of Mercury Autocode and run on the Atlas using the 
EXCHLF compiler (the constant being lengthened to 
0.398942280401432677939946). The results were compared with 
hand calculations using Table II of [5]. The errors in the 22nd 
significant figure were: 


COLLECTED ALGORITHMS (cont.) 


abs (x) x > 0 = upper z > 0 # upper 
1.0 2 3 
2.0 7 1 
4, 2 0 
8.0 8 0 
Timings. Timings of these algorithms were made in terms of 


the Atlas “Instruction Count,’ while evaluating the function 100 
times. The figures are not directly applicable to any other com- 
puter, but the relative times are likely to be much the same on 
other machines. 


Instruction Count For 100 EvauuaTIons 


Algorithm number 
abs(x) | 7 eae: 
123 | 180 | 181 | 209 226 272 | 304% | 304> 
;m=7 
0.5 58 8 97 24 25 24 
1.0 65° 8 176 24 29 29 
1.5 | 164 | 128 | 127 9 273 25 35 35 
2.0 | 194 78 90 8 387 24 39 39 


2.5 | 252 54 68 10 515 24 131 44 


3.0 42 51 9 628 25 97 50 
4.0 27 39 9 900¢ | 25 67 44 
6.0 15 30 6 14004 | 16 49 23 
8.0 9 28 7 21007 | 18 44 11 
10.0 10 25 5 27004 | 16 38 11 
20.0 9 22 5 65004 | 16 32 11 
30.0 9 9 5 | 10900¢ | 16 11 11 


» Readings refer to x > 0 = upper. 

b Readings refer to x > 0 # upper. 

* Time to produce incorrect answer. A count of 120 would fit a 
smooth curve with surrounding values. 

1100 times Instruction Count for 1 evaluation. 


Opinion. There are advantages in having two algorithms 
available for normal curve tail areas. One should be very fast and 
reasonably accurate, the other very accurate and reasonably 
fast. We conclude that Algorithm 209 is the best for the first 
requirement, and Algorithm 304 for the second. 

Algorithms 180 and 181 are faster than Algorithm 304 and may 
be preferred for this reason, but the method used shows itself in 
Algorithm 181 to be not quite as accurate, and the introduction 
of this method solely for the circumstances in which Algorithm 
180 is applicable hardly seems worth while. 


Acknowledgment. Thanks are due to Miss I. Allen for her 
help with the double-length hand calculations. 
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CERTIFICATION OF AND REMARK ON 

ALGORITHM 304 [S15] 

NORMAL CURVE INTEGRAL [I]. D. Hill and S. A. 
Joyce, Comm. ACM 10 (June 1967), 374] 

A. Brereson (Recd. 11 Aug. 1967 and 9 Nov. 1967) 

Computing Laboratory, Sunderland Technical College, 
Sunderland, Co. Durham, England 


KEY WORDS AND PHRASES: 
bility, special functions 
CR CATEGORIES: 5.5, 5.12 


normal curve integral, proba- 


Algorithm 304 was coded in 803 Augou and run on a National- 
Elliott 803 (with automatic floating-point unit). 

There are typographical errors in the first two integrals con- 
tained in the comment. 

The integrals should read: 


1 
(i) ae / e be dt 
T 
2 3 
(ii) avon | e OF dt = a+ = + 
0 


a x 
3X5 °3xKX5X7T) 

The algorithm was run as published and gave answers within 
the accuracy of the machine [1] for a random selection of values of 
x and upper. 

With the following alterations, however, the algorithm was 
made 0.2 percent more efficient in speed, and gave identical re- 
sults as above. 


(a) n := 1.0; was omitted from the linen :=1.0; pl := y; 
(b) the ten lines after g2 := 22 + 1.0; were replaced by: 


m:=n; t:= p2/ Q2; 
if — upper then 
begin 
m:=10—m; ¢:=10-t¢ 
end; 


for n := 2.0,n + 1.0 whilem #t/As # tdo 

(c) in the line beginnings :=2:=yX2;, n:=1.0; andt:=0; 
were omitted and the next line written: 
for n := 3.0, n + 2.0 while s # t do 


REFERENCE: 
1. A specification of 803 ALGOL; Description of 803 Library Pro- 
gram A104. Elliott-NCR Ltd., Borehamwood, Hertfordshire, 
England. (Jan. 1965, issue 4). 
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REMARK ON ALGORITHM 304 [815] 

NORMAL CURVE INTEGRAL [I. D. Hill and 8S. A. 
Joyce, Comm. ACM 10 (June 1967), 374] — 

Artur G. ApAMms* (Recd. 17 Feb. 1969 and 11 June 1969) 

Glaxo Research Ltd., Greenford, Middlesex, England 

* Deceased 7 July 1969. 

KEY WORDS AND PHRASES: normal curve integral, proba- 
bility, special functions . 

CR CATEGORIES: 5.5, 5.12 
Algorithm 304 may be made faster by using the continued frac- 

tion 


1 ie4 -1 —6 —20 —42 al — 
x x + 8+ at + 74 x? + 114+ 2? 415+ 2? + 19+ 


whose convergents are equal to alternate convergents of the con- 
tinued fraction 


t+ ab xt e+ ot e+ 

used in the original algorithm when =z lies in one of the tails. This 
requires two extra statements in the iteration loop, which, how- 
ever, will only be performed about half as many times. 

The alteration required to implement this improvement is to 
replace the 19 lines between 

if x > (if upper then 2.32 else 3.5) then 
and 


begin 
real pl, p2, gl, g2, a1, a2, m; 
al := 2.0; a2 := 0.0; 
n:= 22 + 3.0; 
pl:=y; gl i= 2; 
p2:= (n—10)Xy; g@:=nX 2; 
m:= pl/ql; t := p2/q2; 
if — upper then 


begin 
m:=10—m; t:=10-12 
end; 
for n := n+ 4.0, n + 4.0 while m #~ t/As # t'do 
begin 


al := al — 8.0; a2 := al + a2; 
s:=a2X pl +n™X p2; 

pl := p2; p2:= 8; 

s:=a2X ql+tnxX q2; 

This also incorporates the alterations suggested in [1] below. 

Comparison of the two versions using an ICIL1903 (37-bit 
floating-point mantissa), showed that the number jof iterations 
was approximately halved, and that the results differed only to 
the extent to be expected from rounding error. 

The original Algorithm 304 contains in its comment, ‘The 
value 2.32 may be changed to 1.28 --- if the full accuracy of the 
machine is desired.”” However a test of the two seit taking 
arguments in the sequence 2.34 step —0.01 showed that the origi- 
nal version ran into overflow at 1.44, and the new version at 1.58, 
on a machine allowing exponents up to 10”. 


REFERENCE 
1. Berason, A. Certification of and Remark on Algorithm 304, 
Normal Curve Integral. Comm. ACM 11 (Apr. 1968), 271. 
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REMARK ON ALGORITHM 304[S15] 

NORMAL CURVE INTEGRAL [I. D. Hill and 8. A. 
Joyce, Comm. ACM 10(June 1967), 374] 

Bo Hoitmearen (Recd. 30 Apr. 1970) 

Dept. KDO, ASEA, 8-721 83 Vasteris, Sweden 


KEY WORDS AND PHRASES: normal curve integral, proba- 
bility, special functions 
CR CATEGORIES: 5.12, 5.5 


Algorithm 304 with the remark of Adams was translated into 
Fortran IV and run on a GE-625 computer. The GE-625 has a 28- 
bit mantissa and allows exponents up to 10®. With upper = false 
and x < —2.32, the routine ran into overflow at several values of 
xz. To avoid this the following lines 

if q2 > 10° then 


begin 
pl := pl X 2-30; p2:= p2 & w»—30; 
qi := ql X w—30; g2 := q2 XK »—30 
end; 
were inserted after the line 
S3s=m;, m:=t; 
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ALGORITHM 305 

SYMMETRIC POLYNOMIALS [C1] 

P. BratLtey anD J. K. 8. McKay (Recd. 23 Sept. 1966, 
15 Feb. 1967 and 10 Mar. 1967) 

Department of Computer Science, 
Edinburgh, Edinburgh, Scotland 


University of 


real procedure express(b, unit, n); valuen; integer n; 
integer array b; array unit; 
; bi be bn 
comment express expresses the symmetric sum Deas: *9agn 
over n variables as a sum of determinants in the unitary sym- 
metric functions >) 2i,2iti, +++ ai,. The non-negative ex- 
ponents b; (¢ = 1, --- , n) are assumed to be in b[1:n] on entry 
to express. (The elements of this array are altered by the pro- 
cedure.) The symmetric sum is first expressed in terms of Schur 
functions which are then evaluated as determinants in the 
unitary symmetric functions. The Schur functions are generated 
in the local array c[1:27] with the sign in the local integer sig. 
The unitary functions of degree r = 1,--: , » should be in 
unit({l:n] on entry to express. 

This procedure may be used to determine the coefficients of a 
polynomial with roots the kth (k a positive integer) powers of 
the roots of a given monic polynomial. Use is made of the 
procedures determinant [Algorithm 224, Comm. ACM 12 (Apr. 
1964), 243)] and perm [Algorithm 306, Comm. ACM 10 (July 
1967), 450] 

REFERENCES: 

1. Lirtnewoop, D. E. The Theory of Group Characters. Claren- 
don Press, Oxford, England 1958, 2nd ed., Ch. 6. 
2. McKay, J. K.S. On the representation of symmetric poly- 
nomials. Comm. ACM 10 (July 1967), 428-429; 
begin integer array c, d(1:n]; 
integer sig, p, g,7,7; Boolean finish; 
procedure sori (v%,c,n); value n; 

integer array 7; 
comment sorts the integer array x[1: 7] into descending order. 

c is set to +1 according to whether the number of transposi- 

tions made is even or odd; 
begin integer 7, j, k; 


real sigma; 
integer cC, 7; 


c:= 1; 
TA:i := 1; k:=0; 7 := 2(1); 
QTl:t:=%4+1; if i > n then go to L3; 


if z[i] <j then 
begin z{t—1] := 7; j := 2{t] end 


else begin z[t—1] := z(t]; k:= 1; c:= —c end; 
go to Ll; 
L3: 2[n] := j; if k # 0 then go to L4 


end sort; 

procedure conjugate(p, long1, q, long2); value longl; 
integer array p,q; integer longl, long2; 

comment conjugate forms in g[{l:long2] the partition conju- 
gate to that in p[1:long1]}; 


begin 
integer 1, 1, 7; 
long2 := 0; 


for r := longl step —1 until 1 do 
begin 7 := if r = long1 then p[r] else p[r] — p[r+1]; 


305-P 1- R1 


for j := 1 step 1 until 7 do 
begin long2 := long2 + 1; 
end 
end conjugate; 
finish := true; 
sort (b, stig, n); 
if b{1] = 0 then begin sigma := 1; 
L3: perm (b, n, finish); 
if finish then go to L99; 
for i := 1 step 1 until n do 
begin cli] := b[t] + n — 7; 
for j := 1 step 1 until 1 — 1 do 
if c[i] = c[j] then go to L3 
end; 
sort (c, sig, n); 
for 1 := 1 step 1 until n do 
begin c{i] := c[t] +7 — n; 
if c{i] = 0 then 
begin 7 := 7 — 1; 
end; 
1:= 1; 
comment each Schur function and its sign are to be found in 
c[1:t] and sig respectively; 
L7: conjugate (c, 7, d, q); 
begin 
array z(1:q, 1:q]; 
for 7 := 1 step 1 until ¢ do 
for j := 1 step 1 until ¢ do 
begin p := d[t] —i +9; 
tlt, j]):= ifp <0 V p > n then 0 else 
if p = 0 then 1 else unit[p} 


q{long2] := r end 


sigma := 0; 


go to L99 end; 


go to L7 end 


end; 
sigma := sigma +-sig X determinant (z, q) 
end; 
go to L3; 
L99: express := sigma 


end express 


REMARK ON ALGORITHM 305 [C1] 

SYMMETRIC POLYNOMIALS [P. Bratley and J. K. 
S. McKay, Comm. ACM 10 (July 1967), 450] 

J. K. 8. McKay (Recd. 13 Sept. 1967 and 18 Dec. 1967) 

Atlas Laboratory, Science Research Council, Chilton, 
Didcot, Berks., England 


KEY WORDS AND PHRASES: symmetric polynomials, sym- 
metric sum, unitary symmetric functions, Schur functions 
CR CATEGORIES: 5.39 


The published algorithm fails with subscript overflow if 
>? =1 6: is greater than n and the partition conjugate to that in 
c [1:2] has more than n. parts. 

The symmetric sum is defined ambiguously in the initial com- 
ment. 

The following alterations: are suggested to remove the am- 
biguitv and correct the algorithm. 


COLLECTED ALGORITHMS (cont.) 


(1) In line 4, 
++» over n variables --- 
should be replaced by 
- over all distinct terms in n variables --- 
to remove any ambiguity in the definition of the symmetric 
sum. 


(2) In line 8, before 
The symmetric sum -:: 
insert 
Three examples to clarify the value of the symmetric 
sum are: 
If n = 8 and the 0; are 3, 2, 0 in any order the sum is 
Bi ta aaa oe ee ee aa ea eat ei ais 2a 
If n = 2 and the 0; are 2, 2, 0 in any order the sum is 
ay? ty" + ae? xg? + 23? 213. 
If all b; are zero the procedure will return the value 1. 
(3) In lines 17-18, the reference to Algorithm 224 should read: 
Comm. ACM 7 (Apr. 1964), 248 and (Dec. 1964), 702. 


(4) Lines 25-26 
integer array c,d(1:n]; 
integer 819 ,P,9,2,3 
should be replaced by 
integer $19 P5909} ¥) := 0; 
for 7 := 1 step 1 until n do j := j+5[z]; 
begin integer array c[1:n], d[0:j]; 
(5) In line 72, 
comment each Schur function --- 
should be replaced by 
comment at L7 each Schur function --- 


(6) In line 87, an end should be inserted immediately before 
end express 
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ALGORITHM 306 

PERMUTATIONS WITH REPETITIONS [G6] 

P. Bratiey (Recd. 23 Sept. 1966 and 15 Feb. 1967) 

Department of Computer Science, University of 
Edinburgh, Edinburgh Scotland 


procedure perm(a, n, last); value n; integer n; 
integer array a; Boolean last; 
comment a([l:n] is an integer array. Initially the elements of 
a{l:n] must be arranged in descending order and last must be 
set true. If the elements of a are not initially in descending 
order the effect of the procedure is undefined. Successive calls of 
perm generate in a all permutations of its elements in reverse 
lexicographical order. 
last is set false if the procedure has generated a new permuta- 
tion, but if the procedure is entered after all the permutations 
have been generated, last will be set true. Neither a nor 7 should 
be altered between successive calls of the procedure; 
begin integer 1, p, 4,7; 
own integer ™; own integer array b[1:n]; 
if — last then go to £12; last := false; 
for 7 := 1 step 1 until n do b[7] := az]; 
p := b{n]; 
for 7 := n step —1 until 1 do 
if p ~ b[t] then 
begin m := 1; go to L99 end; 
m:=0; goto L99; 
£12: if m = 0 then go to L10; 
p:= blm|l; q:=m; r :=0; 
D9: «:= n; 
L4: if alt] = p then go to L2; 
if aft] < pthen r := 1; 
Lb: t@:=t—1; goto L4; 
L2: aft] := b[n] — 1; ifr = 0 then go to L8; 
pie Guan. Gee eee 
L3: r:=r+1; ifr >n then go to Lil else if afr] > p 
then. go to L3; 
Ell: if of = P then go to L1; r := 0; 
L6: r+1; ifa[r] > p then go to L6; 
i= “be 1; if ¢ = n then go to L7; 
q:=q+1; goto L6; 
:= false; go to L99; 
[T8: ¢gq:=q-—1; if g = 0 then go to L10; 
[gq] = p then go to L5; 
p := bq]; go to L9; 
£10: last := true; 


me 
Lar) 
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ALGORITHM 307 

SYMMETRIC GROUP CHARACTERS [A]1] 

J. K.S. McKay (Reed. 23 Sept. 1966, 15 Feb. 1967, and 
10 Mar. 1967) 

Department of Computer Science, 
Edinburgh, Edinburgh, Scotland 


University of 


integer procedure character (n, rep, longr, class, longc, first); 
value n, rep, longr, class, longe; 
integer n, longr, longc; Boolean first; 
integer array rep, class; 

comment character produces the irreducible character of the 
symmetric group corresponding to the partitions of the repre- 
sentation and the class of the group S, stored with parts in 
descending order in arrays rep[l:longr] and class[1:longc], re- 
spectively. Both arrays are preserved. The method is similar 
to that described by Bivins et al. [1]. Comét describes a later 
method. 

On first entry to character, first should be set true in order to 
initialize the own array p[0:n, O:n]. This single initialization is 
sufficient for all symmetric groups of degree less than or equal 
to n. character is intended for computing individual characters. 
Tf a substantial part of the character table is required it is sug- 
gested that procedure generate [Algorithm 263, Comm. ACM 
8 (Aug. 1965), 493)] be used to produce the partitions prior to 
use of character. If this is done, then the own array p should be 
replaced by a suitable global array, and jfirst‘should be set false 
to avoid unwanted initialization. character uses procedures set, 
generate, and place [Algorithms 262, 263, 264, Comm. ACM 8 
(Aug. 1965), 493]. 


REFERENCES: 


1. Bivins, R. L., Metropouis, N., Stein, P. R., and Weis, 
M. B. Characters of the symmetric groups of degree 15 
and 16. MTAC 8 (1954), 212-216. 
2. Lirrpewoop, D. E. The Theory of Group Characters. Claren- 
don Press. Oxford, England 1958, 2d ed., Ch. 5. 
3. Comét, S. Improved methods to calculate the characters 
of the symmetric group. MTAC 14 (1960), 104~-117.; 
begin 
integer procedure degree (n, rep, length); value n, length; 
integer n, length; integer array rep; 
comment degree gives the degree of the representation of the 
symmetric group on n symbols defined by the partition 
rep(1:length] with parts in descending order; 
begin 
own integer array p[0:n, 0:n); 
integer array g[l:length]; integer 7, j, deg; 
integer procedure fac(n); value n; integer n; 
fac := if n = 1 then 1 else n X fac(n—1); 
for 1 := 1 step 1 until length do 
gli] := rep{t] + length — 7; 


deg := fac(n); 
for 71 := 1 step 1 until length do 
for j := 7 +1 step 1 until length do 
deg := deg X (ql[¢J—aIJ]); 
for 7 := 1 step 1 until length do 
deg := deg + fac(q{t]); 


degree := deg 
end degree; 
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if first then 
begin set (p, n); first := false end; 
begin 
integer array pr{i:n], r[0:1, O:p[n, n]—1); 
integer length, m, t, old, new, index, i, char, k, coeff, u, pos, 


jl, 72; 

m := longe; 

new := Nn; 

index := 1; 

for 7 := 0 step 1 until pin, n] — 1 do 
r[index, 1] := 0; 


r[index, place(p, n, rep)] := 1; 
for ¢ := 1 step 1 until m do 
begin if class{t] = 1 then go to idenitty; 
index := 1 — index; old := new; new := new — class(t); 
for 7 := 0 step 1 until p[new, new] — 1 do 
rlindex, t] := 0; 
for u := plold, old] — 1 step — 1 until 0 do 
begin if r[l — index, u] = 0 then go to B; 
generate (p, old, u, pr, length); 
k := length; jl := 1; 
G: j2 := jl; coeff := r{l—indez, ul; 
for 7 := 1 step 1 until k do rep{t] := pr[z]; 
if rep{1] = old then go to H; 
rep[j2] := rep{j2] — class{t]; 
if rep[j2] + k — j2 < 0 then go to B; 
if rep[j2] > if G2 = k then 0 else rep[j2+1]) then gotoF; 
if rep[j2+1]= rep{j2] + 1 then go to J; 
4:= rep[j2+1]; rep{j2+1] := rep{j2] + 1; 
replj2] := 7 —1; coeff:= — leoef; 72 := j2+4+1; 


go to E; 
be BS rep(1] := rep{1] — class{t]; 
F: pos := place(p, new, rep); 


rlindex, pos] := rfindex, pos] + coeff; 


J: jl:= jl +1; if jl < k then go toG; 
B: 
end 
end; 
A: char := r{indezx, 0]; goto Z; 
identity: char := 0; 
for u := p[new,new] — 1 step — 1 until 0 do 


begin if rlindez, u] = 0 then go to BB; 
generate(p, new, u, pr, length); 
char := char + rlindex, u] X degree (new, pr, length); 
BB: 
end; 
Z: character := char 
end 
end character 


REMARK ON ALGORITHM 307 [A1] 
SYMMETRIC GROUP CHARACTERS 
[J. K. S. McKay, Comm. ACM 10 (July 1967), 451] 
J. K. 8. McKay (Reed. 13 Sept. 1967) 
Dept. of Computer Science, University of Edinburgh, 
Edinburgh, Scotland 


COLLECTED ALGORITHMS (cont.) 


Three corrections are noted. 
(1) Line 39: 
own integer array p[0:n,0:n]; 


should be moved to the line after the begin in line 32. 


(2) At E the line should read 
FE: if rep[j2] > (if j2=k then 0 else rep[j2+1)) 
then go to F; 
(3) Three lines, later 
coeff := —1 coeff; 
should read 
coef := —coeff; 
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ALGORITHM 308 

GENERATION OF PERMUTATIONS IN PSEUDO- 
LEXICOGRAPHIC ORDER [G6] 

R. J. Orp-Smiru (Recd. 11 Nov. 1966, 1 Dec. 1966, 28 
Dec. 1966 and 27 Mar. 1967) 

Computing Laboratory, University of Bradford, England 


Lexicographic generation has the advantage of: producing an 
order easily followed by the user, but its real value in certain com- 
binatorial applications is that a (k—1)-th intransitive subgroup 
of permutations is generated before the kth element is moved. By 
not insisting on strict lexicographic generation, though preserving 
the latter property, an enormous reduction in the total number of 
transpositions is obtained. The total number of transpositions in 
this algorithm can be shown to tend asymptotically to (sinh 1) n! 
which is less than in Algorithm 86 [J. E. L. Peck and G. F. Schrack, 
Permute, Comm. ACM 6 (Apr. 1962), 208] and almost as good as 
Algorithm 115 {H. F. Trotter, Perm, Comm. ACM 6 (Aug. 1962), 
434]. The algorithm offers a further useful facility, Like several 
others it uses a nonlocal Boolean variable called first, which may be 
assigned the value true, to initialize generation. On'procedure call 
this is set false and remains so until it is again set true when com- 
plete generation of permutations has been achieved. At any subse- 
quent call after initializing generation of permutations of degree 
n, one may set parameter n = n’ where n’ < n. Further calls with 
this value may continue until the completion of the subgroup of 
degree (n’ — 1) when first will be set true. The process can be con- 
tinued by resetting first false and calling with a larger value of n. 
This gives the user complete control over the main attribute which 
lexicographic order offers. There is no restriction on the elements 
permuted. Table I gives results obtained for HCONOPERM. 
Times given in seconds are for an ICT 1905 computer. The algo- 
rithm has also been tested successfully on IBM 7094, Elliott 503 
and STC Stantec computers. tn is the time for complete generation 
of n! permutations. 7, has the usual definition r, = tn/ (n+ tn). 


TABLE I 


Number of 


Algorithm trans positions 


ECONOPERM [0.88 6.2) 50.6) —| 1.04) 1.02; — 1.175n! 


procedure ECONOPERM (2,7); valuen; integer n; 
array 2; 
begin own integer array q[2:n]; 
comment own dynamic arrays are not often implemented. 
The upper bound will then have to be given explicitly; 


integer k,1l,m; real t; 
L:= 1; k:= 2; 
if first then 
begin first := false; go to label end; 


comment the above is the initialization process; 
loop: if q[k] = k then 

begin if k < n then 

begin k := k +1; go to loop end 

else begin first := true; go to finish end 
end; 
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=k-1; 
comment note 7 called by value; 
label: for m := 2 step 1 until 1 do g{m] := 
comment after the initialization the for statement sets all 
elements of g array to 1. Otherwise only the first k—2 elements 
are reset 1; 
glk) := glk} +1; 
transpose: = g[l]; xl] := afk); x[k] := ¢; 
l:=14+1; ks=k—-1; 
if 1 < k then go to transpose; 
comment when k < 4 only one transposition occurs. On final 
exit when-first is reset true, no transposition occurs at all; 
finish: 
end of procedure ECONOPERM 


REMARK ON ALGORITHM 308 [G6] 

GENERATION OF PERMUTATIONS IN PSEUDO- 

LEXICOGRAPHIC ORDER [R. J. Ord-Smith, Comm. 
ACM 10 (July 1967), 452] 

R. J. Orp-Smiru (Recd. 21 May 1969) 

Computing Laboratory, University of Bradford, England 

KEY WORDS AND PHRASES: permutations, lexicographic 


order, lexicographic generation, permutation generation 
CR CATEGORIES: 5.39 


Following the construction of the very fast lexicographic 
permutation Algorithm 323 [1] it has become clear that the permu- 


_ tation. sequence generated by the Algorithm 308 can be obtained 


more quickly. In fact, replacement of 


trstart:m := ee t:= 2[m]; 2[m] := 2[k]; z[k] := 
gk):=m+i1; k:=k-1; 

by 

irstart: q{k] := q{k] + 1; 


in Algorithm 323 produces the ECONOPERM sequence of AI- 
gorithm 308. 
The times are as follows on an ICT 1905, in seconds. 


ty ts 


Algorithm 323 6 47 

New ECONOPERM 5.9 45 

Old ECONOPERM 6.2 50.6 
REFERENCE: 


1. Orp-SmituH, R. J. Algorithm 323: Generation of permutations 
in lexicographic order. Comm. ACM 11 (Feb. 1968), 117. 
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ALGORITHM 309 

GAMMA FUNCTION WITH ARBITRARY PRE- 
CISION [S14] 

ANTONINO MacHapo Souza FinHo AND GEORGES SCHWACH- 
HEIM (Recd. 12 Apr, 1966 and 14 Apr. 1967) 

Centro Brasileiro de Pesquisas Fisicas, Rio de Janeiro, 
ZC82, Brazil 


procedure gamma(z,y,msize,error) ; 
value z, msize; real z; integer msize; label error; 

comment This procedure computes the value y of the gamma 
function for any real argument z for which the result can be 
represented within the computer, working with msize decimal 
digits. An exit is made thru the label error when the argument is 
a pole or is too large, while a zero result is returned when the 
argument is too small for a correct internal representation of the 
result. 

This procedure is especially useful for variable field length 
computers and for double- or multiple-precision computations, 
when a simple power series algorithm is no longer applicable. 

It computes the gamma function thru the Stirling asymptotie 
series for the logarithm of the gamma function with an argu- 
ment increased by an appropriate integer to insure the required 
precision with the least. computation work. 

Negative arguments are reduced to positive ones by: 


WT 


sin (rz) X TL — 2) 


T(z) = 


This procedure is not recursive and uses no own variable. 

It was translated to Fortran II and run on an IBM 1620. The 

errors were at most of afew hundred units in the last digit of the 

mantissa, being due to the use of logarithms; 

begin 

real procedure loggamma (t); value t; real t; 

comment The loggamma auxiliary procedure computes the 
logarithm of the gamma function of a positive argument t. 
If its argument is below a value tmin, loggamma first increases 
the argument by an integer value, using the relation: 


In P() = In F(t+k) — Inf (t + 1)) 


where In r'(¢ + k) is computed by the procedure lgm. 

The formula we use for tmin is a rough empirical relation 
to minimise computation time. 

Indeed an increase of k while decreasing the number of 
terms of the series, results in more computation for the fac- 
tor In ([[.(t+7)); 


begin integer tmin; 


imin := if msize > 18 then msize — 10 else 7; 
if ¢ > tmin then loggamma := Igm(t) 
else 
begin real f; 
f=; 
L: t:= ¢+1; 


if ¢ < tmin then 
begin f := f X 1; 
go to L 
end; 
loggamma := lgm(t) — In (f) 
end 
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end of procedure loggamma; 

real procedure lgm(w); value w; real wv; 

comment This procedure evaluates the logarithm of the 
gamma function acccrding to the Stirling asymptotic series: 


a C; 
In T(w) a (w — 4) X In (w) — w + In V2e + Do 
The coefficients ¢ = Bxy/(2i(2i—-1)), Ba being the 


Bernoulli numbers, are rational numbers given here as irre- 
ducible fractions. 

Twenty terms are sufficient for a precision of up to 50 
decimal digits; 


begin array c[{1:20]; real w2, presum, const, den, sum; 


integer 1; 

e{1]) := 1/12; c[2] := —1/360; 

c[3] := 1/1260; c[4] := —1/1680; 

c[5| := 1/1188; c[6] := —691/360360; 

c(7] := 1/156; c(8] := —3617/122400; 

c(9] := 43867/244188; c[10] := —174611/125400; 
c[11] := 77683/5796; c{12] := —236364091/1506960; 
c[13] : = 657931/300; c[14] : = —8392780147/93960; 
c[15] := 1723168255201 /2492028 ; 

c[16] := —7709321041217/505920;. 

c[17] := 151628697551 /396; 

c[18] := —26315271553053477373/2418179400; 

c[19] := 154210205991661 /444; 

c[20] := —261082718496449122051/21106800; 


const := .91893853320467274178032973640561763986139747363778 ; 
comment const = Inv/2r; 
den := w; w2:= wX w; 
w + const; 
for 7 := 1 step 1 until 20 do 
begin sum := presum +- c[t]/den; 
if sum = presum then go to exit: 
den := den X w2; 
presum := sum 
end; 
exit : lgm := sum 
end of procedure lgm; 
comment: main procedure gamma starts here; 
real pi; 
pi := 3.141592653589793238462643383279502884197 1693993751 ; 
comment argov, argund, Inunder are hardware dependent con- 
stants that are compared to the arguments of intermediate 
results, setting error exit or zero result to prevent exponent 
over or underflow. Should be replaced in the procedure by 
the appropriate numbers; 
if z > argov then go to error else if z = entier (z) then 
begin if z < O then goto error; y := 1; 
if z > 2 then 
begin loop:z2 :=z—1; y:=yX 2; 
if z > 2 then go to loop 
end. 
end when 2 is integer 
else if abs(z) < 10 fT (—msize) then y := 1/z 
else if z < 0 then 
begin if 2 < argund then y := 0 
else 


presum := (w—.5) X In(w) — 
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begin comment As the use of the sine subroutine for large 
arguments might introduce errors, some reductions of 
the argument are made before using it; 
Boolean procedure parity (m); real m; 
begin integer j; 
gj := entier(m); parity:=j=uj+2xX2 
end parity; 
real procedure decimal(x); real x; 
begin integer 7; 
Ni= 2; 
decimal := abs(a—n) 
end decimal; 
real delta, ex; 
sit := decimal(z) X pi; 
= (if delta<10 T (—msize/2) then — dn (dectmal(z)) else 
In (ps (stn (delta)))) — loggamma(1—z); 
y := if ex < Inunder then 0 else 
if parity (z) then erp(er) else 
—exp (ex) 
end 
end when 2 is negative 
else y := exp(loggamma(z)) 
end of procedure gamma 
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ALGORITHAL 310 

PRIME NUMBER GENERATOR 1 [A1] 

B.A. CHarrres (Reed. 25 Oct. 1966 and 138 Apr. 1967) 

Computer Seience Center, University of Virginia, 

Charlottesville, Virginia 

integer procedure sievel(m, pi; value om; integer im; ins 
leger array p; 

comment sfevel(m, p) generates the prime numbers less than 
or equal to m, and places them in the array p, setting pll| = 2, 
pl2} = 8, pls} = 5, ---, plk] = Uargest prime found). The value 
of the procedure is &, the number of primes less than or equal to 
Mm, 

The method used is amodification of the Sieve of Eratosthenes. 

In its customary form this method requires a repeated sweeping 
over mo numbers (or m/2 odd numbers), crossing out all multiples 
of the th prime on the 7th sweep. The variation of the method 
used here condenses all these sweeps into one. When the odd 
integer n is being tested (“if n=q[i]’’) to see whether it should 
be crossed out (“‘t:= false’’), g[¢], for? = 3, 4, ---,7, contains 
the smallest odd multiple of p{z] which is no smaller than either 
n or p(t] 12. The sequence of values taken on by g[t} defines the 
set. of numbers crossed out because they are multiples of p[z]. 
The initial value of g{z] is p[z] T 2 because all smaller odd multi- 
ples of p{z] have at least one other odd prime factor smaller than 
pit]. For the same reason, g{7+1] does not become active (‘9 
:= j+1"’) until n has become equal to p{j] f 2. The dimension of 
the arrays g and dq is therefore the number of primes less than 
or equal to the square root of m. Thus we have replaced repeated 
sweeps over the array p by (many more) repeated sweeps over 
part of the much smaller array g. This does not reduce the 
amount of computation, but does lead to a much more efficient 
computer implementation, as only the arrays g and dq need be 
held in a random access store.; 

begin 
integer array q, dq[2 : 2.7Xsqri(m)/In(m)]- 
integer 7, J, k,n; 
Boolean 1; 
pA] c= y= k= 2; pl2] := 3; 
for n := 5 step 2 until m do 


begin 
{ := true; 
for + := 2 step 1 until j do 
hegin 
ifn = glt] then 
hegin 


qt] = n + dat]; 
if i = j then 
begin 
J:= J +1; al = plylt2; 
dqlj] := 2X ply]; goto A 
end 
end 
end; 
if ¢ then 
begin 
k:=k+1; plk[:=n 
end; 
A: end; 
stevel := k 
end stevel 


(:= false; 
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REMARKS ON: 


ALGORITIIM 34 [A1] 

SreveE [T. C. Wood, Comm. ACA 4 (Mar. 1961), 151] 

ALGORITHM 310 [A1] 

PRIME NUMBER GENERATOR 1 [B. A. Chartres, 
Comm. ACA 10 (Sept. 1967), 569] 

ALGORITHM 311 [A1] 

PRIME NUMBER GENERATOR 2 [B. A. Chartres, 
Comm. AC AL 10 (Sept. 1967), 570] 


B. A. Cuartres (Reed. 13 Apr. 1967) 
Computer Scienee Center, University of Virginia, 
Charlottesville, Virginia 


The three procedures Siere(.p), stevel(m,p), and sleve2(m.p), 
which all perform the same operation of putting the primes less 
than or equal to m into the array p, were tested and compared for 
speed on the Burroughs B5500 at the University of Virginia. The 
modification of Steve suggested by J. 8. Willmore (Comm, ACM & 
(Aug. 1962), 438] was used. It was also found that Steve could be 
speeded up by a factor of 1.95 by avoiding the repeated evaluation 
of sqil(n). The modifieation required consisted of declaring an 
integer variable s, inserting the statement s := sqré(n) immedi- 
ately after 7 := 3, and replacing pl?|<sqrt(n) by plt|J<s. 

The running times for the eomputation of the first 10,000 primes 
were: 


Sreve (Algorithm 35) S45 sec 


Scere (modified) 13-4 sec 
stevel 220 sec 
sieve2 91 see 


The time required to compute the first k primes was found to be, 
for each algorithm, remarkably accurately represented by a power 
law throughout the range 500 < & < 50,000. The running time of 
Steve varied as k!°, that of sverel as k)8, and that of szeve2 as 
A), Thus the speed advantage of steve2 over the other algorithms 
increases with increasing &. However, it should be noted that 
steve2 took approximately 33 minutes to find the first 100,000 
primes, and, if the power law can be trusted for extrapolation past 
this point (there is no reason known why it should be), it would 
take about 12 hours to find the first million primes. 


CERTIFICATION OF ALGORITHM 310 [A1} 

PRIME NUMBER GENERATOR 1 [B. A. Chartres, 
Comm. ACM 9 (Sept. 1967), 569] 

Donaup G. Rapp anp Larry D. Scotr (Recd. 21 Apr. 
1969 and 13 Aug. 1969) 

Computer Science Group, Texas A&M University, College 
Station, TX 77843 


KEY WORDS AND PHRASES: 
CR CATEGORIES: 5.0 


Algorithm 310 was coded in AuGcou 60 reference language and 
run on an IBM 360/65. The algorithm was tested for a large range 
of values including m = 5, 10, 501, and 2000. Reference [1] was 


prime numbers, generator 
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utilized to verify the theory involved in the algorithm before actual 
machine testing. 

The intention of Algorithm 310 is to give only the number of 
primes less than or equal to m. Actual confirmation in the initial 
phases was accomplished through additional instructions that 
printed the array of prime numbers, p, and the number of primes, 
k. Both references listed were useful in substantiation of the prime 
numbers given. These references were again useful in verifying that 
all the primes in the array had been discovered and printed. 

Each test produced the correct number of primes, k, for the 
specified range, m. When the primes were listed, the total taken by 
hand agreed with the number, k, given by the algorithm. 

REFERENCES: 

1. Esrermann, T. Introduction to Modern Prime Number Theory. 
Cambridge U. Press, Cambridge, England, 1952. 

2. Lenmer, D. N. Carnegie Institution of Washington, Publica- 
tion No. 165. Hafner, New. York, 1956. 


310-P 2- 


0 


COLLECTED ALGORITHMS FROM CACM 


ALGORITHM 311 

PRIME NUMBER GENERATOR 2 [A1| 

B. A. CHartres (Reed. 25 Oct. 1966 and 13 Apr. 1967) 
Computer Science Center, University of Virginia, 
Charlottesville, Virginia 


integer procedure sieve2(m, p); 
integer m; integer array p; 

sieve2 is a faster version of stevel. Two changes were 
made to obtain higher speed. 

(1) The multiples g[z] are sorted, smallest first, so that each 
value of n does not need to be compared with every g[i]. The 
sorted order of the q[{i] is indicated by an index array r. The 
ith sorted element of q is g[r[z]]. It was found empirically that 
greater speed is obtained when the g[r[zt]| are not kept con- 
stantly sorted, but are re-sorted only at the time a new prime is 
discovered. The integer jj indicates which of the q[r[z]] are sorted: 
qr(3]| through g[r[77-1]] are out of order, whereas g[r[j/]] through 
air[{j]] are in order. Sorting is performed in two stages. A sift 
sort first rearranges r(3] through r{jj-1] into rr[3] through 
rr[jj-1]. Then a single merge sort combines rr[3] through rr{jj7-1] 
and r[j7] through r[j] into r(1| through r[j]. 

(2) All multiples of 3 are automatically excluded from con- 
sideration by stepping n alternately by 2 and 4, and, in a similar 
way, by stepping g[z] alternately by 2 X p{z] and 4 X plz].; 

begin 
integer array q, dq, sq, 7, rr(2: 2.7Xsqrt(m)/ln(m) |; 
integer 1, j, Jj, k, n, tr, jr, dn; 


value m; 


comment 


Boolean 1; 

pil] := dn := 2; pl.) := 7 := 77 =k = r[3] := 3; 
[3] := 5; @[3] := 25; dg[3] := 10; sqg[3] := 30; 
for n := 7 step dn until m do 

begin 


{:= true; dn := 6 — dn; 
for 1 := 3 step 1 until 77 do 


begin 
tr := rit]; 
ifn = qltr| then 
begin 


qltr] := n + dafir]; 
dg{tr] := sq{tr] — dqlzr]; 
t:= false; 
if 7 = 7) then 
begin 
Hye 
if iy = j then 
begin 
j=jths ils 
qj] := ply) T 2; 
sqj] := 6 X ply]; 
dqlj] := sqlj] X A+(plj] + 3)) — 2X fy] 
end 
end 
end 
end; 
if ¢ then 
begin 
k:=k+1; plik] =n; 
A: if jj = 3 then goto F; 
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7 = 9 — 4; 

if q[rlg7]l < girljj+1]] then go to 4; 
comment sift sort; 

rr[3] := r[3]; 

for ir := 4 step 1 until jj do 


begin 
t:= ir —1; 
B: if g[r{ir]] < g[rr[t]] then 
begin 


mrfi+1] := rrft]; t:= 7-1; 


ift > 38then goto B 
end; 
rr(t+l] := rfer] 
end; 
comment merge sort; 
t:= tr := 38; grs= +1; 
C: if qlrriir]] < q{rljr]] then 
begin 
r(tq] := rrltr]; tr := ir +1; 
if ir > 37 then go to E 
end 
else 
begin 
r(tq] := rlyrl; gr c= jr +1; 
if jr > j then go to D 


end; 
t:=1+1; gotoC; 
D: @:=t+1; rh] = rrlir]; sir +1; 


if ir < jj then go to D; 


K: 7 := 3 
end; 

F; end; 
steve2 := k 


end szeve2 


REMARKS ON: 


ALGORITHM 35 [A1] 

Sreve [T. C. Wood, Comm. ACM 4 (Mar. 1961), 151] 

ALGORITHM 310 [A1] 

PRIME NUMBER GENERATOR 1 [B. A. Chartres, 
Comm. ACM 10 (Sept. 1967), 569] 

ALGORITHM 311 [A1}j 

PRIME NUMBER GENERATOR 2 [B. A. Chartres, 
Comm. ACM 10 (Sept. 1967), 570] 


B. A. Caartres (Reed. 13 Apr. 1967) 
Computer Science Center, University of Virginia, 
Charlottesville, Virginia 


The three procedures Szeve(m,p), stevel(m,p), and steve2(m,p), 
which all perform the same operation of putting the primes less 
than or equal to m into the array p, were tested and compared for 
speed on the Burroughs B5500 at the University of Virginia. The 
modification of Sieve suggested by J. S. Hillmore (Comm. ACM 6 
(Aug. 1962), 438] was used. It was also found that Sieve could be 
speeded up by a factor of 1.95 by avoiding the repeated evaluation 
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of sgrt(n). The modification required consisted: of declaring an 
integer variable s, inserting the statement s :== sgrt(n) immedi- 
ately after 7 := 3, and replacing p[i]<sqrt(n) by pli]<s. 
The running times for the computation of the first 10,000 primes 
were: 
Steve (Algorithm 35) 845 sec 


Steve (modified) 434 sec 
stevel 220 sec 
steve2 91 sec 


The time required to compute the first k primes was found to be, 
for each algorithm, remarkably accurately represented by a power 
law throughout the range 500 < k < 50,000. The running time of 
Sieve varied as k!°, that of stevel as 13, and that of steve2 as 
k1-%, Thus the speed advantage of sieve2 over the other algorithms 
increases with increasing k. However, it should be noted that 
sieve2 took approximately 33 minutes to find the first 100,000 
primes, and, if the power law can be trusted for extrapolation past 
this point (there is no reason known why it should be), it would 
take about 12 hours to find the first million primes. 
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ALGORITHM 312 

ABSOLUTE VALUE AND SQUARE ROOT OF A 
COMPLEX NUMBER, [A2] 

PavuL FRIEDLAND (Recd. 13 Feb. 1967 and 16 Junc 1967) 
Burroughs Corporation, Pasadena, California 


real procedure cabs (x,y); 
value z,y; real x, y; 
comment This procedure returns the absolute value of the com- 
plex number x + iy. The procedure provides for the possible 
begin 
x := abs (x); y := abs (y); 
cabs := if x = 0 then y else if y = 0 then z else 
ifx > y then x X sqrt (1+(y/z) TF 2) 
else y X sqrt (l+(2/y) T 2) 
end cabs; 
procedure csqrt (z,y,a,b); 
value x,y; real z, y, a, b; 
comment This procedure computes @ and 6 where a + ib = 
Vx + ty. Forz = y = Owe have thata = & = 080 we will assume 
that x and y are not both zero. 
Solving simultaneously fora and thend --- 


(1) a= 4/2 lee, b = y/(2a) 
2 
and for 6 and then a... 
(2) baa g/ telat, a = y/(2b) 
2 


To keep the radical real, we will always use the positive sign 
with |x + ty | and use equation (1) with the sign of “a” taken 
positive for z > 0 and (2) when x < 0, with the sign of “b” 
taken positive for y > 0 and negative for y < 0; 

begin 
ife =OAy = Othena := b := Oelse 
begin 

a := sqrt ((abs (x) + eabs (2, y)) X 0.5); 

ifx > Othen b := y/(a + a) else 


begin 
b:= ify < 0 then —a else a; 
a:= y/(b + 6) 
end 
end 


end csqri 
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ALGORITHM 313 

MULTI-DIMENSIONAL PARTITION 
GENERATOR [A1] 

P. Braguey anp J. K. 8S. McKay (Recd. 23 Aug. 1966, 
15 Feb. 1967 and 14 Apr. 1967) 

Dept. of Computer Science, University of Edinburgh 


procedure partition (N, dim, use); 
value N, dim; integer N, dim; nesvedites USE; 
comment A partition of N is an ordered sequence of positive 


k 
integers, m1 > m2 > ms > +++ > m, such that D> n; = N. 
i=1 
Such a partition may be represented by a Ferrers-Sylvester 
graph of nodes with n; nodes in the zth row, e.g., 
* * * * * 
* * * * 
* * 
* * 
represents 5, 4, 2, 2. This two-dimensional diagram may be gen- 
eralized in a natural way to three, or more, dimensions. More 
formally, we regard a d-dimensional partition of n as a set S of 
n nodes, each defined by its non-negative integer coordinates 
such that 
(v1 ,%2, °°" 
whenever 
O-<ay = 2, forall. 152, 37d. 

This generalization reduces to the usual definition when d = 2. 
There is little literature on these generalized partitions. It is 
with a view to facilitating numerical studies that this algorithm 
is published. 

After generation, each partition is presdatedt to the procedure 
use, which should be supplied by the user for: the purpose he 
requires. use has three formal parameters, the first being the 
name of a two-dimensional integer array, and:the second and 
third being integers giving the size of this array. When the pro- 
cedure is called by 


,ta) © S if and only if (x1', 22’, «+ , aa’) € 8 


use (current, dim, N) 


then the coordinates of the nodes entering into the newly 

generated multi-dimensional partition will be found in current 

[1:d¢m,1:N]. The parameters of use should be called by value, 

or alternatively care should be taken that neither dim, N, nor 

the contents of the array current are disturbed. 

REFERENCES: 

1. Gupta, H., Gwytuer, C. E., anp Miuurer, J.C. P. Tables 
of Partitions. Royal Society Mathematical Tables, Vol. 
4, Cambridge Univ. Press, 1958. 

2. MacManon, P. A. Combinatory AURIS, 
bridge Univ. Press, 1916. 

3. Cuaunpy, T. W. Partition generating functions. 
J. Math. 2 (1931), 234-240. 

4. Arkin, A.O. L., Bratury, P., MacDonatp, I. G., ann Mc- 
Kay, J. K. 5. Some computations for m-dimensional par- 
titions. Proc. Cambridge Phil. Soc. (to appear); 


Vol. 2, Cam- 


Quart, 


begin 


integer 7; integer array current [l1:dim, 1:N], 
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x[l:dim,0: (N—1)xXdim]; 
procedure part (n,q,r);_ value n, q, 7; 
begin integer s, 7, 7, k, p, m, 2; 


integer n, 9,7; 


for p:= q step 1 until r — 1 do 
begin 
for i := 1 step 1 until dim do current [t,n] := cli,p); 


if n = N then begin use (current,dim,N); go to L2 end; 
S:= 7; 
for 1 := 
begin 
for j := I step 1 until dim do z[j,s] := 
x[z,8] := 2[t,s] + 1; 
for j := 1 step 1 until dim do 
begin 
if z[7, s] 
for k := 
begin 
for m := 
begin 
z:= if 7 = m then 1 else 0; 
f ciurrent [m, k] # x[m,s] — z then go to L4 
end; 
go to L3; 


1 step 1 until dim do 


x{j,p]; 


= 0 then go to L3; 
1 step 1 until n do 


1 step 1 until dem do 


TA: 
end k; 
go to L5; 
D3: - 
end 7; 
s:=s+1; 
D5: 
~  endz7; 
part (n+1,p+1,s); 
L2: end p 
end part; 
for 2 := 1 step 1 until dim do [7,0] := 
end partition - 


part (1,0.1) 
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ALGORITHM 314 

FINDING A SOLUTION OF WN 

EQUATIONS IN N UNKNOWNS [C5] 

D. B. DuLuEy aNnpD M. L. V. Prrreway (Recd. 7 Apr. 1966, 
19 Oct. 1966 and 5 July 1967) 

Cripps Computing Centre, University of Nottingham, 
England 


procedure ndinvi (functions, inttstep, error, cycles, x, f, accest, n); 
value 7; procedure functions; real initstep, error; 
integer cycles, n; array x, f, accest; 

comment This procedure performs inverse interpolation in n 
dimensions, i.e., it will find a set of values for n variables 2, 
such that n functions f(z) are zero. A more sophisticated tech- 
nique, suitable for large values of n, has been developed by 
S. M. Robinson (Interpolative Solution of Systems of Nonlinear 
Equations, SIAM Journal of Numerical Analysis, 3 (1966), 
650-658). It can also be used to fit a curve with n arbitrary 
parameters to a set of points, the » functions being formed, in 
this case, by equating to zero the differential of the sum of the 
squares of the residues with respect to each parameter in turn. 

The functions required are specified by a procedure of the 
form functions (f, 2) where f and z are declared as arrays from 
1 to n. This procedure should calculate the n functions from a 
set of values given in x, placing the results in f. The first step is 
made by forming partial derivatives over an interval initstep. 
li — 6 should be suitable for values of x of the order 1 to 10. 
Exit from the procedure will occur if: 

(i) the root sum square of the x increments is less than 
error. If error is negative, this condition must be 
satisfied for | error |, and in acldition this process is 
continued until the root sum square of the incre- 
mentsfails to decrease 

or (ii) the number of iterations is greater than cycles, implying 
that too much accuracy has been requested 
or (iii) the specified equations are singular. In this case exit 
is by a jump to a label fails. 
On entry, the array x should contain the starting values. On 
exit, the array z will contain the accurate root, f the residues and 
accest the last increments made to x as a measure of the accuracy. 

This procedure calls on a global procedure egnsolve 
(A, 6, n, label), which solves n linear simultaneous equations in 
nm unknowns Az = b, placing the result in b. If A is singular, it 
is assumed that an exit is made by a jump to label; 

begin 
real work, sumsqres, prevres; 
integer 1, j, count; 

Boolean switch; 

array prevf[l:n], copydelf[1:n, 1:n], delx, delf(l:n, 1:n+1]; 
functions (prevf, x); 

for 7 := 1 step 1 until n do 

begin 

alt] := 2[t] + initstep; 

functions (f, x); 

for 7 := 1 step 1 until n do 

begin 

delfli, j] := fli] — prevflsl; 


FUNCTIONAL 


314-P 1- RI 


delz[i, j} := 0; 
end differencing initial point; 
delz[i, 7] := inttstep; 
alt] := 2[t] — inttstep; 
end setting up the initial matrix of points; 
sumsgres := 1130; 
count := 0; 


tterate: 
switch := true; 
prevres := sumsgres; 
tryagain: 
for 1 := 1 step 1 until n do 
begin 


fla] := preof{t]; 
for j := 1 step 1 until n do copydelf{t, j] := delf{t, Jj] 
end copying delf for destructive use in procedure eqnsolve; 
eqnsolve (copydelf, f, n, inline); 
sumsqres := 0; 
for := 1 step 1 until n do 
begin 
work := 0; 
for j := 1 step 1 until n do work := work — delz{t, j] X fIJ]; 
accest[1] := work; 
z[z] := 2[t] + work; 
sumsqres := sumsgres + work X work 
end calculation of next point; 
count := count + 1; 
functions (f, 2); 
if count > cycles \/ sumsqres < error X error /\ 
(error > 0 VV sumsgqres > prevres) then go to exit; 
for 7 := 1 step 1 until n do 
begin 
work := f[t] — prevf[t]; 
prevf{i] := f{t]; 
for j := n step — 1 until 1 do 
begin 
delz[i, j+1] := delx[t, j] — accestli]; 
delf{t, 7+1] := delf[i, 3] — work 
end calculation of new differences; 
delz[t, 1] := —accest[t]; 
delf{t, 1] := —work 
end moving points up one place in tables; 
go to tierate; 
inline: 
for i := 1 step 1 until n do 
begin 
delz[i, n] := delx [t, n+1]; 
delf{i, n] := delf[t, n+1] 
end discarding alternative point; 
switch := - switch; 
if switch then go to fails else go to tryagain; 
exit: 
end ndinvt 
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FINDING A SOLUTION OF N FUNCTIONAL 


COLLECTED ALGORITHMS (cont.) 


EQUATIONS IN N UNKNOWNS [D. B. Dulley and 
M.L. V. Pitteway, Comm. ACM 10 (Nov. 1967), 726]. 
JAMES VANDERGRAFT AND CHARLES MESZTENYI 
(Reed. 12 Aug. 1968) 
Computer Science Center, University of Maryland, 
College Park, MD 20742 


KEY WORDS AND PHRASES: functional equations, interpola- 
tion, nonlinear equations, secant method 
CR CATEGORIES: 5.13, 5.15 


The algorithm, as published, requires four iterations to find the 
solution to a pair of linear equations. The difficulty seems to lie in 
the last statement of the first column. If this is replaced by 


delf [j,i] := fj] — preofly]; 


then the algorithm works well. In fact, however, it is now simply 
an n-dimensional secant method, which can be described by the 
iteration 


htt = ok — §2,(6F x)! F(a), k = 0,1,2,..., 


where 5F;, and 6x, are matrices whose ith columns are f(x'-*) — 

f(a*) and z*-i — z*, respectively. The iteration is started by setting 
cts oo + he; 

where 2° is a given vector, h is a small positive constant, and e; is 

the ith unit coordinate vector. 

It should be observed, also, that the algorithm will not break 
down if 6x; becomes singular. However, if this should happen it 
means that x*, z*1,. . ..2*™ lie in a proper subspace S of E*, Euclid- 
ean n-space, and all successive iterates will also lie in S. Hence 
the algorithm may converge to a point in S which is not a solution 
to f(z) = 0. To prevent this, the norm of f(z) should be checked 
before leaving the procedure. 
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ALGORITHM 315 
THE DAMPED TAYLOR’S SERIES METHOD FOR 
MINIMIZING A SUM OF SQUARES AND FOR 
SOLVING SYSTEMS OF NONLINEAR EQUATIONS 
[E4, C5] 
H. Spatu (Recd. 25 Oct. 1966 and 19 June 1967) 
Institut fiir Neutronenphysik und Reaktortechnik 
Kernforschungszentrum Karlsruhe, Germany 
procedure 7'A YLOR (n,m,x,h,f, ttmaz, eps1, eps2,der,S, KENN, 
EXIT), 
value n, m, epsl, eps2; 
real epsl, eps2, S; 


integer n, m, itmax, KENN; 


Boolean der; array 2, h, f; label EXIT; 
comment 
Let 
S(a,° bs ,n) re dL Fie, tae Xn) (m2n) (1) 
f= 


the function to be minimized. Such functions always appear if 
you apply the method of least squares to estimate nonlinear 
parameters. The following sequence 


+) = oh) ®) = ok) 'T po yap’ T 
- SOR BRE cal ce) Eis 


, of: \. 2 
Fen (B) inte mig tein @) 


OX; 


F(2) 


F= (fi,-++ Sm); 


where 8, which is always possible, is chosen to be such that 


S(z®—pAr®) < (1—Br)S(e) ~— (0<A<1) (3) 


is known to converge [1] for any 2 to a stationary point of 
S (grad S=2F,"F(z)=0), if on the carrying out of the iteration 
the matrix Fz7F; does not become singular. 

For m = n you have Az = F7-1F (x) and (2) becomes a damped 
version of Newton’s method for solving the system of nonlinear 
equations 


F(z) = 0 (4) 


All zeros of (4) are stationary points of (1). Thus we are able to 
generate a sequence which converges for any x to a stationary 
point of (1) and the possible divergence of Newton’s method 
(8=1) is avoided. It is not assured, however, that the method 
will always converge to a solution of (4). Numerical experience 
has shown that though Newton’s method (6=1) diverges for a 
certain 2) the damped sequence converges to a solution of (4) 
for the same x“), 

In the program we have chosen = .2. At each iteration we 
set first 8 = 1 and then, if (3) isnot valid, 6 = 2-7 (j=1,2,... ,16). 
If j is greater than 16 then 6 < .00002 and we assume to have 
reached a stationary point of S. 

Meaning of the formal parameters: 


n the number of variables z; 
m the number of functions f; 
x the array z[1:n] which must first contain a starting value 


x and finally will contain a stationary point of S, if 
F,"F, or for m = nF,, respectively, has not become 
singular 
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h h{1:n] is a step size vector for the approximation of F, 
(see below) 

f the array f[1:m] will contain the function values at the 


last x calculated in TAYLOR 
must initially contain the maximum number of iterations 
to be performed. Leaving TAYLOR regularly, itmaz 
contains the actual number of performed iterations 
epsl1___ the iteration is stopped when S < epsl 
eps2 the iteration is discontinued when > -f|Ax”| < 
eps2 X dvi | ast? | 
der if der = true the matrix F, must be produced by a global 
procedure named DERIVE (x, dfdx) which adjoins to 
the vector 2[1:n] the array dfdzx[1:m, 1:n]. In this case 
the array fh can be loaded by an arbitrary vector, for 
instance z. 
if der = false the matrix F; is approximated by 


imax 


ofs _ filmi, Bi Ti, En) —filti,...,%; —hj,...,2n) 
02; 2h; 


whereh is a given step size vector. With a suitable choice 

of the A; the convergence behavior of the sequence (2) 
is not destroyed. DERIVE (z, dfdr) must be formally 
declared outside of 7A YLOR in this case. 

[In some cases, particularly when solving nonlinear 
equations, the extra accuracy achieved by using 
central differences to estimate the derivatives is not 
necessary. A considerable saving in execution time can 
be obtained by using one-sided differences which 
means only minor changes in the program below. 
—Rer.] 

S should initially contain the greatest positive number 
that the employed computer can store. Finally S con- 
tains S = S(¢@tmez)) if 7A YLOR is regularly left. 
if after having called TAYLOR 

KENN = 0 then one of the above interruptions applies 
(epsl, eps2), 

KENN = 1 then timaz iterations were carried out and 
TAYLOR is left, 

KENN = —1 then 8 = 27" and TAYLOR is left. 
EXIT TAYLOR goes to this global label if i encounters a 

singular matrix. 
Further two global procedures must be made available to 

TAYLOR: 

i) FUNCTION (zg, f) which is able to calculate for a given vector 
x{1:n] the function values f[1:m] 

ii) GAUSS(n, A, 6, x, EXIT) which solves the linear system of 
n equations Ax = 6 forz. If A is singular thenGAUSS returns 
to the global label EXIT. Any linear equation solver may be 
used for GAUSS; 

begin integer 7,j,k,z,1; realhf, hl, hs, hz; 
array fp, fm[1:m], b, dz[1:n], dfdx[1:m, 1:n], aa[l:n, 1:n}; 
hs :=S8S; KENN := 2 :=0; 

ITERATION: z :=2z+1; 
if z > ttmar then begin KENN := 1; 

1 := 0; Al := 1.0; 

DAMP:1 :=1+1; 
if 1 > 16 then begin KENN := —1; 
FUNCTION (cz, f); hf := 0; 
for 7 := 1 step 1 until m do hf := hf + f[t] X f[t); 


KENN 


go to ENDE end; 


go to ENDE end; 


COLLECTED ALGORITHMS (cont.) 


if hf > hs X (1.0 — .2 X Al) then 
begin hl := hl X 5; 
for k := 1 step 1 until n do 2[k] := a[k] -+ hl X de[k]; 
go to DAMP 
end; 
hs := hf; if hs < eps 1 then goto ENDE; 
if der then DERIVE (a, dfdz) else 
begin 
for 7 := 1 step 1 until n do 
begin Af := h[t]; hz := 2.0 X Af; 


x[t]) := 2[t]) + hf; FUNCTION (a, fp); 
x[t] := 2[t] — hz; FUNCTION (z, fm); 
x(t] := aft] + Af; hz := 1.0/hz2; 


for k := 1 step 1 until m do 
dfdx[k, i] := hz X (fplk] — fm{k]) 
end 
end; 
ifm = n then GAUSS(n, dfdx, f, dx, EXIT) else 
begin . 
for 7 := 1 step 1 until n do 
begin hf := 0; 
for k := 1 step 1 until m do 
hf := hf + dfdzlk, i] X fk]; blt] := Af; 
for k := 7 step 1 until n do 
begin hf := 0; 
for 7 := 1 step 1 until m do 
hf := hf + dfdx{j, 1] X dfdzlj, k); 
aalt, k] := aalk, 7] := Af 


end 
end; 
GAUSS(n, aa, 6, dx, EXIT) 
end; 
hz := hf := 0; 
for 7 := 1 step 1 until n do 
begin 


x(t] := a2[t] — dz[t]; hz := hz + abs(e[t)); 
hf .= hf + abs(dz[i)) 
end; 
if hf > eps2 X hz then go to ITERATION; 
ENDE: FUNCTION (2, f); S := 0; ttmax := 2; 
for 7 := 1 step 1 until m do S := S + f[i] X ffi] 
end TAYLOR 


REFERENCE: 7 
{1] Brass, D. Uber Dampfung bei Minimalisierungsverfahren. 
Computing 1 (1966), 264-272. 
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THE DAMPED TAYLOR’S SERIES METHOD FOR 
MINIMIZING A SUM OF SQUARES AND FOR 
SOLVING SYSTEMS OF NONLINEAR EQUA- 
TIONS [H. Spith, Comm. ACM 10 (Nov. 1967), 726]. 
Gary SitveRMAN (Recd. 4 Mar. 1969, 14 Apr. 1969 and 
11 June 1969) 

IBM Scientific Center, Los Angeles, CA 90067. 

KEY WORDS AND PHRASES: solution of equations, least 


squares approximation, Newton’s method 
CR CATEGORIES: 5.18, 5.14, 5.1 5 
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The algorithm, as published, may introduce unnecessary trun- 
cation error into the solution. If the matrix F’; is approximated 
by central differences (der = false) then the value of the iterate is 
used to compute these differences. This involves two additions to 
and one subtraction from the iterate, each of which may result in 
truncation error. To correct this, the following statements on 
page 727 


alt] := 2f[¢] + hf; FUNCTION (a2, fp); 
x(t] := 2[¢7] — hz; FUNCTION (zc, fm); 
alt] := 2[t] + Af; hz := 1.0/hz; 


may be replaced by 


hh := xt]; 

zz] := a[t] + hf; FUNCTION (2, fp); 
x(t] := a[t] — hz; FUNCTION (ca, fm); 
a[t] := hh; hz := 1.0/hz; 


after declaring an additional real variable hh. 

In solving two equations in two unknowns the published al- 
gorithm converged to a solution with S = 8.83653 K 107 and 
KENN = —1. After the above modification convergence was with 
S = 0and KENN = 0. 
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ALGORITHM 316 

SOLUTION OF SIMULTANEOUS NON-LINEAR 

EQUATIONS [C5] 

K. M. Brown (Reed. 27 Oct. 1966, 31 Mar. 1967, 17 July 
1967, and 26 July 1967) 

Department of Computer Science, Cornell University, 
Ithaca, New York 


procedure nonlinearsystem (n, mazxtt, numsig, singular, 2x); 
value n, numsig; integer n, maztt, numsig, singular; array x; 

comment This procedure solves a system of n simultaneous 
nonlinear equations. The method is roughly’ quadratically con- 
vergent and requires only ((n?/2)+ (8n/2)) function evaluations 
per iterative step as compared with (n?+n) evaluations for 
Newton’s Method. This results in a savings of computational 
effort for sufficiently complicated functions. A detailed de- 
scription of the general method and proof of convergence are 
included in [1]. Basically the technique consists in expanding 
the first equation in a Taylor series about the starting guess, 
retaining only linear terms, equating to zero and solving for 
one variable, say x, , as a linear combination of the remaining 
n ~— 1 variables. In the second equation, xz is eliminated by 
replacing it with its linear representation found above, and 
again the process of expanding through linear terms, equating 
to zero and solving for one variable in terms of the now remain- 
ing n — 2 variables is performed. One continues in this fashion, 
eliminating one variable per equation, until for the nth equa- 
tion, we are left with one equation in one unknown. A single 
Newton step is now performed, followed by back-substitution 
in the triangularized linear system generated for the z.’s. A 
pivoting effect is achieved by choosing for elimination at any 
step that variable having a partial derivative of largest absolute 
value. The pivoting is done without physical interchange of 
rows or columns. 

The vector of initial guesses 2, the number of significant digits 
desired numsig, the maximum number of iterations to be used, 
mazxit, and the number of equations n, should be set up prior to 
the procedure call which activates nonlinearsysiem. After execu- 
tion of the procedure, the vector xz is the solution of the system 
(or best approximation thereto), mazit is now the number of 
iterations used and singular = 0 is an indication that a Jaco- 
bian-related matrix was singular—indicative of the process 
‘“blowing-up,’’? whereas singular = 1 is an indication that no 
such difficulty occurred. Storage space may be saved by imple- 
menting the algorithm in a way which takes advantage of the 
fact that the strict lower triangle of the array pointer and the 
same number of positions in the array coe are not used; 

begin integer converge, m,j, k, 1, jsub, ttemp, kmaz, kplus, tally; 
real f, hold, h, fplus, dermaz, test, factor, relconvg; 
integer array poinier(l:n, 1:n], tsub[L:in—1]; 
array temp, pari[{l:n], coe[l:n, 1:n+1]; 
procedure backsubstitution (k, n, x, tsub, coe, pointer); 

value. k, n; 

integer k, n; integer array tsub, pointer; array x, coe; 
comment This procedure back-solves a triangular linear 

system for improved 2[7] values in terms of old ones; 
begin integer km, kmaz, jsub; 

for km := k step —1 until 2 do 

begin kmax := isub(km—1]; x[kmaz] :== 0; 
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for j := km step 1 until n do 
begin jsub := pointer(km, Jj]; 
z(kmaz] := x{kmax] + coe[km—1, jsub] X a[jsub] 
end; 
a[kmax] := x[kmazx] + coe[km—1, n+1] 
end; 
end backsubstitution; 
procedure evaluatekthfunction (x, y, k); 
integer k; realy; array x; 
begin comment the body of this procedure must be provided 
by the user. One call of the procedure should cause the value 
of the kth function at the current value of the vector x to be 
placed in y; 
end evaluatekthfunction; 


converge := 1; singular := 1; relconvg := 10 T (-numsig); 
for m := 1 step 1 until mazit do 
begin 


comment An intermediate output statement may be in- 
serted at this point in the procedure to print the successive 
approximation vectors « generated by each complete itera- 
tive step; 

for j := 1 step 1 until n do pointer [1, j] := J; 

for k := 1 step 1 until n do 

begin if k > 1 then backsubstitution (k,n, x, isub, coe, pointer) ; 
evaluatekthfunction (x, f,k); factor := .001; 


AAA: tally := 0; fori := k step 1 until n do 


begin ttemp := pointer(k, 7]; hold := x{itemp]; 
h := factor X hold; ifh = 0 then h := .001; 
x[ttemp] := hold +h; 
ifk > 1 then backsubstitution (k, n, x, tsub, coe, pointer); 
evaluatekthfunction (x, fplus, k); 
part{itemp] := (fplus—f) /h; 
z{ttemp] := hold; if (abs(part[itemp]) =0)V 
(abs(f/part{itemp]) > 1.020) then tally := tally + 1; 
end; 
if tally < n — k then goto AA; factor := factor X 10.0; 
if facior > .5 then go to SING; go to AAA; 


AA: if k<n then go to A; if abs (parilitemp]) = 0 


then go to SING; 
coe[k, n+1] := 0; kmaz := itemp; go to ENDK; 


A: kmaz := pointer{k, kl; dermax := abs(part|kmaz)); 
kplus := k +1; 
for 7 := kplus step 1 until n do 
begin jsub := pointer(k, 7]; test := abs(part[jsub]); 
if test < dermaz then go to B; dermazx := test; 
pointer [kplus, 1] := kmax; kmax := jsub; 
go to ENDI; 
B: pointer [kplus, t] := jsub; 
ENDI: 
end; 
if abs(part[kmaz]) = 0 then gotoSING; isub[k] := kmaz; 
coe[k, n+1] := 0; 
for j := kplus step 1 until n do 
begin jsub := potinter[kplus, j]; 
coe[k, jsub] := —pari[jsub]/part[kmaz} ; 
coe[k, n+1] := coe[k, n+1] + part{jsub] X x[jsub] 
end; 
ENDEK: 


coe[k, n+1] := (coe[k, n+1]—f)/ partlkmaz] + x[kmaz] 
end k; 
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x[kmax| := coe|n, n+1]; 
if n > 1 then backsubstitution (n, n, x, isub, coe, pointer); 
if m = 1 then goto D; 
for 7 := 1 step 1 until n do 
if abs((temp[t]—a{t])/x[¢]) > relconvg then go to C; 
converge := converge + 1; 
if converge 2 3 then go to TERMINATE else go to D; 
Cc: converge := 1; 
D: for 7 := 1 step 1 until n do temp[i] := 2[t] 
end m; 
go to THROUGH; 
TERMINATE: 
mazit := m; goto THROUGH; 
SING: 
singular := 0; 
THROUGH: 


end nonlinearsysiem 


APPENDIX 


We include a sample procedure evaluatekthfunction for the 
2X 2 system: 


1 € 
(1-2 Jeno +2 — 20 = 0 
4r w 


one solution of which is (.5, 7) see [2] 
procedure evaluatekthfunction (x, y, k); 
integer k; realy; array x; 
begin switch functionnumber := F1, F2; 
go to functtonnumber [k]; 
Fl: y := 2.71828183 X (.920422528 & (erp(2X2z[1]—1)—1)+ 
x[2] /3.14159265—2X2[1]}); 
go to RETURN; 

F2: y := 5 X sin(x[1]X2z[2]) — 2[2]/12.5663706 — x[1]/2; 

RETURN: 

end evaluatekthfunction; 

REFERENCES: 

1. Brown, K. M. A quadratically convergent method for solv- 
ing simultaneous non-linear equations. Doctoral Thesis, 
Dept. Computer Sciences, Purdue U., Lafayette, Ind., Aug., 
1966. 

2. Brown, K. M., anp Conte, S. D. The solution of simultane- 
ous nonlinear equations. Proc. ACM 22nd Nat. Conf., pp 
111-114. 
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Remark on Algorithm 316 [C5] 
Solution of Simultaneous Nonlinear Equations 
(K.M. Brown, Comm. ACM 10 (Nov. 1967), 728-729) 


William J. Raduchel (Recd. 12 Aug. 1970 

and 8 Jan. 1971) 

Project for Quantitative Research in Economic 
Development, Harvard University, 
Cambridge, MA 02138 


Key Words and Phrases: nonlinear equations 
CR Categories: 5.15 


The procedure was coded in both Burroughs 5500 ALGoL and 
IBM FORTRAN-IV and ran correctly on the sample problem provided. 
However, two changes seem appropriate: In the loop to compute 
the partial derivatives following AAA replace 

ifh = Othen/ := 0.001; 
with 

iff = Othenh := factor; 
for otherwise the purpose of the loop is lost for variables currently 
having the value zero. To avoid an interrupt for a zero-divide 
replace 

if abs ((temp |i]—x[i})/x{i]) > relconvg 

then go to C; 
with 
if abs ((temp [i]|—~x[i]) /(if x[i] ~ 0 then x[/] else if temp [i] ~ 0 
then temp [i] else 1)) > relconvg then go to C 

As the author indicates there are unused positions in the arrays 
pointer and coe because of the triangularity of the method. Im- 
plementing the algorithm to use this fact to conserve storage is 
much easier if, in both the main procedure and in backsubstitution, 
values are stored and retrieved in natural order rather than accord- 
ing to the current pivot scheme. 
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ALGORITHM 317* 

PERMUTATION [G6] 

CHaruLes L. Ropinson (Reed. 12 Apr. 1967, 2 May 1967 
and 10 July 1967) 

Institute for Computer Research, U. of Chicago, Chicago, 
Til. 
* This work was supported by AEC Contract no. AT (11-1)-614. 


procedure permute(n, k, v); value n, k; integer array v; 
integer n, k; 

comment This procedure produces in the vector v the kth 
permutation on 7 variables. When k = 0, v takes on the value 
1, 2, 3, 4,---, ». This algorithm is not as efficient as pre- 
viously published algorithms [1], [2], [8] for generating a 
complete set of permutations but it is significantly better 
for generating a random permutation, a property useful in 
certain simulation applications. Any non-negative value of 
k will produce a valid permutation. T'o generate a random 
permutation, k should be chosen from the uniform distribu- 
tion over the integers from 0 to n! — 1 inclusive; 

begin integer 7, q, 7, x, J; 
for 7 := 1 step 1 until n do v{[7] := 0; 
for 7 := n step —1 until 1 do 


qimk+d; rez hkh—qXt; «:= 0; ji=n: 
no: if v[j] = 0 then 


begin 
if < = r then go to ti else x := x + 1 
end; 
jJ:= 37-1; gotono; 
wi: vj] := 7; kim gq; 
end 
end 
REFERENCES: 


1. Coveyou, R. R., anp Sunuivan, J. G. Algorithm 71, Per- 
mutation. Comm. ACM 4 (Nov. 1961), 497. 

2. Peck, J. EK. L., anp Scurack, G. F. Algorithm 86, Permute. 
Comm. ACM 6 (Apr. 1962), 208. 

3. Trorrer, H. F. Algorithm 115, Perm. Comm. ACM 5 (Aug. 
1962), 434. 
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ALGORITHM 318 

CHEBYSCHEV CURVE-FIT (REVISED) [E2] 
J. Boornroyp (Reed. 15 May 1967) 

University of Tasmania, Hobart, Tas., Australia 


procedure chebfil(c, y,n,a,m); value n,m; 
array 2,Y,@; integer n, ™; 

comment evaluates, in a[0] through a[m] of a[0:m+1], the co- 
efficients of an mth order polynomial P(x) = ao + ait + +++ Gmx™ 
such that the maximum error abs(P(z;:)—y:)) is a minimum over 
the n(n>m-+1) sample points zx, y[1:n]. The z[z7] must form a 
strictly monotonic sequence. 

This procedure is an extensive revision of Algorithm 91 (Albert 
Newhouse, Chebyshev Curve-Fit, Comm. ACM & (May 1961), 
281). The polynomial P (2) is a best-fit polynomial in the Cheby- 
shev sense as described by Stiefel (Numerical Methods of Tcheby- 
cheff Approximation), in Langer (Iip.), On Numerical A pproxt- 
mation, U. of Wisconsin Press, 1959, pp. 217-232. Stiefel (p. 221) 
shows that the procedure must terminate after a finite number 
of steps. This is not always so with imperfect arithmetic, where 
roundoff errors may cause cycling of the chosen reference sets. 
This condition is detected by checking that the reference devia- 
tion is always raised monotonically. At exit the absolute value 
of a{m-+1] yields the final reference deviation. Negative a[m-+1] 
indicates that the procedure has been terminated following the 
detection of cycling; 

begin 
integer 7, j, k,mplusi, ri, i1, imaz, rj, j1; 
real d, h, ail, rhil, denom, at, rhi, xj, hmax, himax, xi, hi, abshi, 
nexthi, prevh; 
integer array 7[0:m-+1]; array rz, rh[0:m+1]; 
mplusl := m+ 1; prevh := 0; 
comment index vector for initial reference set; 
r[0] := 1; r[mplusl] := n; 
d := (n~1)/mplusl; h := d; 
for 7 := 1 step 1 until m do 
begin 7[?7] := h+ 1; h:=h+dend; 
start: h := —1.0; 
comment select m + 2 reference pairs and set alternating 
deviation vector; 
for 7 := 0 step 1 until mplusl do 


begin 
rt := r[t]; 
re[t] := a[rt]; ale] := yf[ri]; 
rh{t] := h:= —h 

end 7; 


comment compute m + 1 leading divided differences; 
for j := 0 step | until m do 
begin 

a1 := mplusl; ail := al[zll; 

rhil := rh[el]; 

for 7 := m step —1 until j do 


begin 
denom := rz[t1] — re[t—J]; 
at := alt]; rht := rh{tl; 
a[tl] := (atl—ai)/denom; 
rh{t1] := (rhil—rhi)/denom, 
t1:= 7%; ail :=at; rhil := rhi 
end 2 


end 7; 
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comment equate (m-+1)th difference to zero to determine h; 

h := —almplusl]/rh[mplus1]; 

comment with h known, combine the function and deviation 
differences; 

for 7 := 0 step | until mplusl do 
alt] := alt] + rh{t] x A; 

comment compute polynomial coefficients; 

for j := m — 1 step —1 until 0 do 


begin 
vj i= ralj]; t:= 9; at := alt]; 
for 11 := 7 + 1 step 1 until m do 
begin 
ail := alzl]; 
alt] := at — aj X atl; 
at := ail; 7:=i1 
end 71 
end j; 


comment if the reference deviation is not increasing mono- 
tonically then exit; 

hmax := abs(h); 

if hmax < prevh then 

begin a[mplusl] := —hmaz; go to fit end; 

comment find the index, tmaz, and value, himaz, of the largest 
absolute error for all sample points; 


almplusl] := prevh := hmax; tmar := r[0]; himar:= h; 
j:= 0; 77 := rly]; 
for 7 := 1 step 1 until n do 
if 7 + rj then 
begin 
at := a(t]; ht := alm]; 


for k := m — 1 step —1 until 0 do 
hi := hi X at + afk]; 
hi := ht — yf[t]; absht := abs (ht); 
if absht > hmax then 
begin hmax := abshi; himax := hi; twmax := 71 end 
end 
else 
if j < mplusl then 
beging :=j+1; 17 := rly] end; 
comment if the maximum error occurs at a nonreference 
point, exchange this point with the nearest reference point 
having an error of the same sign and repeat; 
if imax ~ r[0] then 
begin 
for 7 := 0 step 1 until mplusl1 do 
if imax < r{t] then go to swap; 


2 := mplusl; 
swap: nectht := if i ~ 71 + 2X 2 = 0 then h else —h; 
if himax & nextht > 0 then r[i] := imaz 
else 
if imax < 1r[0) then 
begin 
jl := mplusi; 


for 7 := m step —1 until 0 do 
begin r[jl] := r[j];_\ 71 := 7 end; 


r[O] := tmazx 
end 
else 
if imax > r[mplus1] then 
begin 
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j= 0; 
for j1 := 1 step 1 until mplusl do 
begin r{j] := r{jl]; 3 := jl end; 


r{mplusl] := imax 
end 
else r{i—1] := imaz; 
go to start 
end; 


jit: 
end chebfit 
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ALGORITHM 319 

TRIANGULAR FACTORS 

MATRICES [F1] 

Davip R. Green (Recd. 26 Apr. 1965, 19 Oct. 1965 and 
30 Aug. 1967)* 

Mount Isa Mines Ltd., Queensland, Australia 


KEY WORDS AND PHRASES: matrix decomposition, matrix 
factors, matrix modifier, matrix perturbation 
CR CATEGORIES: 5.14 


OF MODIFIED 


procedure modifacs (1,¢,2,m,n,epstlon,fatl) ; 
value epsilon,m,n; array l,c,z; integer m,n; 
real epsilon; label fail; 

comment Suppose that the symmetric, positive definite,n X n 
matrix a has been decomposed into the matrix product J.J? 
where | is a lower triangular matrix and T denotes transpose. 
If a is to be modified by the addition of a matrix triple product 
z.c.aT, this procedure will modify J, in its own space, to pro- 
duce the triangular factors of a + 2.c.x7 in approximately mn? 
operations (z is ann X m matrix, c is a symmetric, m XK m 
matrix, m>1,m <n). 

This situation can arise, for example, in some treatments of 
network flow problems and the elastic plastic analysis of plane 
frames. The referee has pointed out that a further very useful 
application would be updating least squares solutions when 
additional readings have been obtained. A full description of 
the algorithm for general matrices is given by J. M. Bennett, 
Triangular Factors of Modified Matrices, Numer. Math. 7 (1965), 
217-221. 

On entry, array | should hold the lower triangular matrix J. 
Elements above the diagonal of / are ignored by' the procedure. 
On exit the modified values of | are held in the same format. 
The method will fail if the resulting matrix a + z.c.xT is not 
positive definite, so should the absolute value of any pivot be 
less than the parameter epsilon, or should a pivot be negative, 
then exit through fail will occur; 

begin 
array p[l:m]; 
integer 7,j,k; 
real d,t; 
i:= 1; 
repeat: 
d:= (1,7); 
t:= dT 2; 
for k :=.1 step 1 until m do 
begin 

pik] := 0; 

for j := 1 step 1 until m do 

plk] := pik] + 2[2,j] X clj,k]; 

t:= t + al[t,k] X pk] 
end; 
if t < epstlon then go to fail; 

L[i,t] := sqrt(t); 
if i = n then go to exit; 
for j := 1 step 1 until m do 

ply] = plgl/tle,2); 
for j := 7+ 1 step 1 until n do 
begin 


Uj,2] := Ufj,t]/d; 
t:= 0.0; 
for k := 1 step 1 until m do 
begin 
[j,k] := x[j,k] — afte] X 19,2]; 
t:= t + 2fj,k] X plik) 


end; 

Lj,t] := Uft,7] & Uj,t] + t 
end; 
for j := 1 step 1 until m do 
begin 


elj,j] = clj,j] — pljlt2; 

if 7 < m then 

for k := j + 1 step 1 until m do 

clj,k] := clk,j] «= cfj,k] — pli] X pik] 
end; 


t:=it+d; 
go to repeat; 
exit: 
end 
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ALGORITHM 320 

HARMONIC ANALYSIS FOR SYMMETRICALLY 

DISTRIBUTED DATA [C6] 

D. B. Hunrer (Reed. 1 June 1965, 4 Jan. 1966, and 26 
June 1967) 

Department of Mathematics, University of Bradford, 
Yorkshire, England 


KEY WORDS AND PHRASES: harmonic analysis, cosine series, 
sine series, function approximation, curve fitting, trigonometric 
series 


CR CATEGORIES: 5.13. 


procedure trigfit (index, n, m, h, e, x, f, mt, a); 
value index, n, m,h, e; integer index, n, m, mt; real h, e; array 
x, f, a; 
comment Approximates afunction y of z by ahalf-range cosine or 
sine series of period 2h from values specified at discrete points, 
not necessarily equally-spaced, in the range (0, h). The input 
parameters are: 
index—if index = 0, a cosine series is fitted, if index = 1, a 
sine series. No other value is permitted. 
n—number of function-values given. 
m—order of the highest harmonic required. 
h—half-period of the fitted series. 
e—used to terminate the process if rounding errors start to 
accumulate excessively (see note below). 
x—the given values of x are stored on z(1], [2], --- , z[n]. 
f-—the value of y corresponding to x == 2z[i] is stored on f[z] 
(t=1, 2, ---,n). 
The procedure then calculates the coefficients a{r] in the ap- 
proximation 


rT 


mt 
: + 2, alr] cos (rrx/h) if index = 0, 


li 
_ 


> alr] sin (rz/h) if index 
r=l1 
Here normally mt = m, but provision is included to calculate 
fewer harmonics if rounding errors begin to accumulate exces- 
sively (see note below). 
Method of calculation. The coefficients a[r] are calculated 
so as to minimize the sum 
n : ; 4 if z[t] = Oorh, 

d wi(fliJ-S@lé))*, wi = j 

i=l 1 otherwise. 
The method used is similar to that of [1]. First S(z) is expanded 
in the form 


S(rz) = > bipi (ic) 


t=index 


where 


It 
= 


|i + x ai; cos (jra/h) if index 
j=l 
pie)=4 , 
ps aij; sin (jrx/h) 
j=l 


if index = 1. 


320-P 1- 0 


Then 
mt 
= pa biQir « 
i=r 


The polynomials p;(z) are chosen so as to be orthogonal w.r.t. 
summation over x = 2[t], with weights w; . This implies that 


b; 


Hl 


n n 

> wif lilpi (eliD/ 2, wjlpi(as)]?. 
j= j= 
The p:(x) are generated by a recurrence relation 


Pisilx) = (2 cos (rx/h)—ai)pi(z) — Bipise) 


where 
232 w; cos (wal fl/b) -[pslanl sD? 
ea - Seomwe (i > index), 
Dd wilp(2l i)? 
j= 
 wjlpi(al sll? 
6, = 2 (i > index). 
yw jlpalaly)? 


The initial forms are 


if index = 0 


ll 
nie 


pox) 
or pi(z) = sin (r2/h) if index = 1. 


Thus if the z[7] are equally spaced, i.e. if z[¢] = @—1)h/(n—1), 
it follows that 
px(z) = cos (kra/h) or sin (krx/h) according as index = Oor 1. 
The values of the p;(x) are calculated by the method of [2]. 
Note. If the x{z] are verp irregular in their distribution 
serious rounding errors may accumulate, and it is recommended 
that the points be as nearly as possible equally spaced. However 
the procedure includes provision, under control of parameter e, 
to reduce the number of harmonics calculated, mt, if rounding 
errors do start to build up. 
Rounding error is controlled by estimating the error which 
would occur in the analysis of a standard function g(x) for the 
given points, where 


1 if index = 0, 


q(x) = 


ye 
iP sin (rz/h)/ >| sin (ra[j]/h) | if index = 1. 

I= . 
The estimate used for the rounding error in the rth harmonic is 


Tr 


@= >), XX di, 
ti ndextl 


where 


c; = max |a;; | for index <j <1, 


I 


| > w3q(xlj)) pi ly] ud/d, w[p:(z[z])]? |. 


COLLECTED ALGORITHMS (cont.) 


If for any r,e, > e, the procedure is terminated with mi = r — 1. 
REFERENCES: 
1. Cuensuaw, C. W. Curve-fitting with a digital computer, 
Comput. J. 2, 170-173. 
2. Watt, J. M. A note on the evaluation of trigonometric 
series. Comput. J. 1, 162; 
begin 
integer 7,7; real sl, s2, s3, alpha, bela, c,d, u,v, w, g, 8, mean, 
p, coeff, er, cer; 
array cl[O:m], c2[0:m+1]; 
g := 8.1415926536/h; 
if index = 0 then mean := 1 else 
begin mean := 0; 
for 7 := 1 step 1 until n do 
mean := mean + abs(sin(gXz2[t])); 


mean := n/mean 
end; 
for 7 := index step 1 until m do alt] := 0; 


c2[m+1] := alpha := cer := 0; 
for 7 := 0 step 1 until m do cl[2] := c2{z] := 0; 


cllindex] := ~—1; 
beta := s8.:= 1; mt := indez; 

loop: coeff := 0; for? := index step 1 until mi do 
begin 


d := (if ¢=0 then c2[1] else cl[¢—1]) + c2[i+1] — beta x 
cl{t] — alpha X c2[t]; 
cl[tq] := c2{c]; c2{¢q] := d; d := abs(d); 
if d > coeff then coef := d 
end; ‘ 
sl := s2:=d:= er := 0; 
for 7 := 1 step 1 until n do 
begin 
ce := 2X cos(gXz[z]); 


if mt = 0 then begin p := 0.5; go to sum end; 


u:= v0 := 0; 
for j := mi step — 1 until 1 do 
begin 
wi=eoxXu-—v+ehj]; vi= uy u:=w 
end; 
if index = 0 then 
begin 
s:i= 1; p:= 0.5 X& (uxXc+c2[0]) — v 
end 
else 
begin 
s:= sin(gXz[i]); p:=uXs 
end; 


sum: w := if z{i] = 0 V z2{t] =) then 0.5 else 1;: 
d:=ad+wX px fit; . 
if mt > index then er := er + w X p X 8 X mean; 
p:=wxXpt 2; sl:=sl+cXp; s2:= s2+p 
end; 
cer := cer + coeff X abs(er)/s2; 
if cer > e then go to exit; alpha := 81/82; 
beta := s2/s8; d:= d/s2; 83 := 82; © 
for i := index step 1 until mt do 
afi] := aft] +d X c2lz); 
mt := mt +1; if mt < mthen go to loop; 
exit: mt:= mt — 1 
end trigfit; 
procedure harmanalsymm (n, m,h, e, x, ypos, yneg, mc, ms, a, b); 
value n, m,h,e; imteger n,m, mc, ms; realh,e; array x, 
ypos, yneg, a, b; 
comment Approximates a function y of x by a finite trigono- 
metric series of period 2h from values specified at discrete points 
in the range (—h, h). Those points need not be equally spaced, 
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but must be symmetrically distributed about the value z = 0. 
Thus only the values of x in the range 0 < x < h need be given. 
The input parameters are: 
n—number of values of x in the range 0 < x <h. 
m—order of the highest harmonic required. 
h—half-period of the fitted series. 
e—used to terminate the process if rounding errors start to 
accumulate excessively (see note on trigfit). 
x—the given values of x in the range (0, h) are stored on 2[1|, 
z[2], --- , #[n]. 
‘ypos—the value of y corresponding to x = + z[7] is stored on 
ypos|t] (@=1, 2,--- ,m). 
yneg—the value of y corresponding to x = — 2[7] is stored on 
yneg(t] @=1, 2,---, n). 
The procedure then calculates the coefficients a[r] and b[r] in the 
approximation 


Sw) = 2a[0] + > all cos (raz/h) + a) b[r] sin (rra/h). 


Here normally mc = ms = m, but provision is included to calcu- 
late fewer harmonics if rounding errors begin to accumulate 
excessively (see note on trigfit), or if m exceeds its maximum per- 
missible value. For the cosine terms this maximum value is 
n — 1. For the sine terms it is n, this figure being reduced by 1 
for each z[z] equal to-0 or h. The cosine and sine series are calcu- 
lated separately by trigfit, with 


fli] = 0.5 X (ypos[é] + ynegli]) for cosine series, 
0.5 X (ypos[i] — yneg[t]) for sine series; 


begin 
integer 1, md; array f[l:n]; 
for 7 := 1 step 1 until n do 
f{t] += 0.5 X (ypos{i] + ynegli)); 
trigfit (0, n, ifm > n then n — 1 else m,h, e, x, f, me, a); 
md := 7; 
for 7 := 1 step 1 until n do 
begin 
fli] := 0.5 X (ypos{t}] — yneg{z]); 
if c[z7] = 0 V 2[7] = h then md := md — 1 
end; 
trigfit (1, n, if md > m then m else md, h, e, x, f, ms, 6) 
end harmanalsymm 


procedure irigfit; 
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ALGORITHM 321 

t-TEST PROBABILITIES [S14] 

Joun Morris (Recd. 6 Jan. 1967, 18 July 1967, and 10 
Oct. 1967) 

Computer Institute for Social Science Research, Michigan 
State University, East Lansing, Michigan 


KEY WORDS AND PHRASES: T-test, Student’s t-statistic, dis- 
tribution function 


CR CATEGORIES: 5.5 


real procedure ttest (x, df, maxn, gauss, error); 
value x, df, mazn; realz; integer df, maxn; 
gauss; label error; 
comment This procedure gives the probability that ¢ will be 
greater in absolute value than the absolute value of xz, where ¢ 
is the Student t-statistic, as defined and tabled by R. A. Fisher 
[2], evaluated at df degrees of freedom: that is, 2 times the inte- 
gral of the distribution function of ¢, evaluated from abs(x) to 
infinity. The procedure may also be used, e.g., to estimate the 
two-tailed probability of a simple correlation, r, where N = the 
number of pairs of observations, df = N — 2,andt=~r  X sqrt 
(df/(1.0 — r 7 2)) (cf. e.g. [5]). 
For reasonably small df, Student’s cosine formula is used [8, 
4]: 


real procedure 


6 
ttest = 1.0 — coef [ cos?/—! 6 dé 
0 


where @ = arctan (t/sqrt(df)) and 
coef = (df—1)/(df—2) X @df—3)/@f—4) 


($) x (2/r) ‘for odd df, 
(7) X ) X (4) for even df. 


Integrated in series, this gives results which appear to be cor- 
rect to very nearly the full single precision accuracy of the 
machine (in terms of the number of digits after the decimal point, 
not necessarily significant digits). 

An approximation due to R. A. Fisher [1] gives results accurate 
to within +3 X 1077 when mazn has been set at 30. The tradeoff 
on time is also optimal at about this point. The real procedure 
gauss computes the area under the left-hand portion of the nor- 
mal curve. Algorithm 209 [6] may be used for this purpose. 

Thanks to the referee for many helpful suggestions, most of 
which have been incorporated, and to David F. Foster, who 
wrote an early version of part of the program. 

REFERENCES: 

1. Fisuer, R. A. Metron 5 (1925), 109-112. 
2.——. Statistical Methods for Research Workers. Oliver and 

Boyd, Edinburgh, 1965. 

3. GosseT, W. 8S. (Student). 
Biometrika 6 (1908), 1. 

New tables for testing the significance of observations. 
Metron & (1925), 105. 

5. Gurtrorp, J. P. Fundamental Statistics in Psychology and 
Education. McGraw-Hill, New York, 1956, pp. 219-221. 

6. Inpetson, D. Algorithm 209, Gauss. Comm. ACM, 6 (Oct. 
1963), 616. 


The probable error of a mean. 


4, 
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begin 
if df < 1 then go to error; 
if xz = O then tiest := 1.0 else 
begin real 1; 
t := abs (a); 
if df < maxn then 
begin integer 7, nh; 
z := t/sart(df); 
cth := 1.0/sqgri(z T 2+1.0); 
sth := z X cth; 
cthsq := cth 7 2; 
nh := (df—1) + 2; 
if df = 2X (df+2) then 


real cth, sth, cthsq, x1, coef, z; 


begin 
t:= sth; 
if nh = 0 then go to g; 
cth := cthsq; xt := 1.0; 
coef := 0.5 X sth 
end else 
begin 
t := 0.6366197724 X arctan(z); 
comment 0.6366197723675813430755351--- = 2/a; 


if nh = 0 then go to g; 


xi :=0; coef := 0.6366197724 X sth 
end; 
for 7 := 1 step 1 until nh do 
begin 
t:= t+ coef X cth; cth := cth * cthsq; 
zi := at + 2.0; 
coef := coef X xi/(ai+1.0) 
end; 
g:(:=10-—t 
end else 


if 1 > 6.0 then i := 0 else 
if df < 106 then 
begin real f, (2, 4, 6, 18, 110, #12, ¢14, 16, £18; 
fi=df; Q:=tXt; 44:= 12 2; 6 := t4 x 22; 
8 := (6 X (2; 110 := #& X 2; 112 := 110 #2; | 
(14 := #12 X 12; 116 := 14 X 12; t18 := 116 X 22; 
comment 0.3989422804014326779399461--- = 1/sgrt (2X7); 
t:= 2.0 X (gauss (—t)+¢X0.3989422804 x exp(—0.5X 12) & 
((12+1.0)/(4.0Xf) + (3.0% t6-7.0X t4—5.0X 12—3.0)/ 
(96.0XfXf)+ (410—11.0X #8-+ 14.0 6+6.0X t44—3.0 12— 
15.0)/(384.0Xf T 3)+ (15.0 t14—375.0& t12-+. 2225.0 t10— 
2141.0 t8—939.0X t6— 213.0 t4—915.0X 12+-945.0) / 
(92160.0Xf T 4)+(3.0X t18— 133.0% 116+1764.0 114— 
7516.0 X t12-+5994.0X 10+ 2490.0 18 +1140.0 t6-+180.0 
14+-5355.0X (24-17955.0) / (368640.0Xf T 5))) 
end else ¢ := 2.0 & gauss(—2); 
tlest := if t <0 then 0 else ¢ 
end 
end ¢test 


COLLECTED ALGORITHMS (cont.) 


REMARKS ON 

ALGORITHM 321 [S14] TEST PROBABILITIES 
[John Morris, Comm. ACM 11 (Feb. 1968), 115-6] 

ALGORITHM 344, STUDENT’S t-DISTRIBUTION 
[David Levine, Comm. ACM 12 (Jan. 1969), 37-8] 

G. W. Hit, anp Mary Loucuueap* (Recd. 16 Apr. 
1969 and 29 Sept. 1969) 

Commonwealth Scientific and Industrial Research Or- 
ganization, Division of Mathematical Statistics, Glen 


Osmond, South Australia 
* Present address: Monash University, Clayton, Victoria, 
Australia , 


KEY WORDS AND PHRASES: t-test, Student’s t-statistic, 
distribution function, approximation 
CR CATEGORIES: 5.12, 5.5 


Algorithm 321, as published, was coded in CSIRO 3200 Auaon 
and run on a CDC 3200 with programmed floating point opera- 
tions. A ForRTRAN equivalent of Algorithm 321 was run for com- 
parison with the Fortran Algorithm 344, which uses the same 
recurrence relation based on Student’s cosine formula as that 
used in Algorithm 321 for df degrees of freedom less than mazn. 
Numerical results agreed with 6-digit tabulated’ values [1] and 
double precision calculations indicate that accuracy is limited by 
truncation of intermediate results to the precisioh of the proces- 
sor, with error in the final result increasing as the square root of 
df. Timing tests rated Algorithm 344 at approximately (3 df+1%) 
msec; slightly faster than Algorithm 321, which required ap- 
proximately (2 df+2})msec for df < maxn. . 

For df > mazxn Algorithm 321 uses Fisher’s [2] fifth order ap- 
proximation, whose accuracy is summarized in the diagram for 
df = 10(10)50 (see Figure 1). The shaded regions indicate values 
of ¢ for which the claimed accuracy of 3 X 10-’ for maan = 30 
is not attained. For ¢ > 6.0 this algorithm returns zero values, 
giving errors up to 1.39 X 10°°. The following alterations avoid 
this error and, by “‘nesting’”’ Fisher’s polynomial ‘approximation, 
reduced the time from about 25msec to 20msec and reduced the 
store requirement by 27%. 


Replace the 19 lines beginning ‘“‘g: ¢:= 1.0 — ?” by 


g: 2:= 10-2 

end else a 

begin x := 2.0 X gauss (—?); 
if df < 16 then 

begin real f, (2; 

f := 0.25/df; 2:=t& ¢; 
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Log 19 | error| 


4 5 
Student’s t 
Fra. 1 


z= (((((C(((((((3.0% t2— 133.0) X £2 
+1764.0) X12—7516.0) X 12-+5994.0) X t2+2490.0) x £2 
+-1140.0) X t2+180.0) X#2+5355.0) X 12+17955.0) X f 
+ ((((((15.0X 12—375.0) X t2-+-2225.0) X 12—2141.0) X #2 
—939.0) X12— 213.0) Xt2—915.0)X 2+945.0) X f/60.0 
++ (((((2—11.0) X i2-++14.0) X#2+6.0) X12—3.0)X2—15.0) X f 
+ ((3.0X t2—7.0)Xt2—5.0)Xi2—3.0)Xf/6.0 
+-(t2+1.0)) X f X t X 0.7978845608 X exp (—0.5Xi2) +2 


‘end; 


ttest := if c < 0.0 then 0.0 else x 


The last statement, recommended by the referee, avoids negative 
results due to rounding errors when the answer is small. 

In Algorithm 344 the three statements beginning “1 T = 
ABS(T)”’ were replaced by: 

1 T2 = T*T/FLOAT (DF) 
Tl = SQRT(T2) 
T2 = 1./(1.4T2) 
to avoid changing the calling parameter T. 

Although Algorithm 321 occupies about twice the store space 
needed for Algorithm 344, and is slightly slower for df < magn = 
30, it is about three times faster for df = 100. 

REFERENCES: 

1. Smirnov, N. V. Tables for the Distribution and Density Func- 
tions of t-distribution. Pergamon Press, New York, 1961. 

2. Fisoer, R. A. Expansion of “‘Student’s” integral in powers of 
n-1, Metron. 6, 3 (1926), 109-112. 
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ALGORITHM 322 

F-DISTRIBUTION [814] 

Econ Dorrer (Recd. 25 Jan. 1967, 3 July 1967, and 17 
Oct. 1967) 

Institut fiir Photogrammetrie und Kartographie, Tech- 
nische Hochschule Miinchen, W. Germany; now: De- 
partment of Surveying Engineering, University of 
New Brunswick, Fredericton, N.B., Canada 


KEY WORDS AND PHRASES: Fisher’s F-distribution, Stu- 
dent’s t-distribution 
CR CATEGORIES: 5.5 


real procedure Fisher (m, n, x); 
value m,n, x; integer m,n; real x; 

comment Fisher’s F-distribution with m and n degrees of 
freedom. Computation of the probability 


(#3 
PrF <x) = 


(ye) el 
Tf—)-rf— 


where w = (m/n)z and F = (> t=: 2:2/m)/(>,41 y;?/n). The 
solution results recursively from the basic integrals 


Fisher (1,1,2) = 2-arctan /w/n, Fisher (1,2,2) = (w/(w+1))}, 
Fisher (2,1,2) = 1 — 1/(w+1)!, Fisher (2,2,2) = w/(w+1). 


w is introduced by 0.3183098862 = 1/x. By calling Fisher (1, n, 
£72), Student’s t-distribution will be obtained; 
begin integer a, b,i,j; real w, y, z, d, p; 
a@:= 2% (m+2) —-m+2; b:=2 (n+2) —n4+ 2; 
wis aX m/n; 2z:=1/(i+w); 
ifa = 1 then 
begin 
if b = 1 then 
begin 
p := sqrt(w); 
d:=y X 2/p; 
end else 
begin 
p := sqri(wXz); 
end 
end else 
if b = 1 then 
begin 
p := sqrt(z); 
end else 
begin 
d:=2X% 2; p:=wrXz 
end; 
y:= 2X w/z; 


gm {2-1 


dé, 


y := 0.3183098862; 
p:= 2X y X arctan(p) 


d := 0.5Xp X 2/w 


d:=05 Xz pp; p:=1l1—p 


d:= (1+ a/(j-2)) X dX 2; 
= ifa = 1 then p +d X y/(j—1) else (p+w) X z 


wz; z2:= 2/2; b:=n— 2; 
for 2 := a+ 2 step 2 until m do 
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begin a ; 
jizitb; d:=yXdXx j/G-2); pi=p— 2x a/j 


end i; 
Fisher := p 
end Fisher 


CERTIFICATION OF ALGORITHM 322 [814] 

F-DISTRIBUTION [Egon Dorrer, Comm. ACM 11 (Feb. 
1968), 116] 

J. B. F. Frevp (Recd. 15 Aug. 1968) 

Commonwealth Scientific and Industrial Research Organi- 
sation, Adelaide, South Australia 

KEY WORDS AND PHRASES: Fisher’s 


Student’s é-distribution 
CR CATEGORIES: 5.5 


F-distribution, 


Algorithm 322 was coded into Fortran and run on a CDC 3200, 
and its accuracy for moderate probability levels was tested using 
(a) 5-figure critical values of the F-distribution at the .95 and .99 
levels, taken from [1], and (b) 6-figure probability values of the ¢- 
distribution, taken from [2]. In both cases, limitations in the re- 
sults appeared to be due to limitations in the tables, rather than in 
the algorithm. 

232 values of the F-distribution were tested, for m = 1 and 12 
using all tabulated values of n, and for nm = 10 and 21 using all 
tabulated values of m. All the results agreed with the tabulated 
probability level to 4 significant figures, 89% to 5 figures, and over 
half the results agreed to 6 or more figures. 

300 values of the t-distribution were tested, for n = 1(1)30 and 
t = .5(.5)5. All the results agreed with the tabulated probability to 
5 significant figures, and 90% to the full 6 figures given in 
the tables. 

To test extreme probability levels, another 100 values of the 
F-distribution were used: for m = n = 2, 10, 50, 75, 100, 120, 150, 
200, 300, and 400 for each of the values x = 10', where 7 = 5(1)5. 
It was found that for probabilities which are extremely close to 0 
or 1, the algorithm may produce probabilities which are slightly 
less than zero, or slightly greater than 1. It is recommended that 
a “guard” be inserted in the program to set these values equal to 
0 or 1. For example, this could be done by inserting before 
Fisher:=p the additional statement 


p := if p > 1 then 1 else if p < 0 then 0 else p; 


The time taken by the algorithm was directly proportional to 
the sum of the degrees of freedom. The constant of proportionality 
depended mainly on whether m was even or odd (the time taken 
for m even being .81 of the time taken for m odd, using a CDC 3200 
with programmed floating point). To a much lesser extent, it was 
influenced by whether 7 was even or odd (the time taken for n even 
being .99 of that for n odd). 

REFERENCES 
1. Owen, D. B. Handbook of Statistical Tables. Addison-Wesley, 
Reading, Mass., 1962. 
2. Smirnov, N. V. Tables for the Distribution and Density Func- 
tions of t-distribution. Pergamon Press, Oxford, 1961. 
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REMARK ON ALGORITHM 322 [814] 
F-DISTRIBUTION [Egon Dorrer, Comm. ACM 11 

(Feb. 1968), 116] 
Husert Totman (Recd. 7 Apr. 1970 and 13 Oct. 1970) 


Department of Mathematics, Northeast Louisiana State 
College, Monroe, LA 71201 


KEY WORDS AND PHRASES: Fisher’s 
Student’s t-distribution 
CR CATEGORIES: 5.5 


Replacing the statements 
for j := b + 2 step 2 until n do 
begin 


d := (1 +a/(j—2)) X dX 2; 
p: 


end 


by the algebraically equivalent statements 


a 


F-distribution, 


ifa = 1 then p +d X y/(j—1) else (p + w) X z 


ifa = 1 then 


begin 
for j := b + 2 step 2 until n do | 
begin 
d:= (1+ a/(j-2)) Xd X 2; 
p:=pt+dxX y/G-}) 
end 7; 
end 
else 
begin 
zk := 2 7 ((n—1) + 2); 
d:= dX zk X n/b; 
pi=pX ek+wxrXzxX (ek—-1)/(e-]); 
end; 


substantially reduces the execution time when m is even, and did 
not change the speed when m is odd. For the resulting algorithm, 
the execution time is proportional to m when.m is even, and 
proportional to m + n when m is odd. 


TABLE I. Percent Time SavIncs 


8 
16 
32 

n 64 
128 
256 
512 
1024 


m 
4 8 16 32 64 
34 3 2 1 1 
37 32 20 12 7 
62 54 45 30 19 
79 73 63 51 36 
88 85 78 68 54 
94 92 88 81 71 
96 95 93 90 83 
98 97 96 94 90 


Both the new and original forms of the algorithm were coded 
in Fortran and timed. The percentage reductions in execution 
time are given in Table I. The greatest reduction came when n 
is large and m is small. In many statistical applications n is sub- 
stantially larger than m and seldom smaller, thereby falling in 
the region of the greatest saving in execution time. 
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ALGORITHM 323 

GENERATION OF PERMUTATIONS IN 

LEXICOGRAPHIC ORDER [G6] 

R. J. Ornp-SmituH (Reed. 27 Apr. 1967 and 26 July 1967) 

Computing Laboratory, University of Bradford, Bradford, 
Yorkshire, England 


KEY WORDS AND PHRASES: permutations, lexicographic 
order, lexicographic generation, permutation generation 
CR CATEGORIES: 5.39 


Author’s Remark. Lexicographic generation involves more 
than the minimum of 7! transpositions for generation of 
the complete set of m! permutations of n objects. The actual 
number of transpositions required can be shown to tend 
asymptotically to (cosh 1) n! = 1.53n! However, lexi- 
cographic generation can be described by an algorithm 
requiring very simple book-keeping. The author is indebted 
to Professor H. I°. Trotter for suggesting an improvement 
to an original algorithm, which now results in a process 
more than twice as fast as the previously fastest lexi- 
cographic Algorithm 202 [Comm. ACM 6 (Sept. 1963), 
517]. Tabulated results below show BE'STLEX to be only 
9.3 percent slower than the transposition Algorithm 115 
(Comm. ACM 6 (Aug. 1962), 484] when n = 8. 

The usual practice is adopted of using a nonlocal Boolean 
variable called first which may be assigned the value true 
to initialize generation. On procedure call this is set false 
and remains so until it is again set true when complete 
generation of permutations has been achieved. Table I 
gives results obtained for BESTLEX. The times given in 
seconds are for an I.C.T. 1905 computer. é, is the time for 
complete generation of n! permutations. r, has the usual 
definition r, = ty/(n+tn-1). 


TABLE I 
Number of 
Algorithm ty tg 3 trans positions 
BESTLEX 6 47 0.98 — 1.53n! 
202 12.4 100 1.00 ? 
115 5.6 43 0.98 n!} 
procedure BESTLEX (2, n); valuen; integern; array z; 
begin own integer array g(2:n]; integer k,m; real t; 


comment own dynamic arrays are not often implemented. The 
upper bound will then have to be given explicitly; 
if first then 
begin first := false; 
for m := 2 step 1 until n do g{m] := 1 
end of initialization process; 
if q[2] = 1 then 
begin g[2] := 2; 


t:= a[h}; afl] := [2]; [2] := ¢; 
go to finish 
end; 


for k := 2 step 1 until n do 
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if g{k] = & then g[k] := 1 else go to frstart; 


first := true; k:=n; go to trinit; 

trstart: m := q{k]; ¢:= a2[m]; z[m] := alk]; z[k] := ¢; 
gk] := m+1; k:=k-—-1; 

trinit: m:= 1; 

transpose: t :=-a[m]; alm] := afk]; alk] := t; 
m:=m+1; k:=k—1; 
if m < k then go to transpose; 

finish: 


end of procedure BESTLEX 


CERTIFICATION OF ALGORITHM 323 [G6] 

GENERATION OF PERMUTATIONS IN LEXI- 

COGRAPHIC ORDER [R. J. Ord-Smith, Comm. 
ACM 11 (Feb. 1968), 117] 

I. M. Lerrcu (Recd. 9 July 1968, 6 Jan. 1969 and 17 
Mar. 1969) 

Department of Medicine, University of Newcastle upon 
Tyne, Newcastle upon Tyne, England 


KEY WORDS AND PHRASES: permutations, direct lexico- 
graphic order, reverse lexicographic order, lexicographic generation 
CR CATEGORIES: 5.39 


The ranking function Ra(ai , d2,--- , @,) which specifies the 
position of a permutation (a1 , G2, +--+ , a) of the numbers 0 (1) 
n—1 in a direct lexicographic order is commonly defined recur- 
sively[1] by 


Ra(0) = 0 
and 
» Qn) = a1+ (n—1)! + Ra (Mai, a2, 


where M(a1 , G2, +++ , Qn) is the permutation of the numbers 0'(1) 
n—2 obtained from ai , a2, -+- ,@n by deleting a; and reducing by 
unity all those elements which exceed ai . 

Reverse lexicographic order of a permutation (b1, bz: , «+ 
is defined by a similar ranking function, 


Rr(bi , bg, «++ 5 bn) = nm! — 1 — Ralbn > 


As reverse lexicographic order has the property (which direct 
order does not) that all the permutations which involve only the 
first K elements are generated before the (K + 1)-thelement is 
moved, it is sometimes preferred above the direct order. ‘The two 
are closely related since in any n-element permutation vector a 
typical element a; of the direct order corresponds to element dy-#4.1 
of the reverse order. As both of these orderings are in common use, 
it is inappropriate to describe either as lexicographic without 
further qualification. 

After replacement of the dynamic upper bound of the own 
integer array by a constant (necessitated by a compiler imple- 
mentation restriction), Algorithm 323 was compiled by the Kids- 
grove ALGOL compiler and run on an English Electric KDF9 com- 
puter. The full permutation was generated for values of n = 2 (1) 


Ra(ar , a2, °°° » Gn)) 


» bn) 


. , be , 61). 


COLLECTED ALGORITHMS (cont.) 


9. The permutations generated by BESTLEX (Algorithm 323) 
were compared automatically with those of Algorithm 202 [Comm. 
ACM 6 (Sept. 1963), 517]. It was known that Algorithm 202 gener- 
ated permutations in a direct lexicographic order,.and it was found 
that permutations were produced by BESTLEX in a reverse lexi- 
cographic order. 

The order in which the permutations of BESTLEX are genera- 
ted is governed by the own integer array qof that procedure and 
its integer counters m and k. Because of the simple relationship 
which exists between direct and reverse lexicographic order, the 
published algorithm may be modified so that it will generate per- 
mutations in direct lexicographic order by systematic application 
of the following three rules: 

1. Wherever the value 1 or 2 occurs either as a subscript expres- 
sion or an integer constant which is not part of a more complex 
expression, replace it by n or n—1, respectively. 

2. Redefine the bounds of g and the limits of both for loops to 
be from 1 to n—1. Reverse the direction of the k for loop. 

3. In the last seven lines of the algorithm, the integer counter k 
must be incremented by 1 from 1 (rather than decremented from 
n), and, similarly, wherever m-+1 appears in an assignment state- 
ment it is replaced by m—1. Consequently m and k must be re- 
versed in the comparison on the penultimate line of the algorithm. 

At each call of the algorithm these modifications redirect atten- 
tion from the beginning of the permutation vector to the end, and 
so cause permutations to be generated in direct order. However, 
because of the nature of these changes, no loss in computational 
efficiency should be expected (since the only extra arithmetic in- 
curred is the evaluation of n—1, which need be performed only 
once for each procedure call). This was confirmed at run times as 
the times taken to generate a full permutation in reverse order by 
the published algorithm and in direct order by the modified al- 
gorithm were identical. 

Table I gives the time in seconds (t,) which is required by each 
procedure for the complete generation of the n! permutations, 1, 
has the usual definition of t,/(n:ty-1). 


TABLE I 
Algorithm 4 b ‘a 
BESTLEX 10.01 80.08 1.00 
202 20.84 166.75 1.00 


Both algorithms were also tested under the Whetstone ALGOL 
interpreter on the KDF9, an ALGou compiler for the 1130, and the 
IBM 360 Model 67 Operating System Aucou ‘‘F’’ compiler. As the 
last two implementations do not recognize the concept of own, 
results were obtained by inserting an integer array into the pro- 
cedure heading as an additional parameter and by not declaring 
the own integer array in the procedure body. For comparison, 
execution times for the n! permutations which were recorded when 
the procedure was run on the IBM 360/67 are given in Table II. - 


TABLE II 
4 4s a) 


BESTLEX 7.6 61.01 0.99 


REFERENCES 


1. Leumer, D.H. Teaching combinational tricks to a computer. 
Proc. of Symp. in Appl. Math., Vol. 10, Amer. Math. Soc., 
Providence, R. I., 1960, pp. 179-193. 
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Remark on Algorithm 323 [G6] 
Generation of Permutations in Lexicographic Order 
[R.J. Ord-Smith, Comm: ACM 11 (Feb. 1968), 117] 


Mohit Kumar Roy [Recd. 15 May 1972] 
Computer Centre, Jadavpur University, Calcutta 32, 
India 


In presenting Algorithm 323, BESTLEX, for generating per- 
mutations in lexicographic order, the author has mentioned the 
number of transpositions. It may be remarked here that equal 
numbers of transpositions are required by both BESTLEX and 
the previously fastest algorithm, Algorithm 202 [1]. The exact 
number of transpositions (7;) necessary to generate the complete 
set of n! permutations is given by 


T, = n! (var) — (n+1)/2, ifn is odd, and 
T, = n! (Wn-2) — n/2, if nis even, 


1 1 
where van =P bo be et = 1.543 forn > 3. 


(2n)! 
The above expressions do not include the few extra transpositions 
(equal to the integral part of n/2) required by BESTLEX to gener- 
ate the initial arrangement from the final one, as this portion has 
not been included in Algorithm 202. Therefore, the number of 
transpositions has no importance in the context of the claim that 
BESTLEX is more than twice as fast as Algorithm 202. 

The main factor contributing to the speed of BESTLEX is 
the substantial reduction in the number of comparisons required, 
by the introduction of the own integer array g. Taking into account 
only those comparisons which involve array elements, the number 
of comparisons (C,) required to generate all the ! permutations 
can be shown to be equal to 


{ 
C, (Algorithm 202) = il eS a, 


C, (BESTLEX) = n! [4+¢n-1], 
1 1 1 

where y, = 1+—-+-=-4+---+—= 1.718 forn > 6. 
2! 3! n|\ 


This shows that the number of comparisons required by BESTLENX 
is lower by .859(7!) (approximately) in the case of the generation 
of all the 7! arrangements. 

Finally, a modification of the BESTLEX algorithm is sug- 
gested which will reduce the number of comparisons again by 
(1!)/2. The modification involves replacement of lines 2-14 of 
Algorithm 323 by the following. 


begin own integer array q{3:n); integer k, m; 
real t; own Boolean flag; 

comment Own dynamic arrays are not often implemented. The 
upper bound will have to be given explicitly; 


if first then 
begin first := false; flag := true 
for m := 3 step 1 until 1 do g[m] := 1 
end of initialization process; 
if flag then 
begin flag := false; 
ts=x[1]; x[l]:= x[2]; x]2] := 7; 
go to finish 
end; 
flag := true; 


for k := 3 step | until 1 do 


COLLECTED ALGORITHMS (cont.) 


References 
1. Shen, Mok-Kong. Algorithm 202, generation of permuta- 
tions in lexicographical order. Comm. ACM 6 (Sept. 1963), 517. 


Added in proof: An improved version of BESTLEX, viz. Al- 
gorithm 323A, Generation of Permutation Sequences: Part 2, by 
R.J. Ord-Smith [Comp. J. 14, 2 (May 1971), 136-139], which also 
incorporates the modification suggested here, has come to the au- 
thor’s attention. 


323-P 3- 


0 


COLLECTED ALGORITHMS FROM 


ALGORITHAL 3824 

MAXISLOW [H] 

G. Bayer (Reed. 31 July 1967) 

Technische Hochschule, Braunschweig, Germany 


KEY WORDS AND PHRASES: network, linear programming, 


maximum flow 
CR CATEGORIES: 5.41 


procedure mazflow (from, to, cap, flow, v, n, mflow, source, sink, 


inf, eps); 
value v, n, source, sink, inf; 
integer v, 1, source, sink, real inf, eps, mflow; 
integer array from, to; array cap, flow; 


comment The nodes of the network are numbered from 1 to sn. 
It is not necessary but reasonable that each number represent a 
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labj{k] := 3; 
labf(k] := if gjk < labf[j] then gjk else labf[j]; 
if k = sink then go to reached; 
lk := Ik +1; klisi[lk] := k; 
end: 
end; 
ek := eh +1; 7 := klistlek]; 
if 7 ~ 0 then go to path else go to maz; 
comment sink is labeled, find path and possible 
flow, reduce excess capacities along path; 
reached: 
j:= stnk; d := labf[j]; mflow := mflow + d; 
look: k := labj[j]; u:= uplk); 
for s := low[k] step 1 until u do 
begin | := ind[s]; 
if to[/] = j then flow[l] := flow[l] — d 


node. The data of the network are given by arrays from, to, cap 
in the following manner. There is a maximum possible flow of 
cap[t], nonnegative, leading from froml] to tolt], 7 = 1, ++: ,v. 

Compute the maximum flow mflow from source to sink, 
(source and sink given by their node numbers). inf represents 
the greatest positive real number within machine capacity. 
flowlt] gives the actual flow from froml?] to tolz]. Flows abso- 
lutely less than eps are considered to be zero. Literature: G. 
Hadley, Ianear Programming, Addison-Wesley, Reading (Mass.) 


end; 
-u:= uply); 
for s := low[j] step 1 until u do 


begin | := ind[s]; 
if fo{l] = & then flow{l] := flow{l] + d 
end; 
j:=k; if 7 # source then go to look; 
go to lab; 
max:; comment maximal flow found; 


and London, 1962, pp. 337-344. 
Multiple solutions are left out of account; 
begin integer |, j,k, r, lk, ek, u,s; real gjk, d; 
integer array low, up, klist, labj[1:n]}, ind[1:v]; 
labf[1:n]; 
comment Note structure of data lists in up and low; 
tas 1s 
for 7 := 1 step 1 until n do 
begin low[j] := 1; 
for r := 1 step 1 until v do 
begin if from|[r] = j then 


real array 


begin ind{l] := 7; 
flow[l] := cap|l]; l:= 1+ 1 
end 
end; 
uplj] := 1-1 
end; 
mflow := 0.0; 


lab:; 
comment Prepare lists for new labeling; 
for j := 1 step 1 until n do 
begin labj{j] := klist([j] := 0; 
labf[j] := 0.0 
end; 
labf [source] := inf; 
comment labeling; 


j i= source; lk := ek := 0; 
path: 

u i= uply]; 

for s := low[j] step 1 until u do 


begin | := ind{s]; 
k := toll]; gjk := flowl[l]; 
if labj[k] ~ 0 V abs(gjk) < eps 
then go to end; 


for | := 1 step 1 until v do 
flow[l] := cap[) — flow[l] 
end 


Remark on Algorithm 324 {H] 
Maxflow [G. Bayer, Comm. ACM I] (Feb. 1968), 117] 


G. Bayer [Recd. 5 Aug. 1971] 
Technische Universitat, 33 Braunschweig, Germany 


It is necessary to clarify the meaning of input parameters 
from, to and cap describing the given network. 

A connection between two nodes, say a and 5, must be given 
by two arcs like this: At two index-positions, say ia and ib, the 
input arrays have values 


from [ib] = 6 
to [ib] =a 
cap [ib] = capba 


from [ia] = a 
to [ia] = b 
cap [ia] = capab 


Even if one of the two flows, say capab from node a to node Bb, 
is zero, it must not be omitted, for otherwise the algorithm goes 
wrong. 

If there is no connection between two nodes, then no arcs are 
to be given. In this case another input yields the same result: 
Two arcs are given, each with a maximum possible flow of zero. 
(But this case is not physically, or in the sense of the algorithm, 
the same as the first one.) 
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ALGORITHM 325 

ADJUSTMENT OF THE INVERSE OF A SYM- 

METRIC MATRIX WHEN TWO SYMMETRIC 

EKLEMENTS ARE CHANGED [F1] 

GERHARD ZIELKE (Reed. 24 Aug. 1967) 

Institut fir Numerische Mathematik der Martin Luther 
Universitit Halle-Wittenberg, German Democratic 
Republic 


KEY WORDS AND PHRASES: symmetric matrix, matrix in- 
verse, matrix perturbation, matrix modification 
CR CATEGORIES: 5.14 


procedure INVSYM 2 (n, 1, j, c, a, b); 
value n,1,j,c; integer n,7,j; realc; array a, b; 
comment INVSYM 2 computes the inverse A~! = a of a non- 
singular symmetric nth order matrix A = B + c(eie;’ + e;e:') 
which arises from a symmetric matrix B by a change c in two 
elements B:; and B;; = Bi; (¢ ¥ j). The inverse matrix Bo = b 
is assumed to be known. The calculation with the new formula 


a=b-— 5 [b.:Chibs;. + heads.) + 0.;(had;. + Iadbi.)] 


where 
h = 1 + chi; ; he = —cb;; P hg = —cb;; 5 d= hy? — hhs 


requires n? + O(n) multiplications, therefore only about the 
same number of operations as if the well-known Sherman- 
Morrison formula for a change in one element (see Algorithm 
51 (Comm. ACM 4 (Apr. 1961), 180]) is used. In these equations 
e; denotes the 7th column and e;’ the ith row of the unit matrix, 
b.; = be; denotes the 7th column and b;. = e;’b the ith row of 
the matrix b; 

begin integer k,1; real hl, h2, h3, d; 
array 1, s[1:n]; 
Al :=1+¢X bfi, j]; h2:= —c X df, jl; 


h3 := —e X bf, 4]; d:=hl tT 2—h2>*% h3; d := c/d; 
hl :=hl Xd; h2:=h2X d; h3 := h3 xX d; 
for k := 1 step 1 until n do 
begin 
r{k] := hl X bij, kl] + h2 X Olt, k); 
s[k] := h3 X b[j, k] + hl X bfz, k] 
end; 


for k := 1 step 1 until n do 
for | := 1 step 1 until k do 
alk, 1] := all, k] := b[k, 1] — b{k, 2] X r[l] — blk, J] X s{l] 
end INVSYM 2 
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ALGORITHM 326 

ROOTS OF LOW-ORDER POLYNOMIAL 
EQUATIONS [C2 

TERENCE R. F. NoNWEILER (Reed. 14 Apr. 1967) 

James Watt Engineering Laboratories, The University, 
Glasgow W2, Scotland 


KEY WORDS AND PHRASES: rootfinders, polynomial equa- 
tion roots, quadratic equation roots, cubic equation roots, bi- 
quadratic equation roots, polynomial zeros 

CR CATEGORIES: 5.15 


ROOTFINDERS: 
begin 
comment suite of procedures finding the (complex) roots of the 
lower order polynomial equations by the familiar algebraic 
methods; 
procedure BIQUADROOTS(p, r); value p; array p, 7; 
comment finds the roots « = r[l, k] + sgri(—1) X r[2, k] of 
the biquadratic equation p[0] X # fT 4+ .--- + p[4] = 0; 
comment array r defined for subscript bounds [1:2, 1:4] and p 
for [0:4]. Failure occurs (in overflow) if p[0] = 0 and in other 
cases. Uses nonlocal procedures QUADROOTS and CUBIC- 
ROOTS; 
begin real e, 6, d, c, a; 
integer k, j; 
if p[O] ~ 1.0 then 
begin 
for k := 1 step 1 until 4 do p[k] := p[k]/p[0]; p[0] := 1.0 
end; 
e:= 0.25 X pil]; b:= e+e; c:=bXb; d:=0.75 Xo; 
b := p38] + 6 X (c—pl2]); a := pl2] — d; 
c:= p[4] +e X (exXa—p[3]); a:=a—d; pli] := 05 Xa; 
pl2] := (pll]xXp[l]—c)/4.0; [3] := 6 X b/(—64.0); 
if p[3] <0 then 
begin 
CUBICROOTS (p, 7); 
for k := 1 step 1 until 3 do 
if r[2, k] = 0 and 7[l, k] > 0 then 
begin 
d:=r{l,k] X40; a:=a+d; 
pil] := ifa > 0 = b> Othen sgqri(d) else -sgrt(d); 
b := 0.5 X (a+b/p[1]); go to QUAD 
end the general case jumping to QUAD; 
end nonzero p[3]; 
if p[2] < 0 then 
begin 
b := sqri(c); d:=b+b—<a4; 
pll] := if d < 0 then 0 else sqrt(d) 
end 
else 
begin 
b := sgrt(p[2]) X (if p[l]>0 then +2.0 else —2.0) + pl]; 
if b ~ 0 then p[1] := 0 else 
begin 
for k := 1 step 1! until 4 do 


ll 
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begin 
r{l, k] := —e; 
end; 
go to END 
end 
end; 
QUAD: p[2] := c/b; QUADROOTS(p, 1); 
‘fork :=.1,2do 
for j := 1, 2 do rly, k+2] := r[j, k); 
pll] := —plt]; pl2] := 6; QUADROOTS(p, 7); 
for k := 1 step 1 until 4 do 7[1, k] := r[1, k] — e; 
END: 
end BIQUADROOTS; 
procedure CUBICROOTS(p, rr); value p; array p, 7; 
comment finds the roots z = r[1, k] + sgrt(—1) X 7[2, k], ar- 
ranged in order (k=1, 2, 3) of increasing modulus, of cubic equa- 
tion p[0] X « 7 3+ --+ + p[3] = 0; 
comment array 7 defined for subscript bounds [1:2, 1:3] and p 
for [0:3]. Failure occurs (in overflow) if p[0] = 0 and in other 
cases. Assumes 0 < arctan(x) < pi/2 forz > 0; 
begin real s, ¢, b, c, d; 
integer k; 
if p[0] 4 1.0 then 
for k := 1 step 1 until 3 do p[k] := pi[k]/p[0); 
s := p{l]/3.0; ¢:=s X pf]; 
b := 0.5 X (sX (¢/1.5—p[2])+-p[8]); ¢ := (¢—p[2])/3.0; 
c 


r[2, k] := 0 


== tT 3; d:=bXb— Cc; 
if d > 0 then 


d := (sgri(d)+abs(b)) fT (1.0/3.0); 
if d ~ 0 then 
begin 
6b := if b > O then —d elsed; c := t/b; 
end; 
d := r[2, 2] :=sert(0.75) X (b—c); b:= b+; 
c:=r{l, 2] := -05X b-—s; 
ifb >0=s < 0 then 
begin 
rfl, 1]: 
r(2, 3] : 
end 
else 
begin 
rfl, 1]: 
r(2, 3] : 
end 
end the case of two equal or complex roots 
else 
begin 
d := if b = 0 then arctan(1.0)/1.5 else arctan (sqri(—d)/ 
abs(b))/3.0; 
b := sqri(t) X (if b<0 then 2.0 else —2.0); 


c; r[2, 1] := —d; 
0 


r[l, 3] := 


| 

o~ 
I 

iva] 


b—s; 
—d 


r(2, 1] := 0; rfl, 3] := ¢; 


c := cos(d) X 6; t := —sgrt(0.75) K sind) K b — 0.5 X ¢; 
d:=-t—c—s; c:=ece—s8; t:=t—-s8; 

if abs(c) > abs(t) then 7[1, 3] := c 

else 

begin 


rfl, 8) := ¢; t:= ce 
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end; 
if abs(d) > abs(t) then 7[1, 2] := d else 
begin 
r[l, 2} := t; t:=d 
end; 
rfl, 1) := ¢; 


for k := 1 step 1 until 3 do 7{2, k] := 0; 


end the irreducible case; 
end CUBICROOTS; 


procedure QUADROOTS(p, r); array p, 7; 

comment finds the roots x = rfl, k| + sqrt(—1) X 7r[2, k] 
arranged in order (k=1, 2) of ascending modulus, of the quadra- 
tic equation p[0] X z T 2+ pil] x x 4- p[2] = 0; 

comment array p defined for subscript limits [0:2] and r for 
(1:2, 1:2]. The entry values of the array p are preserved. Fails 
(in overflow) if p[0] = 0 and in other cases; 


begin real }, c, d; 


b 


= —p[1)/p[0]/2.0; ¢ := pl2]/pl0]; d:=b6 xX b—c; 


if d > 0 then 
begin 


b := 7[l, 2] := if b > 0 then sgri(d) + b else b — sqri(d); 


rfl, 1] := c/b; 7r[2, 1) := 7[2, 2] := 0 


end 
else 
begin 


d := r{2, 1] := sqri(—d);  r[2, 2] := --d; 
r{i, 1) := rfl, 2] := b 


end 


end QUADROOTS; 


end 
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ALGORITHM 327 

DILOGARITHM [822] 

K. §. Koéusie (Reed. 10 Oct. 1967) 

Applied Mathematics Group, Data. Handling Division, 
European Organization for Nuclear Research (CERN), 
1211 Geneva 23, Switzerland 


KEY WORDS AND PHRASES: dilogarithm function, special 
functions 
CR CATEGORIES: 5.12 


real procedure dilog(z);_ value xz; real z; 
comment This procedure evaluates the dilogarithm function 


id= -| (in |1 — y\/y) dy 


for real arguments x. 13 to 14 significant digits are correct, 

except for values of x near to the zero of d(x) on the positive axis 

(2 fy 12.6). This function appears in several fields of theoretical 

physics. The method of computation is described by Mitchell 

[1]. ForO0 < z < 0.5, a Chebyshev approximation is used, which 

was obtained by economizing the power series ) a=1 2"/n? with 

a multiprecision CERN library program [2]. 

REFERENCES: 

1. Mitcueti, K. Tables of the function { ; (—log | 1—y |/y) dy, 
with an account of some properties of this and related func- 
tions. Phil. Mag. 40 (1949), 351-368. 

2. Cartson, J. R. TCHEBY—telescoping of a polynomial. 


CERN 6600 Computer Program Library E203. (1966), unpub- 
lished; 
begin real f, u, y, 2; 
comment 3.289868 --- = 77/3, 1.644934 --- = «*/6; 
if « > 2 then 
begin 
z2:= 1/7; u:= —0.5 X n(x) TF 2 + 3.289868133696453 ; 
f:i=-l1 
end 
else if < > 1 then 
begin 
z2:= (4-1)/2; 
u:= —0.5 X n(x) X In(zXx—z) + 1.644934066848226 ; 
f:=1 
end 
else if « = 1 then 
begin 
dilog := 1.644934066848226; go to Ll 
end 
else if + > 0.5 then 
begin . 
z2:=1—2; w:i= —ln(z) X In(z) + 1.644934066848226 ; 
f:=-1 
end 
else if x > 0 then 
begin 


else if z = 0 then 
begin 
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dilog := 0; goto Li 
end 
else if x > —1 then 
begin 
z2:= 2/(a—-1); u:= —0.5 X In(l — 2) fT 2; f:=—-1 
end 
else 
begin 
z:= 1/(1—2x); 
= 0.5 X In(z) X In(a Tt 2Xz) — 1.644934066848226 ; 
f:=1 
end; 
:= 0.008048124718341 & z + 0.008288074835108 ; 
<= y X z — 0.001481786416153 ; 
y X 2 — 0.000912777413024; 
y X 2 + 0.005047192127203; 
y X 2 + 0.005300972587634; 
y X 2+ 0.004091615355944 ; 
y X z + 0.004815490327461 ; 
y X z + 0.005966509196748 ; 
y X z + 0.006980881130380; 
y X z + 0.008260083434161; 
y X 2 + 0.009997129506220; 
y X 2 + 0.012345919431569; 
y X z + 0.015625134938703 ; 
y X 2 + 0.020408155605916 ; 
y X z + 0.027777774308288 ; 
y X 2 + 0.040000000124677 ; 
y X 2 + 0.062500000040762; 
y X 2 + 0.111111111110322; 
= y X z + 0.249999999999859 ; 
yXe2et+1; dilog=fxyxXeztu4; 


one oe ee 


ii i 


end dilog; 
comment The procedure dilog was tested on a CDC 3800 com- 


puter, using an ALGoL compiler. It was translated into Fortran 
and run on a CDC 6600 computer. The tests included the fol- 
lowing: 

(i) Calculation of d(x) for x = —1(0.01)1. A comparison 
with the 9-figure table given in [1] revealed in few cases a 
discrepancy of 1 unit in the last figure. 

(ii) Calculation of d(x) for x = +10', 7 = 0(10)100, 

= —3(0.1)15, z = +10',7 = —20(1)0. 

(iii) Calculation of d(z) forz = 1+ 7X 10,7 = —10(1)10, 
m = 10 in the case of the CDC 3800, m = 14 for the CDC 
6600. 

In all. three cases the results have been compared with those 
obtained by summing the power series directly. Agreement to 
13 or 14 significant digits was found, with the exception men- 
tioned in the comment above; 
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Boolean finished; switch decompbranch := return, tir; 
ALGORITHM 328 switch failures := insufficientrank, zerolambda; 
CHEBYSHEV SOLUTION TO AN integer ml, nl, np1, 1, j,k, l, 6, al, al, lst, kmaz, ent; 
OVERDETERMINED LINEAR SYSTEM [F4] real lasteps, Preps, ref, s, t, eps, eta, enor snr 
real array P[0:n, 0:n], lam, rv, sv, x, w, xr[O0:n]; 
RicHarp H. BARTELS AND GENE H. Go.Lus integer array 7(0:n], ix[0:m—1]; 
(Reed. 8 June 1967 and 22 Nov. 1967) comment The subsystem of n + 1 equations currently being 
Computer Science Dept., Stanford University, Stanford, investigated is listed in ix[0], --- , tz[n]. The other equations 
Calif. 94305 are listed in the remainder of iz. r contains row indices. Row 
interchanges during the Gauss decomposition of P are carried 
KEY WORDS AND PHRASES: Chebyshev solutions, over- out by permuting the elements of 7.; 
determined linear systems, linear equations, exchange algo- m:=m-—1; nl:=n-—-1; npli=n+]; 
rithm lasteps := 0; preveps :=— 1; 
CR CATEGORIES: 5.18, 5.14, 5.41 for i := 0 step 1 until n do r{t] := tz[t] := 7; 
for 7 := npli step 1 until m1 do zz[2] := 7; 
procedure Chebyshev (A, d,h, m,n, refset, epz, insufficientrank, comment The initial reference subsystem is chosen by making 
zerolambda) ; a copy of the transpose of A bordered with d and carrying out 
value m,n; integer m,n; real array A, d,h; a Gaussian reduction upon it with row and column inter- 
integer array refset; real epz; label insuffcientrank, zero- changes used to select the largest possible pivot at each stage.; 
lambda; begin 
comment Chebyshev computes a solution in the Chebyshev real array TAB([0:n, 0:m1]; 


sense to an overdetermined system of linear equations, Az = d. 


for 7 := 0 step 1 until m1 do 
Details and notation are given in a paper by Bartels and Golub 


begin 
[Comm. ACM 11 (June 1968), 403-408]. TAB[n, j] := djl; 
The parameters to procedure Chebyshev are: for 7 := 0 step 1 until nl do TAB[i, j] := Af{j, 4] 
end; 
identifier type comments for i := 0 step 1 until n do 
m integer Number of equations begin 
n integer Number of unknowns t:= 0; 
A real array Matrixof coefficients for j := 7 step 1 until n do 
Array bounds—[0:m-1,0:n-1] begin 
d real array Right-hand-sidevector k := r{g]5 
Array bounds—|[0:m-1]} for | := 7 step 1 until ml do 
h real array Solution vector begin 
Array bounds—[0:n-1] ref := TABjk, iz{l)l; 
refset integer Final reference equation numbers if abs(ref) > t then 
array Array bounds—[0:n] begin 
ep2 real Final reference deviation s:= ref; t:=abs(ref); al:=j7; b:=1 
zerolambda label Exit for condition | failure end 
insufficientrank label Exit for condition 2 failure, or . end 
in case rank (A) <n end; 


if 1 = Othen begin; := 1; goto singular end; 
k := rfall; rial] := r[t]; lst := r{t] := k; 


The p: t ,n, A andd t ch d by Chebyshev. ; , ee 
e parameters m,n, A andd are not changed by Chebyshev Eee tae elt al = eek 


We direct the user’s attention to the identifier efa appearing in 


the procedure and to the comment explaining its value and for j = t+ 1 step 1 until ml do 
purpose. ; se all 
begin = Uj) 
real procedure ip (ii, ll, uu, aa, bb, cc); ref is eri l/s; oe 
value ll, uu, cc; realaa, bb, cc; integer it, ll, wu; ye aaa ty OSE anne 
comment single-precision inner-product routine; pee r{k]; 
Bae oe TABlal, 1] := TABlal, t] — TABlal, al] X ref 
sum :=. CC; os 
for a := ll step 1 until uu do sum := sum + aa X bb; “: 
tp := sum er 
end 7p; Soe 


b:=0; al := 1; 

comment The following segment of the program performs a 
column-by-column Gaussian reduction of the matrix associ- 
ated with the reference equations, forming an upper and a 
lower triangular matrix into the array P. (Each diagonal 


real procedure 1p2 (11, ll, wu, aa, bb, cc); 

comment 7p2 is a version of 7p which accumulates the products 
aa X bb-in a double-precision sum, whose final value, rounded 
to single-precision, is taken as the value of 7p2.; 
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element of the lower triangular matrix is one.) Interchanges 
of rows take place so that the largest pivot in:each column is 
employed. It is assumed that 6 — 1 columns have already 
been decomposed. If the matrix is not of full rank, the exit 
insuffictentrank is taken, and it is left up to the user to deter- 
mine if the given overdetermined system can be solved 


exactly.; 
body : 
for 7 := b step 1 until n do 
begin 
l:= i2[t); 
for j := if + = b then 0 else b step 1 until n:do 
begin 
kmax := if j <ithenj — 1 elsez — 1; 
Pli, r{jl] = —ip(k, 0, kmaz, Pf, r[kll, Pie, rfl, 
— (if r[j] = n then d[l] else A[l, r[7]])) 
end; 
ref := 0; 
for j := 2 step 1 until n do 
begin 
t= Pli, rlill; 


if ref < abs(t) then 
begin ref := abs(t); 
end; 
if ref = 0 then begin j := 1; go to singular end; 
if 7 = n then go to decompbranch[al}; 
j= rk]; lk) := r[t]; r[t] = 9; 
for j := 1+ 1 step 1 until n do 
Pit, rfl] := Plt, r[jll/s 
end; 
singular : 
for 7 := 0 step 1 until n do refsel[¢] := tz[t]; 
go to failures{j]; 
comment Solve for the lambdas.; 
return: 
for j := b step 1 until n do 
so[j] = —ipk, 0,7 — 1, sofk], Pik, r[gll, 
— (if r[j] = n then —1 else 0)); 
for 7 := n step —1 until 0 do 


s:=t; k:=jend 


lam{j| := —tp(k,j +1, n, lam[k], Pik, r[jll, —selj))/Ply, rlglls 
comment Compute epsilon for the reference subsystem of 
equations.; 
i:= 0; 


for 1 := 0 step 1 until n do t := ¢ + abs(lam[t]); 

eps := 1/t; 

comment Each new value of eps must be greater than the 
previous one. If this is not so, the solution may have been 
“overshot’’.; 

if eps < lasteps then go to ed; 

lasteps := eps; 

comment Solve for the vector z, the Chebyshev solution of the 
reference subsystem of equations. ; 

for 2 := 0 step 1 until n do zr{t] := stgn(lam[t]) X eps; 

for «1 := 0 step 1 until n do 


wt] = —tp(j, 0,7 — 1, wl], Plt, rial], —xr[t))/Ple, rie]; 
for 7 := n step —1 until 0 do 
alr[i]] := —tpj,¢ +1, n, efrfy]l, Ple, rly], —wle)); 


comment 2/[n] should be —1. It can be used to purify eps and 
the other components of 2.; 

ref: = —a[n]; 

for 7 := 0 step 1 until n1 do x[t] := z[2]/ref; 

eps := eps/ref; 

comment For each index iz[n+1], --- , iz[m—1] compute the 
residual A[iz[j], 0] X [0] +--+ + Alea[j], n—1] X z[n—1] — 
d{ix[j]]. If the largest of these in magnitude is not greater than 
eps, go to itr to refine the vector z, for it may be the Chebyshev 
solution of the full system.; 
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ref := —1; 
for j := npl1 step 1 until ml do 
begin 
ti= taj); 


t := tp(k, 0, nl, x[k], Alt, k] —dlz]); 
if abs(t) > ref then 
begin ref := abs(t); al := j; 
end; 
if ref < eps then go to ?tr; 
ovr: 
k := t2[al]; 
comment The following linear-system solution is computed in 
order to determine which equation is to be dropped from the 
reference set of equations.; 
for 7 := 0 step 1 until n do 
wit] = —ip(j, 0, oe wil, Pij, r[e]], 
— (if r[t] = n then d[k] else A[k, r[z]])); 
for 7 := n step —1 until 0 do 
wt] := —ip(j,i +1, n, wfy], Ply, r[el], —wlt))/Ple, r{c); 
comment $s is the sign of the residual with greatest magnitude. 
Find the largest of the ratios (w[k]/lam[k]) X s. If any compo- 
nent of lam is zero, the exit zerolambda is taken.; 
ref := lam[n]; 6b := n; 
if ref = 0 then begin j := 2; go to singular end; 
ref := (w[n]/ref) X 8; 
for j := 0 step 1 until nl do 
begin 
t := lam{y]; 
if t = 0 then begin j := 2; go to singular end; 
t:= (wljl/t) X 8; 
if ¢ > ref then begin b := 7; ref := tend 
end; 
comment Form a new reference subsystem by exchanging the 
tz[al]-th and i2[b]-th equations. ; 
tzfal] := iz[b]; iz[b] :=k; al := 1; 
ed: 
comment Restore the previous reference substystem. ; 
eps := lasteps; al := 2; 


s:= sign(t) end 


go to body; 


j:= izial]; izlal] := iz[b]; iz[b] := 7; goto body; 
ttr: 

lasteps := 0; cnt := 0; 

comment Iteratively refine the vector z.; 
alp: 


cent := cnt +1; if cnt > 10 then go to insuffictentrank; 
cnorm := snorm := 0; 
for 7 := 0 step 1 until » do 


begin 
k := iz[t]; 
t:= abs(z[z}); 
if snorm < t then snorm := ¢; 
roi] := —ip2(j, 0, n, a2fj], if 7 = n then dk] else 
Al[k, Ps —zxr(t]) 
end; 
‘for z := 0 step 1 until n do 
rolt] := —tp(j, 0,2 — 1, rofl, Plt, rig, —relt))/Ple, rie]]; 
for 7 := n step —1 until 0 do 
wlrlt]] -= —ip(7,t+1, n, w[rly]], Ple, rig, —rele)); 
for 7 := 0 step 1 until 1 do 
begin 
s := ult]; 
zt] := a[t] + 8; 
s := abs(s); 
if cnorm < s then cnorm := s 
end; 


if cnorm/snorm > ela then go to ilp; 
comment eta is to be preset with a small positive multiple 
of the largest positive single-precision machine number w 
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having the property that 1 + » = 1 — w = 1 in a single-pre- 
cision arithmetic. The small multiple will depend upon the 
peculiarities of the machine’s rounding process and will have 
to be empirically determined. ; 

ref := —a[n]; 

for 7 := O step 1 until 11 do z[7| := x{t]/ref; 

eps := eps/ref; 

comment Determine whether a Chebyshev solution has been 
found. If any residual is greater in magnitude than eps while 
eps is smaller than a value produced from an earlier refinement, 
give up, print a warning, and return the best x computed thus 


far.; 
ref := —1; 
for j := npl step 1 until ml do 
begin 

24 := i2([J]; 


t:= wp2(k, 0, nl, z[k], Alt, k], —d[e}); 
if abs(t) > ref then 
begin ref := abs(t); al := j; 
end; 
if ref < eps then finished := true 
else if eps > preveps then finished := false 


s := sign(t) end 


else 
begin outstring (1, ‘DOUBTFUL SOLUTION’); 
go to skip 
end; 
preveps := eps; refset[n] := iz[n]; 
for 71 := 0 step 1 until n1 do 
begin 
refset[t] := t2{2]; 
Alt] := a[t] 
end; 
if — finished then go to ovr; 
skip: 
epz := preveps; 


end Chebyshev 


CERTIFICATION OF ALGORITHM 328 [F4] 

CHEBYSHEV SOLUTION TO AN OVERDETER- 

MINED LINEAR SYSTEM [Richard H. Bartels and 
Gene H. Golub, Comm. ACM 11 (June 1968), 428] 

Norman L. Scuryer (Recd. 14 Nov. 1968, 2 Dec. 1968 
and 27 Jan. 1969) 

University of Michigan, Ann Arbor, Michigan 

KEY WORDS AND PHRASES: Chebyshev solutions, over- 


determined linear systems, linear equations, exchange algorithm 
CR CATEGORIES: 5.18, 5.14, 5.41 


Two modified versions of the procedure Chebyshev have been 
written, one with and one without iterative improvement. The 
algorithms were compiled in Forrran IV on an IBM System/360 
model 67 in double-quadruple and double-precision, respectively. 
When run on the following test system 


ll —8 6 | =34 


0 —15 —12 — 54 
—13 —-8 10 11 
7 8 2 uM 3 

10 —7 9 Le = —64 

0 -—5 5 Xa —19 

7 10 9 13 


—15 0 15 30 
—15 3 —15 | 
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both versions gave the correct answer, x = (—3, 4, —1), to full 
double-precision accuracy (16 digits). The above versions of the 
procedure Chebyshev differ from the published ones in two ways. 
Without these changes the routines have gone into an ‘‘infinite 
loop” in certain circumstances. 

On page 429, first column, the 14th line following the label 
return should be changed from 

if eps < lasteps then go to ed; 
to 

if eps < lasteps then go to ed; 
The above change eliminates the problem of ‘‘infinite loops.” 

When using the version without iterative improvement, one 
additional change is necessary. Change the code following the 
label dtr as indicated on page 405, column 2. Then replace the code 
between labels ed and dir on page 429, column 2, by the following: 


comment Restore the previous reference subsystem; 
eps := lasteps; 
j := t2fal]; ix[al] := ix[b]; ix[b] := 7; 
ref := —1; 
for j := np1 step 1 until ml do 
begin 
7 := iz]; 


t := ip2(k, 0, nl, z[k], Alz, k], —d[z]); 
if abs(t) > ref then ref := abs(t) 
end; 


This change is necessary in order to give the real variable ref the 
proper value for determining if the vector x is a solution or a 
“doubtful solution.”’ That is, the above value of ref will be used 
in the code following the label zér to determine if we have a ‘‘doubt- 
ful solution.” 
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ALGORITHM 329 
DISTRIBUTION OF INDISTINGUISHABLE 
OBJECTS INTO DISTINGUISHABLE SLOTS [G6] 
Rosert R. l’EnticHEL 
(Reed. 24 Aug. 1967 and 8 Dee. 1967) 
Electrical Engineering Department, Massachusetts Insti- 
tute of Technology, Cambridge, Mass. 02139 


KEY WORDS AND PHRASES: 
tions, distribution numbers 
CR CATEGORIES: 5.39 


object distributions, combina- 


procedure dist (k, m, done, q, FirstCall); 
value k, m; integer k,m; label done; 
integer array gq; Boolean FirsiCall; 

comment Successive calls to this procedure compute the 
ee 
— om 
objects into & distinguishable slots. Upon the first call to dist, 
FirsiCall must have the value true. This value is changed to 
false during the processing of the first call. 

Upon return from a call to dist, a new distribution has been 
noted in g[1:k], an integer array. In particular, the number of 
objects to be distributed to the 7th slot has been left as the value 
of qlz]. 


The call following the Gem )-t will cause transfer to the 


label done. 

The values of g must not be altered between calls to dist. 

The method is best introduced by means of aniexample. Sup- 
pose that 9 objects must be distributed among 3 slots. Each dis- 
tribution might be denoted by a three-digit decimal number 
whose digits sum to 9. By the Rule of Nine, each such ‘“‘distribu- 
tion number”’ is divisible by 9. Conversely, many multiples of 9 
are distribution numbers, although some (e.g. 189 and 198) are 
not. 

Now the method is as follows: 

1. Treat g{1] --- g[k] as a k-place number in a number system 
based on (m+1). Usually, return from dist after adding m to this 
number. 

2. If git—1] ¥ q[z] = g{ft+1] = --- = glk] = 0, adding m will 
not result in a distribution number: the sum of the digits will be 
too large. Find the next distribution number by 

a. Setting g[k] := g[t—1] — 1. 
b. Setting g[7—1] := 0. 
c. Adding 1 to g[t—2]. 

The author is indebted to the anonymous referee who, at one 
point in this algorithm’s development, had evidently given it 
more thought than had the author; 

begin integer 7; own integer LefimostZero; 
if FirstCall then 


) distinguishable distributions of m indistinguishable 


begin 
for 7 := 1 step 1 until k — 1 do 
qlt] := 0; 
LeftmostZero := k +1; 
glk] := m,; 


FirstCall := false 
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end 

else if g[1] = m then go to done 

else if LeftmostZero < k + 1 then 

begin 
LeftmostZero := LeftmostZero — 1; 
g{k] := q[LeftmostZero] — 1; 
q[LeftmostZero] := 0; 
qiLeftmostZero — 1] := q[LeftmostZero — 1] +1 

end skip 99, 189, 198, etc. 

else 

begin 
if g{k] = 1 then LeftmostZero := k; 
glk] := glk] — 1; 
glk—1] := glk—-1] +1 

end add m to units place 

end of dist 


REMARK ON ALGORITHM 329 [G6] 

DISTRIBUTION OF INDISTINGUISHABLE OB- 

JECTS INTO DISTINGUISHABLE SLOTS [Robert 

BR. Fenichel, Comm. ACM 11 (June 1968), 430] 

M. Gray (Reed. 20 Sept. 1968) 

Computing Science Department, University of Adelaide, 
South Australia 


As the procedure stands it is incorrect. Preceding 
end skip 99,189,198, etc. 
the following statement should be inserted: 
if g{k] + 0 then LeftmosiZero := k+1 
Thus the compound statement becomes: 


begin 
LeftmostZero := LefimostZero —1; 
g{k] := g[LeftmostZero] — 1; 
q(LeftmostZero] := 0; 
q[LeftmostZero—1] := g[LeftmostZero—1] + 1; 
if q[k] ~ 0 then LefimostZero := k + 1 
end skip 99, 189, 198, etc. 


COLLECTED ALGORITHMS FROM CACM 


ALGORITHM 330 

FACTORIAL ANALYSIS OF VARIANCE [G1] 

Tan OLIVER (Recd. 21 Sept. 1967 and 12 Jan. 1968) 

Computer Center, The Ohio State University, 1314 Kin- 
near Rd., Columbus, Ohio 43212 
(Now at Computer Centre, University of Queensland, 
St. Lucia, Brisbane, Australia 4067) 


KEY WORDS AND PHRASES: 
variance, statistical analysis 
CR CATEGORIES: 5.5 


factorial variance analysis, 


procedure factorial ANOVA (X, n, levels, T); 
valuen; integer 7; integer array levels; real array X, T; 

comment This procedure carries out an analysis of variance on 
the data from a balanced complete factorial experiment. The 
experimental observations are assumed to be stored in the array 
xX. The elements of the array levels are assumed to contain the 
number of levels in each of the n factors. The procedure produces 
the sum of squares for the analysis of variance table in the array 
7. A method of orthogonal transformations [1] is used. 

The levels of the j-th factor are numbered 1, 2, --- , levels[j]. 
The observations are conveniently stored in a multidimensional 
array. For example, for n = 3, X[1, 3, 2] is the observation taken 
at levels 1, 3, and 2 of the first, second and third factors re- 
spectively. factorial ANOVA actually uses the procedure index 
to compute the multidimensional subscript and uses X as a one 
dimensional array so that n may have any value. Thus, if 
factorial ANOVA is called with a multidimensional array as the 
first argument, then index may have to be rewritten for a given 
compiler to correctly compute any multiple subscript. As 
written, index assumes that X has been declared in a statement 


such as real array X[1:levels{1],---,1:levels[n]] and that the com- 
piler arranges storage so that the first subscript varies most 
rapidly. 


Alternatively the data may be transmitted in a linear array 
so that. the factor levels associated with each observation are 
ordered so that the levels of the first. factor vary most rapidly. 
The procedure index will then require no modification. 

The array 7' may also be considered a linear array, or an n-di- 
mensional array declared in a statement of the form real array 
T(1:2,1:2,---,1:2]. Element 7'|2,1,---,1] is the sum of squares 
for the main effect of the first factor. T[1,2,1, ---,1] is the main 
effect for the second factor. 7'[2,2,1,- --,1] is the interaction be- 
tween the first two factors, and so on. If T is considered as a 
linear array, an element may be interpreted by examining the 
bit pattern in the binary value of the subscript minus one. For 
example, 7(6)=7[5+1] is the interaction between the first and 
third factors. 

On return from factorial ANOVA the data array X will con- 
tain orthogonal components of the sums of squares in the array 
T. As written, the components are the squares of values obtained 
by performing an Helmert transformation [2] for each factor. 
The procedure orthog may be modified, if the components are re- 
quired per se, to produce any desired orthogonal contrasts. 

The advantages and limitations claimed for factorial ANOVA 
are as follows. The procedure is very conservative of storage 
provided no factor has a large number of levels. The amount of 
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temporary array storage required is 3n + m(m+2) where m is 
the maximum number of levels in any factor. The procedure 
body is also very short. The routine should therefore be useful 
for small computers or for inclusion as a subroutine in programs 
whose primary purpose is not the statistical analysis. No com- 
parison of running time has been made with other methods but 
this routine requires Il: levels:i(>.; levels:+1) floating multipli- 
cations and may therefore be comparable in speed with the 
method described in [38]. 


This procedure is intended to present an algorithm rather 
than an optimal program for an algorithm and so the coding can 
be considerably improved in efficiency which was somewhat 
sacrificed for clarity. 

Acknowledgment. The author wishes to thank the referee and 
the editor for their valuable comments and suggestions. 
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begin integer factor, k1,k2,7; integer array 7, Ti, Tlimit [1:n]; 

integer procedure index (subscript, limit); 

integer array subscript, limit; 
begin integer j, temp; 

temp := 0; 

for 7 := n step — 1 until 1 do 

temp := temp X limit[j] + subserrpt[j| — 1; 

index := temp + 1; 
end index procedure; 
procedure orthog(Q, size); 

value size; integer size; 
begin integer 7, j; 

for 7 := 1 step 1 until size do Q[z, 1] := 1.0/sqrt(size); 

for j := 2 step 1 until szze do 


real array Q; 


begin 
for 7 := 1 step 1 until 7 — 1 do 
Qli, J] := — 1.0/sqrt(7X G—1)); 


Q(j, j]:= sartG—1)/7); 
for 1 :=j7 +1 step 1 until size do Q[z, j] := 0 


end 
end orthog procedure; 
comment Carry out orthogonal transformation; 


for factor := 1 step 1 until n do 
begin 
real array A, B[1:levels[factor]], Q{1:levels[factor], 
1:levels[factor]]; 
orthog(Q, levels{factor]) ; 
for j := 1 step 1 until n do 7[7] := 1; 
loop1: for z[factor] := 1 step 1 until levels [factor] do 
A[t[factor]] := X[tndex(z, levels) ]; 
for ki := 1 step 1 until levels[factor] do 
begin B{kl| := 0; 
for k2 := 1 step 1 until levels[factor| do 
Bik1] := B [kl] + Q[k2, k1] & A[k2] 
end; 
for i[factor] := 1 step 1 until levels[factor] do 
X {index (i, levels)| := Blt[factor]]; 
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for j := 1 step 1 until n do 
if 7 ~ factor then 
begin 
t(gl:= tg] +1; 


if 7[7] < levels [7] then go to loop! else 7/7] := 1 


end 
end; 


comment Form mean squares and sums of squares; 


for j := 1 step 1 until n do 
begin T7[j] := 1; Tlimit[j] := 2 end; 
loop2: forj := 1step1 until n do7z[j] := T1[j); 
kl := index(Ti, Tlimit); T{k1] := 0; 
loops: k2 := tndex(t, levels); 
X(k2) := X[k2] f 2; Tlkl] := T[k1] + X[k2]; 
for j := 1 step 1 until n do 
if Tr[j] ~ 1 then 
begin 
aj) := a7] + 1; 


if ¢[j] < levels[j] then go to loop3 else i[j] := 2 


end; 
for 7 := 1 step 1 until n do 
begin 
Tilj] = Tilj] +1; 
if Ti[j] < 2 then go to loop2 else Ti[j] := 1 
end 
end factorial ANOVA 
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ALGORITHM 331 

ZAUSSIAN QUADRATURE FORMULAS [D1] 
WALTER GAUTSCHI (Recd. 26 Aug. 1967 and 8 Feb. 1968) 
Purdue University, Lafayette, Ind., and Argonne National 


Laboratory, *Argonne, Ill. 60439 
*Work performed under the auspices of the US Atomic Energy 
Commission 


KEY WORDS AND PHRASES: quadrature, Gaussian quad- 
rature, numerical integration, weight function, orthogonal poly- 
nomials 

CR CATEGORIES: 5.16 


begin 

comment The procedure Gauss below obtains Gaussian quad- 
rature formulas relative to any weight function whose singu- 
larities, if any, are monotonic and located at the endpoints of 
the (finite or infinite) interval of integration. The procedure is 
most useful for (but not restricted to) ‘“‘nonclassical’’ weight 
functions, i.e. weight functions for which the associated or- 
thogonal polynomials are not known explicitly; 

real procedure Fourier (c,n); valuec,n; integern; realc; 

comment This is a subroutine computing 


ee > cos (2mé) 


c == cos @ 
m=1 4m? = 1 : : 


the truncated Fourier series of (7/2) sin 6; 
begin integer m; real cO, cl, c2,t, sum; 
clL:=1; O:=2XexXe-—-1; ¢:=2X 0; 


sum := c0/3; 
for m := 2 step 1 until n do 
begin 


c2:= cl; cl := cO; cO:=tX el — c2; 
sum := sum + c0/(4KmxXm-1) 
end; 
Fourter := 1— 2 X sum 
end Fourier; 
procedure transform (1, phi, phil); valuet; real t, phi, phil ; 
begin real 11; 
tl := abs(t); 
phi := t/(1—tl); phil := 1/((1-t1)K (1—-41)) 
end transform; 
procedure symm transf (t, phi, phil); valuet; real t, phi, phil; 
begin real (2; 
2:= UX 1b; 
phi := t/—t2); | phil := (1+12)/((1—12) x (1—#2)) 
end symm transy ; 
procedure Gauss (sequential, finite left, jinite right, left, right, 
eps, wf, capn, n, results) ; 
value sequential, finite left, finite right, left, right, eps, capn, n; 
integer capn,n; real left, right, eps; 
Boolean sequential, finite left, finite right; 
real procedure wf; 
array results; 


comment This procedure generates approximate values for 
the abscissas and weights of Gaussian quadrature formulas with 
weight function wf. If the Boolean variable sequential has the 
value true, then k-point formulas 
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6 k 
[ g(x)wf(x) de y we gas”); 


—o Sa<r <2fM <... <a <bs , 


are generated fork = 1,2, --- ,n, the abscissa z-) being stored 
in results [k, r], the weight w- in results [n+1—k, n+2—r]. 
The array results, in this case, should be declared to have dimen- 
sions [l:n, l:n+1]. If the value of sequential is false, then a 
single n-point formula is produced with the abscissa z/ being 
stored in results [1, r], the weight w,“ in results. [2, r]. In this 
case, the array results need only have dimensions [1:2, 1:n]. 
The Boolean variable finite left must be assigned the value 
true, if the lower limit of integration, a, is a finite number, 
otherwise the value false. Similarly for the upper limit 6 and 
the associated Boolean variable finite right. The parameter left 
is to be set equal toa, if a is finite, and may be assigned an arbi- 
trary value, if a = —. Similarly for the parameter right, 
which should be equal to }, if 0 is finite, and may be arbitrary, 
if b = «. The parameter eps is a tolerance used to control 
termination of Newton’s iteration for the calculation of the 
abscissas z-‘*). If d significant digits are desired one may set 
eps = .5 X 10-4. Some leeway should be allowed to accommo- 
date moderate accumulation of rounding errors. 

The method of computation is based on a suitable discretiza- 
tion of the inner product (f, g) = f° f(x)g(x)wf(z) dz, the num- 
ber of points used in the discretization being given by capn. 
The desired abscissas and weights are approximated by the 
zeros and weight factors of the resulting orthogonal polynomials 
of a discrete variable. The process converges as capn —> ~, 
provided the singularities of the weight function wf, if any are 
present, are located at the endpoints a, b and are monotonic. 
The traditional approach via moments is deliberately avoided 
because of its ill-conditioned character (when 7 is large). Fur- 
ther details of the method are to appear elsewhere [4]. 

No general rules can be given for the appropriate value of 
capn, the choice depending both on the desired accuracy and the 
rate of convergence of our process. A reasonable approach is to 
try, say, capn = 10 X n, and to repeat with a larger value of 
capn (say twice as large). If the results agree to within the de- 
sired accuracy, those of the second trial may be accepted as 
final. Otherwise, capn might be further incremented. 

The nonsequential version of the procedure is preferable if 
quadrature formulas for only one, or a few, selected values of n 
are desired. 

The procedure Gauss calls on the procedures transform, symm 
transf, and the real procedures Fourier, wf, all of which (except 
the last) are declared above. The real procedure wf has to be 
supplied by the user; 


begin 


integer k, m, r, kmaz, count, 1t; 

real epsl, sum, phi, phil, 10, t1, poll, poll, q, c0, cl, c2, lower 
bound, upper bound; 

array w, z{l:capn], a[0:n—1], b[0:n], p0, pl, p2[—1:capn], 
p[—1in], lrst{O:n]; 

procedure p and pl (bool, m, n, t, p0, p, pl); 
value m, n, t; integer m,n; real t; Boolean bool; 
array p0, p, pl; 
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comment This procedure evaluates the m-times deflated (dis- 
crete) orthonormal polynomials p,(xz)(r=m, m+1,---, n), 
as well as their first derivatives (if beol is true), for given 
argument ¢. The array p0 is assumed to hold the values of the 
(m—1)-times deflated polynomials evaluated at the m-th 
zero Of pn. When m = 0 these are the values 1, 0, 0, --- , 0; 
begin integer r; 
plm] := pOlm—1]/b[m]; pim—1] := 0; 
for r := m step 1 until n — 1 do 
plr+1] := (p0[r]+ ¢—alr]) X p{r]—b[r]X p[r—1]) /o[r +0); 
if bool then 


begin 
pl[m] := pll[m—1] := 0; 
for r := m step 1 until n — 1 do 
pllr tl] := (pir]+(t—alr) Xpl[r]—b[r]X pllr—1pVblr-+1] 
end 
end p and 71; 
lower bound := left; upper bound := right; 


comment The piece of program extending from this point to 
the second following comment sets up the abscissas 2: and 
weight factors wx to be used in the inner product of the dis- 
crete orthogonal polynomials. It is here (and only here) 
where explicit use is made of the given weight function wf; 

kmax := entter (capn/2); 

for k := 1 step 1 until kmaz do 

begin 
az[capn+1—k] := cos(1.5707963268X (2X k—1)/capn); 
zlk] := —2z[capn--1—k]; 
wi{k] := wleapn+1—k] := Fourier (2[k], kmaz) 

end; 

comment In the preceding for-statement the values of the 
cosine could have been generated recursively: with consider- 
able saving of time, but some loss of accuracy, if capn is very 
large. It was decided to sacrifice efficiency in favor of accu- 
racy. 

If the weight function contains a square root singularity, 
typified by x~2 at x = 0, rather improved accuracy may result 
from modifying the last preceding statement to read 
wl[k]:=w[capn+1—k]:=1.5707963268 X sgri(1—-x[k]X2[k]), 
and the second following statement to read 

wl[kmax+1] := 1.5707963268. 
This is especially so if the square root singularity occurs at 
both endpoints; 
if capn/2 ~* kmax then 
begin 
a{kmax+1] := 0; wlkmax+1] := Fourier(0, kmax) 
end; 
if finite left then 
begin 
if finite right then go to Ll else go to L2 
end 
else 
begin 
if finite right then go to L3 else go to L4 
end; 
Li: for k := 1 step 1 until capn do 
begin 
x(k] := ((right-left) Xx2[k]+right+ left) /2; 
wlk] := (right-left) Xw{k]X wf ([k])/capn 
end; 
go to continue; 
L2: for k := 1 step 1 until capn do 
begin 
transform (.5X (1+<2[k]), phi, phil); 
alk] := left + phi; 
w{k] := wik] X wf(a[k]) * phil/capn 
end; 


331-P 2- 0 


go to continue; 
L3: fork := 1 step 1 until capn do 
begin 
transform (.5X(—1+2[k]), phi, phi); 
ak] := right + phi; 
wk] != wlk] X wf(z[k]) X phil/capn 
end; 
go to continue; 
LA: for k := 1 step 1 until capn do 


begin 

symm transf (x{k], phi, pht1); 

a[k] := phi; 

wlk] := 2X wlkl X wf(phi) X phil/capn 
end; 


comment The piece of program extending from this point to 
the second following comment generates the coefficients a, , 
b,41(7=0, 1, -++ , n—1) in the recurrence relation 
Pr+i(x) = ((~—Gr) p+ (x) —b pri ()) /D rat 
for the (discrete) orthononormal polynomials p, associated 
with the inner product 


capn 


If, g] = > wef (cK) g (xx). 
The content of b[0] is set equal to 1/po ; 
continue: sum := 0; 
for k := 1-step 1 until capn do sum := sum + wk]; 


b[0] := sqrt(sum); 
for k := 1 step 1 until capn do 


begin 
pl{k] := 0; p2[k] := 1/6(0] 
end; 
for r := 0 step 1 until n—1 do 
begin 
sum := 0; 
comment If a= —, or b = ~, overflow conditions may 


arise in the following two for-statements, which, if ignored, 
should normally be of no consequence; 
for k := 1 step 1 until capn do 


begin 
_ polk] := pilk]; pik] := p2[k]; 
sum := sum + wk] X z[k] X pl[k] X pl[k] 
end; 
alr] := sum; sum := 0; 
for k := 1 step 1 until capn do 
begin 


p2{k] := (e[k]—a[r]) x pl[k]—b[r] xX pOlk}; 
sum := sum + wk] X p2[k] K p2ik] 
end; 
b[r+1] := sgrt(sum); 
for k := 1 step 1 until capn do p2(k] := p2[k]/b[(r+1] 
end; 
comment Using the values of a,, 6-41 just obtained, the pro- 
cedure now produces upper and lower bounds for the zeros of 
pn(t) when b = ©, or a = —, respectively. The bounds 
are derived by applying the Gershgorin circle theorem to 
the Jacobi matrix associated with the polynomials p,; 
if — finite right then 
begin 
upper bound := a[0}] + O61]; 
for r := 1 step 1 until n — 2 do 
begin 
tO := alr] + blr] + b[r+1); 
if (0 > upper bound then upper bound := 10 
end; 
10 := a[n—1] + b[n—1]; 
if (0 > upper bound then upper bound := 10 
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end; if 0 < list{m—1] V’ 10 = list[m] then 
if — finite left then begin 
begin if count = 0 then 

lower bound := a[0] — 6[1]; begin 


t0 := list[m—1]; 
p and pi (false, 0, k, (0, p2, p, pl); 
pol0 := pik]; q:= 2X (list(m]—listim—1)); 


for r := 1 step 1 until n — 2 do 
begin 
tO := afr] — b[r] — d[r+1]; 


Newton: t1 := 10; 


wt := tt +1; 
pand pl(true, 0, k, tl, p2, p, pl); 
10 := t1 — p[k]/pl1{k]; 


if 7t > 30 then 
begin 


if 10 < lower bound then lower bound := (0 search: fi := 0+ q; 
end; p and pl (false, 0, k, #1, p2, p, pl); 
tO := a[n—1] — b[n—1]; poll := pik); 
if 10 < lower bound then lower bound := i0 if pol0 X poll > 0 then 
end; begin 
comment The remaining section of this procedure determines =O; 
approximations of the desired abscissas and weights. If se- if (0 < list{m] then go to search else 
quential is true, the zeros of the (discrete) orthonormal poly- begin ‘ 
nomials p,(r=1, 2, --+ , m) are determined sequentially using outstring (1, 'search for zeros unsuccessful ); 
Newton’s method. Suitable initial approximations are found outinteger (1, k); outinteger (1, m); 
on the basis of the interlacing property of the zeros. Each go to exit 
Newton approximation is checked on whether or not it satis- end 
fies this property. If not, the appropriate subinterval is end 
searched more thoroughly for possible zeros. If none is de- else 
tected the message ‘‘search for zeros unsuccessful’’ is printed begin 
out. Otherwise, Newton’s iteration is repeated with a revised 10 := (+t1)/2; count := count + 1; 
initial approximation. If again the interlacing property turns go to Newton 
out to be violated the message ‘‘interlacing property of the end 
zeros 18 violated”’ is printed out. The message ‘‘Newton itera- end 
tion diverges’’ is printed if, for any reason, Newton’s itera- else 
tion fails to converge within 30 iterations. In either of these begin ae 
abortive situations the procedure exits, leaving the current outstring (1, ‘interlacing property of zeros is vio- 
quadrature formula, and all subsequent formulas, uncom- lated’); 
pleted. outinteger (1,k); owtinteger (1, m); 
In the nonsequential case, the zeros of pz are obtained by go to exit 
Newton’s method and successive deflation. Each deflation end 
(except the first) is preceded by a refinement of the respective end; 
zero using Newton’s iteration based on the original (unde- if it > 30 then 
flated) polynomial p,. If this iteration fails to converge begin 
within 15 iterations the message ‘‘Newton iteration in re- outstring (1, 'Newton iteration diverges ); 
finement diverges” is printed out. If Newton’s method for outinteger (1, k); outinteger (1, m); 
the deflated polynomials fails to converge within 30 itera- go to exit 
tions, it is checked whether this may be due to the tolerance end; 
eps being too stringent, considering the presence of subtrac- if abs(t1—t0) > eps X abs(t0) A abs(tl1—t0) > eps 
tion errors in the generation of the polynomials and their /\abs(t0) > eps then 
derivatives. If this is the case, the procedure goes on to re- go to Newton; 
fine the particular zero. Otherwise, it prints out the message results [k, m] := (0; 
“Newton iteration diverges.” In either of the two abortive p and pi (false, 0, k—1, 10, p2, p, pl); 
situations the procedure exits, leaving the quadrature for- sum := 0; 
mula unfinished. for r := 0 step 1 until k — 1 do 
The weights are computed by the formula sum := sum + plr] X pir]; 
nee results|n+1—k, n+2—m] := 1/sum 
[wt = DO pales) 2; — 
8==0 end 
end 
p2[—-1] := 1; for k := 0 step 1 until n — 1 do p2[k] := 0; elas 
if sequential then begin 
sei pl~1] := 1; 
list[0] := lower bound; for k := 0 step 1 until n — 1 do pik] := 0; 
results [1, 1] := a[0]; results[n, n+1] := b[0] & 5[0]; 10 := lower bound; 
for k := 2 step 1 until n do for m := 0 step 1 until n — 2 do 
begin begin 
for m := 1 step 1 until k — 1 do for k := m — 1 step 1 until n — 1 do p0[k] := plk); 
listim] := results[k—1, m]; it := 0; 
list{k] := upper bound; Newtonl: tl := (0; 
for m := 1 step 1 until k do it := it +1; 
begin ; . ; pand pi (true, m,n, tl, p0, p, pl); 
10 := (list(m]+list{m—1))/2; count := tt := 0; 0 := t1 — p[n]/pl[n); 
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cO := abs(p0[n—1]); 
cl := abs((tl—a[n—1]) X p[n—1]); 
c2 := abs(b[n—1]X p[n—2]); 
phi := if c0 S cl then 
(if cl Sc2 then c2 else c1) 
else 
(if cOSc2 then c2 else c0); 
phi := phi/db[n]; 
c0 := abs(p[n—1)); 
cl := abs((tl—aln—1]) X pl{n—1)); 
c2 := abs(b[n—1]X pl[n—2]); 
phil := if cC0 S cl then 
(if clSc2 then c2 else cl) 
else 
(if cO S c2 then c2 else c0); 
phil := abs(phi1/(b[n]X pl{n])); 
phi := if phi < phil then phil else phi; 
epsl := if phi > 1 then 10 X phi X eps else 10 X eps; 
if abs((1—t0) > epsl XK abs(t0) / abs(i0) > epsl then 
begin 
outstring (1, 'Newton iteration diverges'); 
outinteger (1, m+1); 
go to exit 
end 
end 
else 
begin 
if abs(t1—t0) > eps X abs(t0) A abs(t1—(O) > eps 
/\abs(t0) > eps then 
go to Newtonl 


end; 
if m > 0 then 
begin 
at := 0; 
refine: tl := 10; 
weiss tt +1; 


p and pl(true, 0, n, tl, p2, p, pl); 
0 := tl — plnj/pl{[n]; 
ifit > 15 then 
hegin 
outstring (1, 'Newton iteration in refinement diverges'); 
outinteger (1, m+1); 
go to exit 
end; 
if abs(t1—t0) > eps X abs(i0) A abs(tl—t0) > eps 
/\abs(t0) > eps then 
go to refine 
end; 
results[1, m+1] := 10; 
p and pl(false, m, n—1, 10, 0, p, pl) 
end; 
results[1, n] := afn—1] — b[n—1] X p[n—1]/p|n—2]; 
for k := 1 step 1 until n do 


(iii) wf@) = [(—2?)d—h'e?) 9, 
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(i) wf(z) = x* Inle/x),0 < «2 < le = 0(1)3,.5,—.5, 
capn = 100. The maximum absolute error (rounded to 


significant figures) in the abscissas and weights is shown be- 
low together with the values of k and r at which the maximum 
1<r<k). For comparison we used the 7-118 


occurs (l<k<n, 
values published by V. I. Krylov and A. A. Pal’cev [5]. 


eS maximum error k - maximum error k ‘ 
in abscissas in weights 

0 1.10i9—6 2 1 5.6310 —6 1 1 

1 2.5419—-7 5 4 3.9019—7 5 3 

2 7.5819—7 5 4 9.5310—7 5 3 

3 3.8819—7 4 3 2.8710—7 4 3 

5) 6.7610—-7 4 1 5.4610—7 5 2 

—.5 1.8610—3 1 1 5.9719 —-2 1 1 

Note the relatively large errors for a = —}; using the modi- 


fication mentioned in the sixth comment, these errors are 
slightly reduced to 6.77 10-4 and 2.17,9-2 respectively. 


(ii) wf(x) = Infe/—z)) Infe/z), O< « <1, n = 5, capn = 


100,200,400. Comparing the results with 118 values given by 
V. I. Krylov and A. A. Pal’cev [5] the following absolute 
errors were observed. 


coon aia error ee ate 
100 9 .3319—7 3 1 1.1310—5 1 1 
200 2.3219—7 3 1 2.8li0—6 1 1 
400 5.80:10—8 3 3 7.0410-7 1 1 


-l1<2<1, k= .1(.2).9, 
.99, n = 10, capn = 100. The weight factors (and, indirectly, 


the abscissas) were checked by comparing the sum >.) wx” 


with the zero-order moment 
1 
m = [ [1 — 2?)(1 — k®x?)- 1? dx = 2K(k). 
1 


The moments mp , and the observed discrepancies, are shown 
below, for the versions with and without the modification 
mentioned in the sixth comment. 


error (without 


k mo error (with mod.) Sods) 
l 3, 1494911230 1.16:0—10 6.9310 —3 
3 3.2160972399 1.7519—10 7.2310—3 
a3) 3.3715007097 5.8219—11 7.9619—3 
By A 3 .6913879968 4.071.—10 9.6610—3 
9 4 .5610982769 4.6610—10 1.58:19—2 
.99 6.7132010474 1.1619—10 4.8819—2 


(The elliptic integral K(k) was computed from a 6th-degree 
polynomial approximation due to W. J. Cody [2].) The rather 
dramatic improvement due to the modification is well worth 
noting. The positive abscissas and corresponding weights 
for k = .5, as obtained by the modified procedure, are given 


0 


n= 5, 


3 


begin 
p and pl (false, 0, n—1, results[{1, k], p2, p, pl); pelon: 
sum := 0; ao) wt 
for r := 0 step 1 until n — 1 do 6 .15746 64996 .31717 65527 
sum := sum + plr] X pir]; 7 .45647 98649 .32407 60350 
results[2, k] := 1/sum 8 .70963 75175 .33617 78803 
end 9 .89237 18385 .384961 83201 
end; 10 .98787 25254 ~° .35870 15666 
exit: end Gauss; 4 a ua ‘a is 


By symmetry, 2%; = —2nyi-r, Wr = Wasi-r (r=1,2, 


comment The procedure Gauss, in both the sequential and non- 
sequential form, was tested on the CDC 3600 computer for a 
number of weight functions. The tolerance eps = .510.-9 was used 
throughout. The following surveys the results obtained in a 
few representative cases. 


seen). 7 
(iv) wf(e) = 1/((at+e2)V/z), O<2<1, » = 1,.1,.01, n = 10, 
20, capn = 800. (The abscissas are the squares of the ab- 
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scissas of the 2n-point formula corresponding to wf (2) = 
1/(x?+n?2), —1 <2 <1, while the weights are twice those 
of the 2n-point formula.) The moments mx satisfy 


2 1 
-are tan[-— }, 
ru be 


(9) 
hd 


2k —-1 


ll 


mo 


Mk = — wm (k = 1,2,-++ ,2n — 1). 


Shown below are the maximum relative errors in the moments 


Mr , i.e. 
nN 
(5, stem) Jn 
r=1 


Invariably, the maximum was attained for k = 2n — 1. 
Again, the modification mentioned in the sixth comment was 
used, 


Tm = max 
0<k<S2n-1 


. 


uh n tn bh n ' 
1.0 10 6.11i0—6 1.0 20 1.25:0—-5 
wl 5.9510 —6 l 1.2419—5 
.O1 5.9419—6 OL 1.24195 


(v) wf(z) = Ei(z) = JP et dt/t, 0 <a < 0, n = 20, capn = 
160,320,640. The maments in this case are given by m = 
k\/(k+1). Shown below are the maximum relative errors rn 
in these moments. The maximum invariably occurred at 


k = 0. 
capn reo 
160 2.2019--6 
320 5.5010--7 
640 1.3710--7 


Because of the intrinsic interest of this quadrature formula 
in transfer problems [1] we list below the abscissas and weights 
obtained with capn = 640, but rounded to 8 significant digits. 


r ap 20 wi20 7 oo (20 we) 

1 = .041573069 .33006847 11 13.919556 1.737364610—7 
2  .27423961 .338501883 12 16.969573 7.719701410—9 
38 73521299 . 20272710 13 20.415565 2.328565315—10 
4 1.4364648 .090679419 14 24.304884 4.549550719—12 
5 2.3868423 .031192649 15 28.701954 5.4035520:.—14 
6 3.5949493 8.396817310—-3 16 33.698290 3.567300310.—16 
7 5.0704204 = 1.7051956:0.-3 17 39.4313867 1.1447950.:0—18 
8 6.8247452 2.6719289:0>—4 18 46.128447 1.4384158319—21 
9 8.8719945  3.152252719—-5 19 54.222968  4.633740719—25 
10 11.229631 2.751164519-6 20 64.825944 1.36239861.—29 


(The exponential integral E(x) was evaluated by the series 
expansion E,(z) = —y — Inz — SO%, (—1)*"/(nn!), if 
0 <a < 2, and from a rational approximation due to Iast- 
ings [8, formula 5.1.56], if « > 2.) 

(vi) wf(z) = |x |"e*, ~0o <xr< wo, a= 1, 2, 3, n = 20, 
capn = 200,400,800. Shown below are the maximum relative 
errors of the abscissas and weights as compared with values 
tabulated by A. H. Stroud and Don Secrest [6]. 


maximum errors maximum errors 


s capn in abscissas in weights 
1 200 6.3lio—3 2.11lip—1 
400 2.7310—6 6.8919—5 
800 6.5919 —7 7 .A0i0—7 
2 200 1.1919 —2 4.30:19—1 
400 1.1219—6 $.4419—9 
800 4.6610-—10 2.2719 —8 
3 200 1.6719—2 ».5910— 1 
400 2.69:9—6 7 .9310—-9 
800 4.5310— 10 2.1210—8 
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end 
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GAUSSIAN QUADRATURE FORMULAS [D1] [Walter 
Gautschi, Comm. ACM 11 (June 1968), 432] 

I. D. Hitt (Recd. 12 Sept. 1968) 

Medical Research Council, Computer Unit (London), 
London, N.1, England 


KEY WORDS AND PHRASES: quadrature, Gaussian quadra- 
ture, numerical integration, weight function, orthogonal poly- 
nomials 


CR CATEGORIES: 5.16 


1. On pages 434 and 435 there are five strings, all of which have 
identical opening and closing string quotes.' and' should be re- 
placed by ‘and’ in each case. 

2. No space symbols appear in these strings. u should be in- 
serted in each space. Otherwise, no spaces will appear in the printed 
messages. 

3. In the second string, the hyphen in the word ‘‘violated”’ 
should be deleted. 

4. In the first column of page 433 there appear: 

kmazx := entier(capn/2); 
and 

if capn/2 ~ kmax then 
Both these are critically dependent upon rounding error in the real 
division. Presumably, 

kmax := capn + 2; 
and 

if capn ¥* 2 X kmax then 
are intended. 

5. A semicolon is necessary before the final end (on page 436). 
As things stand, this end is part of the comment, and the algo- 
rithm never finishes. 

Alternatively, the semicolon after end Gauss, two columns 
earlier, could be deleted (in which case the symbol comment 
could also be deleted if desired, but need not be). If this were done, 
the final end would terminate the comment without the need for 
a preceding semicolon. 
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REMARK ON ALGORITHM 331 [D1] 

GAUSSIAN QUADRATURE FORMULAS [Walter 
Gautschi, Comm. ACM 11 (June 1968), 432-436] 

Wiutuiam R. Wisk, Jr.* (Recd. 28 Jan. 1970 and 2 Mar. 
1970) 


Box 35348, Georgia Institute of Technology, Atlanta, 
GA 30332 


*Work performed at Danish Atomic Energy Commission, Re- 
search Establishment Riso, Reactor Physics Department, 
Computer Group 


KEY WORDS AND PHRASES: quadrature, Gaussian quadra- 
ture, numerical integration, weight function, orthogonal poly- 
nomials, Newton’s method, successive deflation 

CR CATEGORIES: 5.16 


The last Gaussian point calculated in the nonsequential meth- 
od, being the root of a linear equation, is calculated directly rather 
than by Newton’s method. In doing so it misses out on the refine- 
ment process. 

If the m-loop is extended to include this last point also (and of 
course the direct calculation deleted), the results agree more 
closely with those given by Stroud and Secrest [1]. The following 
corrections will achieve this: 

On the 4th line before the line labeled Newion 1 replace: 


for m := Ostep 1 until n — 2 do 


by: 

for m := 0 step 1 until n — 1 do 
Delete the 17th line following the line labeled refine, which now 
reads: 


results [1, n] :=a[n — 1] — b[n — 1] X p[n — 1]/p[n — 2]; 

The following change may be made but is not necessary: 

After the 14th line following the line labeled refine insert the 
line: 

ifm <n — 1 then 


Since this only deletes the call for m = n — 1, which is almost de- 
generate, it usually proves to be a bigger waste of time to include 
the comparisons rather than to have the unnecessary procedure 
call. 

Table I shows test examples indicating the difference between 
the last Gaussian point computed directly and by including the 
last Gaussian point in the m-loop. 


TABLE I. Gauss Pornts 


Limits 
Weight function Tower Upper ™ (a) Computed Stroud and Computed 


with change Secrest [1| without change 


l+¢z —1 - 955041232 - 955041227 - 955041260 


1 7 

(+2) a1 1 7  .959734457 . 959734452 . 959734490 
(i — 22)3 -1 1 7  .876922576 876922518 . 876922615 
(1 — 22)-} @) —1 1 7 .974927912 974927912 .974927938 
1 —1 1 10  .973906536 973906529 .973906610 
|| —1 1 7  .954679076 . 954679025 . 954679111 
e? 0 o 7 19,3958995 19. 3957279 19. 3958995 


(a) A capn of 70 was used excepted for n = 10 in which capn = 90 was used, eps = 
1.0010 —9 throughout. 
(b) The change for square root singularities suggested in comment 6 was used. 


REFERENCE: 


1. Stroup, A. H., anp Srcrest, Don. Gaussian Quadrature 
Formulas. Prentice-Hall, Englewood Cliffs, N.J., 1966. 
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ALGORITHM 332 1. Bateman Manuscript Project, Calif. Inst. of Tech. Higher 
JACOBI POLYNOMIALS [S22] Transcendental Functions, vol. 2. McGraw-Hill, New York, 
Bruno I. W. Wirrs (Recd. 2 Aug. 1967, 11 Oct. 1967, 1953, pp. 168-174. 
8 Dec. 1967, 18 Jan. 1968) 2. Szmaé, G. Orthogonal Polynomials. Collog. Publ., vol. 23. 
U.S. Navy Electronics Laboratory Center, San Diego, diate Mathematical Society, New York, 1939, pp. 136- 
California 92152 3. Stroup, A. H., ANp Secrest, D. Gaussian Quadrature Formu- 
KEY WORDS AND PHRASES: Jacobi polynomials, orthogonal las. Prentice-Hall, Englewood Cliffs, N. J., 1966, pp. 17-31. 


polynomials, three-term recurrences, special functions 
CR CATEGORIES: 5.12 


comments JACOBI evaluates in double-precision the Jacobi 
polynomial F = P,(z), defined by Rodrigues’ formula 


20-nts Paw) = (—1)"(1—a)-*(1-+2)-8- Da (1-2) (1+-2) 9], 


for degrees n from 0 through 25, and for the given values of the 
double-precision arguments a, 8, and x. The subroutine uses the 
three-term recurrence relation (see, for example, [1, p. 169]): 


P;(z) = (U;+V;-x)- Pj-1(2) = W,;-Pj—2(2). (1) 


Also calculated are the derivative FD = dF /dz and estimates of 
the relative errors EF and ED of F and FD. U,;,V;, and W; 
are computed only once whenJACOBI is called repeatedly with 
the same values of & and £. 

To explain the method for finding EF and ED, we refer to the 
two recursions (2) and (8) below: 


PL SOP, = WeP es, (2) 
Q, = H-Q;-1 — W;-Qi-2 + 8. (3) 


Relation (2) is an abbreviated form of (1); relation (3) describes 
a parallel recursion for a sequence of error-perturbed polynomial 
values Q; which does two things: (a) it propagates previous 
errors of the polynomial values P;_, or Q;-1 , and P;~2 or Q;-2 
into Q; ; and (b) it includes the effects of two errors generated 
‘Jocally” at the j step: the error of G which is included in 7, 
and the error s which arises when forming the difference itself 
in (2). The error s is estimated from s = max(F, , E2), where 
&, and FE, are the magnitudes of the errors of the two terms on 
the right side of (2),i.e.#, = | #,-Pj-1| and BH, = | By - Pj-2 |. 
Here E, is the error of G, and EZ. is the error of W;. FE, and Ey 
are estimated from E, = max(|y-U;|, , y-V;-x|) and Ey = 
|y-W; |. The value 3E-26 given to y in the DATA statement 
reflects the accuracy of the CDC-1604. A in (3) is given as H = 
G+ HE, . Finally, the relative error F of P,is obtained from EF = 
11 — Q2/Pu |. 

One might argue that the use of (3) could have been avoided 
if the error of P;_; had been taken into account in the evaluation 
of EF, , and the error of P;_2 in the evaluation of H, . However, 
in numerical tests this led to serious instability in the vicinity 
of the zeros of P, because of correlations between the errors. 


Algorithm 332 is the first algorithm written in FORTRAN to be pub- 
lished in the Algorithms department of Communications of the ACM. 
The department policy was extended to allow for algorithms in FOR- 
TRAN in August 1966. (For details see September 1966 issue, page $83.) 
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SUBROUTINE JACOBI 
HARARE REET EE 


* 


DOUBLE PRECISION 


x x 


* 


REAL 


INTEGER 


DIMENSION 


IF 


(DEGREE «EQe0) 


( DEGREE sALFAyBETA»X sF #FDy Eo ED) 


AsALF sALFA» 
BsBET sBETAsCoDoFoFDy 
GoHoP sPDsQsQD, 
T1leT2oUsVoWwex 


EsEDsEGsE1sE295eY 

I sJoKsMeN»DEGREE 
U(25) 9 VI25)9 W(25)¢ 
P(25) ePD(25)5 
Q(25)sQD(25) 

M /-2 /9 

ALF /~2D+00/ 9 
BET/-2D+00/ 

Y /+3&-26/ 


GO TO 8 


IF ((ALFAsNE eALF) 
(BETAeNEsBET)} GO TO 1 


HeARe 


IF 
I 
K 
M 


IF 


(DEGREE «LE eM) 


Hondu 


GO TO 5 


M 
DEGREE=1 
DEGREE 


(I=2) 29 39 3 


CALCULATE THE U(J)soV(J)sw(J) IN 


THE RECURRENCE RELATION 


P(J)s 


P(IJ-1)# (UCSD 4+V (I) RX) =P (J-2 HWE) 


M = DEGREE 
ALF = ALFA 
BET = BETA 
A = ALF+BET 
B = ALF-BET . 
UC1) = B/26« 
VC1) = LletA/20 
W(1) = ODO 
IF (DEGREEsEQe1) GO TO 5 
U2) = A*BH#(At3¢)/(Get(At+20) #¥2) 
VIZ) = (At3Be) #( Ate) / (Get (At+20)) 
WO2) = CLetALF)¥( Le tBET) #(At4e) 
WO2) = WO2)/( 20 (At20 )R*2) 
I = 2 
K = DEGREE-1 
3 IF ((DEGREE«EQe2) 
ReORe ( TeGTeK 4) GO TO 5 
DO 4 J = I9K 
A = 2*J+2 
D = ALF+BET 
A = A+D : 
B = D¥(A-1.)*(ALF-BET) 
C a J+] , 
Cc = 2etC#(A-20)*( C+D) 
U(J+1)= B/C 
D = A®(A~1le)*(A—26) 
V(J+t1)= D/C 
D = J 
A = 2¢%*(D+ALF)*(D+BET)* 
W(JtlL)= A/C 
4 CONTINUE 


c 


C 


FIND THE 
AND J=2 
5 Tl = V 
P(1) U 
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STARTING VALUES FOR J2=1 
FOR USE IN THE RECURSION ee 
(1)*xX 
(1)4T1 


S = Y#DMAX1(DABS(U(1)j)> 


* 
Q(1) = P 


DABS(T1)) 
(1)+S 


PD(1)= V¢l) 
QD(1)= V1) 


IF (DEGREE «EQe1) 


Ti 

G 
£G 

* 

H 

T1 
El 
P(2) 
S 

S 
Q(2) 
PD(2)= 
QD(2)= 


nots 


‘IF (D 


USE TH 

DO 6 
T2 
G 
EG 


H 
Ti 
T2 
El 
E2 
P(J 
S 
QtJ 
PD( 
QD ( 
PD( 
QD { 
6 CON 


PREPAR 
7N = 


GO TO 7 
V2) *X 
U(2)+T1 
Y*#DMAX1(DABS(U(2))» 
DABS(T1)) 
G+EG 
G*P(1) 
DABS(EG#P(1)) 
Tl-wt2) 
Y*DABS(W(2)) 
AMAX1 (E195) 
H¥Q(1)-W(2)45S 
G¥PD(1)I+V(2)*P(1) 
H*QD(1)+V(2)*#Q(1) 


EGREEseEQse2) GO TO 7 


E RECURSION ee 

J = 3,DEGREE 

VCS) EX 

U(J)+T2 

Y*DMAX1(DABS(U(J)) > 
DABS(T2)) 


G+EG 

G¥P(J-1) 

W(J)#P(J-2) 

DABS (EG#P(J~1)) 
DABS(T2)*Y 

T1-T2 

AMAX1(E19E2) 
H¥*Q(J-1)-W0 JS) #Q(J-2 94S 
G¥PD (J-1)-W( JS) #PD( J—2) 
J) H#QD ( J-1)—-WU) #QD( J-2) 
J) PD(J)4+V( J) *P( J-1) 

J) = QD(J)4V0I) FQ( 5-1) 
TINUE 


) 


) 
J) 


mrioinww b sw tonn 


E THE OUTPUT . oe 

DEGREE 

P(N) 

Y+DABS(P(N)—~Q(N)) 
/DABS(F) 

PD(N) 


= Y+DABS(PD(N)—QD(N)) 


/DABS(FD) 


0 
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REMARKS ON: 

ALGORITHM 332 [S22] 

JACOBI POLYNOMIALS [Bruno F. W. Witte, Comm. 
ACM 11 (June 1968), 436] 

ALGORITHM 344 [814] 

STUDENT'S t-DISTRIBUTION [David A. Levine, 
Comm. ACM 12 (Jan. 1969), 37] 

ALGORITHM 351 [D1] 

MODIFIED ROMBERG QUADRATURE [Graeme 
Fairweather, Comm. 12 (June 1969), 324] 

ALGORITHM 359 [G1] 

FACTORIAL ANALYSIS OF VARIANCE [John R. 
Howell, Comm. ACM 12 (Nov. 1969), 631] 


Artuur H. J. Saute (Recd. 16 Feb. 1970) 
Basser Computing Department, University of Sydney, 
Sydney, Australia 


KEY WORDS AND PHRASES: Fortran standards 
CR CATEGORIES: 4.0, 4.22 


An unfortunate precedent has been set in several recent al- 
gorithms of using an illegal FORTRAN construction. This oon- 
sists of separating an initia] line from its continuation line by a 
comment line, and is forbidden by the standard (see sections 3.2.1, 
3.2.3 and 3.2.4 of [1, 2]). The offending algorithms are to date: 
332, 344, 351 and 359. 

While this is perhaps a debatable decision by the compilers of 
the standard, and trivial to correct, it seems a pity to break the 
rules just for a pretty layout as has been done. 

REFERENCES: 

1. ANSI Standard FORTRAN (ANSI X3.9-1966), American 
National Standards Institute, New York, 1966. 

2. FORTRAN vs. Basic FORTRAN, Comm. ACM 7 (Oct. 1964), 
591-625. 


Remark on Algorithm 332 [S22] 
Jacobi Polynomials [Bruno F.W. Witte, Comm. ACM 
I] (June 1968), 436] 


Ove Skovgaard (Recd 23 April 1974 and 22 July 1974) 
Institute of Hydrodynamics and Hydraulic Engineering, 
Technical University of Denmark, DK-2800 Lyngby/ 
Denmark 


In the last section of Algorithm 332, there are the following 
statements: 


E = Y + DABS(PD(N) -— Q(N))/DABS(F) 


where E should be an estimate of the relative error of the computed 
value F (Jacobi polynomial) ; 


ED = Y + DABS(PD(N) — QD(N))/DABS(FD) 


where ED should be an estimate of the relative error of the com- 
puted value FD (derivative of the polynomial). 

The value of F or FD can be zero, but they are not checked in 
the program. In addition the above statements are not in accordance 
with the formulas for the relative errors, which are given by Witte 
in the comments which precede the program. 
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A reasonable modification of Algorithm 332 is: (i) calculate 
absolute errors (instead of relative errors) if F or FD is close to zero 
(here is used |F| < y or | FD| < y); (ii) otherwise assign the 
relative errors E and ED in accordance with the formulas E = 
11 — Q,/P,| and ED = |1 — QD,/PD,|; and (iii) add two 
flag variables FLAGF and FLAGFD indicating what kind of error 
(absolute, relative, or no error) is estimated. The variable FLAGF 
corresponds to the error E of F. The variable FLAGFD corresponds 
to the error ED of FD. 

The two flag variables FLAGF and FLAGFD are assigned the 
values 0, 1 or 2: 

If a relative estimate of the error is used, the flag is assigned the 
value 0. If an absolute estimate of the error is used, the flag is as- 
signed the value 1. If DEGREE = 0, both the errors are equal to 
zero, and the flags are assigned the value 2. 

The following corrections should be made in the program: 


The first statement in the subroutine should read: 


SUBROUTINE JACOBI 
* (DEGREE,ALFA,BETA,X,F,FD,E,ED,FLAGF, FLAGED) 


The declaration of the integer variables: 
INTEGER I,J,K,M,N, DEGREE 
should read 
INTEGER I,J,K,M,N,DEGREE,FLAGF,FLAGFD 
The first /F in the program: 
IF (DEGREE.EQ.0) GO TO 8 
should read 
IF (DEGREE.EQ.0) GO TO 10 
The last section (‘‘Prepare the output’’) should read: 


Cc PREPARE THE OUTPUT oe 
7N = DEGREE 
F = P(N) 
IF (OABS(F )eLTeY?) GO TO 8 
FLAGF=0 
E = DABS(14e¢-Q(N)/F) 
GO To 9 
8E = DABS(F-Q(N)) 
FLAGF=1 
9 FO = PD(N) 
IF (DABS(FD) lL TeY) Go TO 11 
FLAGFD=0 
ED = DABS(1.-QD(N)/FD) 
Oo 12 


10 F 


O 12 
1l ED =DABS(FO-QD(N)) 
FLAGFO=1 
12 RETURN 
END 


The value 3£-26 given to-y in the DATA statement reflects, ac- 
cording to the author, the accuracy of the CDC-1604. The author 
gives no information how one can calculate this constant from the 
given computer parameters (radix, number of digits in the mantissa 
and information whether the machine is doing the chopping or 
rounding). The constant y must be some sort of ‘“‘machine epsilon,” 
e.g. the smallest number (provided by the implementation and the 
chosen precision) for which 


l+y>1. (1) 
According to e.g. [4 pp. 7-9], we have 


B'~* chopping, 
m Te i 2 rounding, @2) 
where @ is the radix or base for the floating point numbers and f¢ is 
the number of digits (with radix 8) in the mantissa of the floating 
numbers. In [2] algorithms and corresponding programs (in For- 
tran) are published which for any ‘‘reasonable’”’ floating point com- 
puter compute the radix, number of digits of used floating-point 
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numbers, and determine whether rounding or chopping is done. by 
the machine, see also [5]. The CDC-1604 has according to e.g. [3] 
binary base, i.e. 8 = 2 with a normal word-length of 48 bit. 
The word is divided into an exponent with 12 bit and a man- 
tissa or fraction with 36 bit. For the double-precision calcula- 
tions the rounding CDC-1604 has therefore 
t = 36 + 48 = 84, ie. y = 16 X 2Qh%i= 5.2 X 107%, 
With these modifications Algorithm 332 ran successfully on an 
IBM 370/165 with operating system 21.6, and with the IBM For- 
tran IV G compiler. For double-precision calculations on this chop- 
ping computer we have: 6 = 16,¢ = 14,ie.y = 16-8 = 2,2 
10716, see [1 p. 163]. 
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ALGORITHM 333 
MINIT ALGORITHM FOR LINEAR 
PROGRAMMING [H] 
Ropotro C. SALAZAR AND SuBRATA K. SEN 
(Recd. 26 June 1967 and 25 Jan. 1968) 
Graduate School of Industrial Administration, Carnegie- 
Mellon University, Pittsburgh, Penna. 15213 


KEY WORDS AND PHRASES: linear programming, dual 
simplex method, primal problem, dual problem 
CR CATEGORIES: 5.41 


real procedure MINIT (m, n, p, eé, td); 
integer m,n, p; array e; real td; 
comment MINIT(MINimum [Terations) is designed to solve a 


linear programming problem of n variables and m constraints 
of which the last p are equality constraints. The problem can 
be stated as follows: 


ll 
e 


Maximize 2 
subject to AX Sb 
X20 


c is a (1Xn) row vector, X is a (nX1) column vector, A is a 
(mXn) matrix, and b is a (mX1) column vector. e is a matrix 
with (m+1) rows and Icol columns (where lcol=m+n—p+1) 
and forms the initial tableau of the algorithm: 


Pe yee a heed yA nlc he ie eh ns Icol 
1{,-¢Qdxn)| 0( Xm — p) 0 ) 
Identity (m — p) 
Matrix inequality 
A (m Xn) | (m — p X m— p) b constraints 
Zero (m X 1)||p 
Matrix equality 
: (p X m — p) constraints 
m+ Ek 


td is read into the procedure and should be a very small number, 
e.g. 10°8. The condition of optimality is the nonnegativity of 
ell, j] for 7 = 1,---, leol-1 and of e[z, lcol] for i = 2,--- , 
m-+ 1. If the e[z, 7] values are greater than or equal to —td 
they are considered to be nonnegative. The value of td should 
reflect the relative magnitude of the coefficient matrix. 

It should be noted that when equality constraints are present, 
the dual solution vector is not complete, i.e. the procedure does 
not compute the values of the dual variables corresponding to 
the equality constraints. However, knowing the optimal solu- 
tion to the primal problem and the values of the dual variables 
corresponding to the inequality constraints, it is a simple mat- 
ter to compute the values of the remaining dual variables. In 
the initial tableau, the elements of the vector b must be non- 
negative for the equality constraints. 

MINIT is based upon a technique suggested by Llewellyn 
|1] and is a specialized algorithm based on the principle of the 
dual simplex method. Llewellyn states that he has found the 
MINIT algorithm to be more efficient than any other method 
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he has used. MINIT’s efficiency is based upon the fact that the 
solution method confines the iterations to those constraints 
which are defining (equality constraints and those inequality 
constraints whose slack variables are zero in the optimal solu- 
tion). The algorithm starts with an infeasible solution as in the 
dual simplex method. When “‘greater than or equal to’’ con- 
straints are involved, it also starts with an incomplete solution 
since it avoids the use of artificial variables. This feature of the 
algorithm considerably reduces the number of iterations re- 
quired to obtain the optimal solution. Both the primal and dual 
problems are solved simultaneously and the pivotal element at 
each iteration is so chosen that there is a maximum increase in 
the functional value of the primal or a maximum decrease in 
the functional value of the dual. The details of the algorithm 
and a discussion of the theoretical reasons for its computational 
efficiency may be obtained from the reference cited below. 

The experience of the authors with the MINIT algorithm 
has been very satisfactory. For example, on a CDC G-21 com- 
puter, the Simplex code available in the Carnegie Tech. program 
library took 4 minutes 56 seconds to solve a 51 X 72 linear pro- 
gramming problem (consisting only of inequality constraints) 
while the same problem was solved in 2 minutes 58 seconds by 
the MINIT algorithm. For problems with mixed constraints, 
i.e. equality and inequality constraints, the advantage of the 
MINIT algorithm is even more pronounced. 


REFERENCE: 


1. LLEWELLYN, R. W. Linear Programming. Holt, Rinehart and 


Winston, New York, 1964, pp. 207-220; 


begin integer 1, j,k, L, im, jmin, jm, imax; real gmin, phimaz; 


integer array ind{l:lcol], indl[l:m+1], chk[2:m+1]; 

procedure results; 

comment prints out the output. The value of the functional 
is given by z. The optimal values of the variables are given by 
z|t] forz = 1, --- , m and the values of the dual variables are 
given by w[j] for j = 1,--+ ,m; 

begin real z; array z[1:n], w[l:m]; 


z :=e[1:lcol]; 

for 7 := 1 step 1 until n do z[7] := 0; 
for j := 1 step tf until m do w{j] := 0; 
for 7 := 2 step 1 until m + 1 do 
begin 


if chk[t] > n then chk[i] := 0; 
if chk[i] > 0 then a[chk[i]] := 
end; 
for 7 := n +1 step 1 until lcol-1 do w[j—n] := e[1, 7); 
comment Insert output statements to print out z, z[Z], 
and w[j], for example, the following six statements sepa- 
rated by semicolons: (1) outstring (1, 'value of the func- 
tional’), (2) outreal (1, 2), (3) outstring (1, ‘optimal values 
of the variables'), (4) owtarray (1, x), (5) outstring (1, ‘values 
of the dual variables'), (6) outarray (1, w); 
go to LAST 
end results; 
procedure rowtrans(im, jmin); 
integer 1m, jmin; 
comment performs the usual tableau transformations in a 
linear programming problem, (im, jmin) being the pivotal 
element; 
begin real dummy; 
if im = 0 then 


e[z, lcol] 
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begin 
comment Insert an output statement to print “no solu- 
tion’’, for example, the statement, outstring (1, 'no solu- 
tion'); 
go to LAST 
end; 
if jmin = 0 then 
begin 
comment Insert an output statement to print ‘‘no solu- 
tion’’, for example, the statement, outstring (1, 'no solu- 
tion’); 
go to LAST 
end; 
dummy := e[im, jmin]; 
for j := 1 step 1 until Icol do elim, j] := efim, j]/dummy; 
for 1 := 1 step 1 until m + 1 do 
begin 
if 7 ~ 1m then 
begin 
if elt, jmin] + 0 then 
begin 
dummy := e[t, jmin]; 
for j := 1 step 1 until lcol do 
elt, J] := elt, j] — elim, j] & dummy 
end 
end 
end; 
chk[um] := jmin 
end rowtrans; 
procedure progamma; 


comment performs calculations over columns to determine 


the pivot element; 
begin integer 7, [1; real theta, gamma; array thmin([l1ilcol]; 
integer array imin(1:lcol]; 
gmin := 10°; jmin := 0; 
comment gmin is set equal to a large number for initializa- 
tion purposes; 
for L1 := 1 step 1 until L — 1 do 
begin 
imin{ind[£L1}]] := 0; thmin[ind[L1]] := 108; 
for 7 := 2 step 1 until m + 1 do 
begin 
if e[z, zna(L1]] > td A elt, lcol] > -id then 
hegin 
theta := e[t, lcol}/e[z, ind[Z1]]; 
if theta < thmin|ind[L1]] then 
begin 
thmin|ind[Z1]} := theta; imin[ind[L1]] := 7 
end 
end 
end; 
if thmin[ind[L1]] = 10° then gamma := 108 
else gamma := thmin{[ind[L1]] x e[1, ind[L1]];: 
if gamma < gmin then 


begin 
gmin := gamma; jmin := ind{L1]} 
end 
end; 
if jmin > 0 then im := imin[jmin] 


end progamma, 

procedure prophi; 

comment performs calculations over rows to determine the 
pivot element; 

begin integer j, kl; real delta, phi; array delmaz[1:m+1]; 
integer array jmaz[1:m-+1]; 
phimax := —~ 108; tmax := 0; 
comment  phimaz is set equal to a small number for initial- 

ization purposes; 
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for kl := 1 step 1 until k — 1 do 


begin 
jmax(indl [k1]] := 0; delmaz{ind1 [k1]] := — 108; 
for j := 1 step 1 until lcol — 1 do 
begin 
if e{¢nd1[k1], j] < —td A el, 7] > -td then 
begin 


delta := e[l, j]/elindl1[k1], 7]; 
if delta > delmaz{indi[k1]] then 


begin 
delmax|indl[k1]] := delta; jmazfind1[k1]] := 7 
end 
end 
end; 
if delmax|ind1[k1]] = — 10° then phi := — 108 


else phi := delmaz{ind1|k1]] * e[tnd1[k1], lcol]; 
if pht > phimaz then 
begin 
phimax := phi; imax := indi[k1] 
end 
end; 
if zmax > 0 then jm := jmaz[imaz] 


end prophi; 
procedure phasel; 
comment applied only to equality constraints if any; 
begin integer 7; real theia, gamma; array thmin[1:lcol]; 


integer array imin(1:lcol]; 
for r := 1 step 1 until p do 
begin 

gmin := 108; L:=1; 


comment gmin is set equal to a large number for initial- 


ization purposes; 
for 7 := 1 step 1 until n do 


begin 
thmin|j] := 108; if e[1, 7] < 0 then 
begin 
indit] := 9; LD:=L+i1 
end 
end; 
if L = 1 then 
begin 
for j := 1 step 1 until n do ind[j] := 7; L: 
end; 
for k := 1 step 1 until L — 1 do 
begin 
for 7 := m—p+2step 1 until m + 1 do 
begin 
if chk{t] = 0 then 
begin 
if elt, end[k}] > 0 then 
begin 
theta := e[t, leol]/e[i, ind|k]]; 
if theta < thmin{ind[k]| then 
begin 
thmin|ind[k]] := theta; tmin[ind[k]] := 
end 
end 
end 
end; 


gamma := thmin{ind{k]] X e[1, tnd[k]]; 
if gamma < gmin then 


begin 
gmin := gamma; jmin := ind[k] 
end 
end; 
am := imin[jmin]; rowtrans(im, jmin) 
end 
end phasel; 


n+1 
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for 7 := 2step 1 until m + 1 do chh{:| = 0; 
if p = O then go to RCS else phasel; 
comment If there are any equality constraints in the problem 
the program first goes to phasel, otherwise it goes directly to 
RCS; 
RCS: L:= 1; k:= 1; 
for 7 := 1 step 1 until col — 1 do 


hegin 
if e(l. yg] < — éd then 
hegin 
indihj| i= yj; Lie Le +4; 
comment = ind|L] keeps track of the columns iu which e[1, /! 
is negative; 
end 
end; 
for ¢ := 2 step 1 until m + 1 do 
begin 
if e[7, fecol] < — td then 
begin 


mndk] = 7; Acs k +1; 
comment indllA] keeps track of the rows in which ef7, leoll 
is negative; 


end 
end; 
if LD =.1 then 
begin 
if k = 1 then results else 
begin 
iff = 2then 
begin 
for j := I step 1 until [col — 1 do 
begin 
if e{ind1[1], 7] < Othen goto R 
end; 


comment Insert an output statement to print ‘primal 
problem has no feasible solutions, dual objective fune- 


tion is unbounded", for example, the statement ové- 
string (1, ‘primal problem has no feasible solutions, dual 
objective function is unbounded’); 
go to LAST 
end else go to R 
end 
end 
else 
hegin 
if = 2 then 
hegin 
ifk = 1 then 
hegin 
for ¢ := 2 step 1 until » + 1 do 
hegin 
if e|7, ind|1|] > O then go to C 
end; 
comment Jnsert an output statement to print ‘primal 
objective function is unbounded, dual problem has no 
feasible solutions’, for example, the statement oul- 
string (1. primal objective function is unbounded, dual 
problem has no feasible solutions’) ; 
go to LAST 
end else go to § 
end; 
if k = 1 then goto C else go toS 
end; 
PR: propht; go to RCS; 
go to RCS; 


rowlrans(imayx, jm); 
C; progamma; rowtrans(tm, jmin); 
S: progamma; propht; 

if gmin = 10° then 
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begin 
rowlrans(imar, jm); goto RCS 
end; 
if phimar = -— 10° then 
hegin 


rowlrans(im, jmin); goto RCS; 
end; . 
if abs(phimar) > abs(gmin) then rovwelrans(imas, jm) 
else rowlrans(tim, jmin), 
go to RCS; 

LAST: end MINIT 
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MINIT ALGORITHAL FOR LINEAR PROGRAM- 

MING [Rodolfo C. Salazar and Subrata K. Sen, Comm. 
ACM 11 (June 1968), 487] . 

D. Kk. MessoHam (Reed. 27 Nov. 1968 and 28 Feb. 1969) 

Nelson Research Laboratories, The English [lectrice Co. 
Ltd., Stafford, England 


KEY WORDS AND PHRASES: linear programming, dual sim- 
plex method, primal problem, dual problem 
CR CATEGORIES: 5.41 


The procedure has heen tested with Marconi Myriad Algol, and 
it ran successfully when the following changes had been made. (the 
first is merely a misprint): 

1. The first statement in procedure results was changed 

from z := e[l : lcol]; 

to z:= ef1, lcol]; 

2. To satisfy an ALGoL 60 restriction that a type procedure 
should contain au assignment to its procedure identifier, the real 
on the first line of the procedure was removed. 

3. It is possible for the published algorithm to give incorrect 
results when it reaches a state in phasel where there are no possible 
pivotal elements in one column of the tableau. (For example, 
maximize — 21 — 2, — #3, with 2x, + 2, = 3 and #3 = 1, reaches 
this state.) To correct this the line in procedure phasel 

if gamma < gmin then 
was changed to 

if gamma < gmin A thmin [ind|k]]} < 106 then 
All the appearances of 108 in this algorithm should be written as 
106. 

The following improvements are also suggested : 

4. It is assumed that lcol is a global integer with the correct 
value. This was made unnecessary by adding lcol to the list of 
integers declared on the line immediately following the initial com- 
ment; the bounds of the array tnd, declared on the next line, were 
changed 

from {1 : lcol] 

to {l : m+n—p+t+1]); 

and lcol:=m+tn-—-pt+1; 
was inserted as the first executable statement of the procedure 
MINIT (after end phasel;). 

5. It is assumed that equality constraints will be given with 
positive right-hand sides. This restriction was overcome by insert- 
ing in the procedure phasel after the line 
[1 : 7col]; the following: 

for 7 := m— p+ 2 step 1 until m + 1 do 

if e[z, leol] < 0 then 

for 7 := 1 step 1 until [col do eli, j] := — efi, 7]; 


integer array imin 
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MINIT ALGORITHM FOR LINEAR PROGRAM- 
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A. Koim ano T. DAHLSTRAND (Reed. 15 Sept. 1969) 
Information Processing Department, ASEA. 8-721 83 
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When we tried to run the program on a GiE-625 computer it 
became apparent that the following, rather gbvious changes in 
the procedure phasel of the original program are necessary: 

1. The statement after the statement thmin(j] := 108; should 
begin with 
if e{1,j] < — td then --- 
2. The beginning of the statement following the statement if 
chk[1] = 0 then should be replaced by 
begin 
if e[t,ind[k]] > td then 
3. The statement gamma := thmin [ind{k]] & e[1,¢nd[(k]]; should be 
preceded by 
if thmin{ind{k]] = 108 then gamma := 108 else 
We also suggest that the parameters m, n, p, td of the procedure 
should be value-specified. 

After these corrections the procedure has been successfully 
tested in several problems. For problems of moderate size, which 
without further modifications can be solved by the procedure, 
the algorithm turned out to be most efficient. The numerical 
accuracy was also good. 


Remark on Algorithm 333 [H] 

Minit Algorithm for Linear Programming [Rodolfo 
C. Salazar and Subrata K. Sen, Comm. ACM I1 
(June 1968), 437-440] 


D. Obradovic* 

Boris Kidric Institute of Nuclear Sciences, 

11001 Beograd, Yuguslavia 

* Present address: Institute of Investment Research, 1100 Beograd, 
Yugoslavia. 


The procedure has been tested with CDC 3600 Algol, and it 
ran successfully when the following changes had been made in the 
procedure phase 1: 

1. After the line 
comment applied only to equality constraints if any; 
instead 
begin integer r; 
one has to introduce 
begin integer r, im1, jmin 1; 
2. After the line 
integer array imin[\:/col]; 
one has to introduce a new line 
im 1 := jmin 1 := 0; 
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3. The line 

if L = 1 then 
should read 

Li: if L = 1 then 
4. Last nine lines of the procedure phase | should be changed to 
read 

if thmin|ind[k]| < 10° then 

begin 

gamma := thmin{ind[k]] X e[1, tnd[k]; 

if gamma < gmin then 

begin 

gmin := gamma; jmin := ina[k]; 

end 

end; 

end; 

im := imin{jmin]; 

if im = im! A jmin = jmini| then 

begin 

L:=1; 

end; 

rowtrans(im, jmin) ; 

im1 := im; jminl := jmin, 

end 

end phase 1; 

These changes are necessary to avoid incorrect results in the 

case if after application of the procedure rowrrans all e[i, ind[k)] 
are negative as in the following example 


z= —0.9 x, — 1.255632 x2 + 0.925 x3 + 0.375 x4 
xi <2, i= 1,2,3,4 
2.19069 x, — 0.925 x2 — 0.325 x3 — 0.1875 x, = 0.76569 
Xy — 0.1 x3 + 0.740896x, = 1.640896 


when the published algorithm ignores some of the equality con- 
straints. 


go to LI 


Remark on Algorithm 333 [H] Minit Algorithm for 
Linear Programming [Rodolfo C. Salazar and 
Subrata K. Sen, Comm. ACM II (June 1968), 
437-440] 


B. Holmgren,* D. Obradovic¢,t and A. Kolm* 
[Recd. 13 May 1971] 

* Information Processing Department, ASEA 
S-721 83 Vasteras Sweden 

+ Boris Kidri¢ Institute of Nuclear Sciences, 
11001 Beograd, Yugoslavia 


In addition to previously given remarks on the algorithm, the 
following changes in the procedure phase 1 are necessary in order 
to avoid incorrect results for some types of problems with equality 
constraints: 

1. Introduce into phase | the variable first by the declaration 


Boolean /irst; 

2. After the statement Z:= 1; one has to set 

jmin := 0; first := true; 

3. The statement if Z = 1 then...should be replaced by 


Ll: if L = 1 then... 
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4. The statement im := imin [jmin]; should te preceded by 
if jmin = O then 
begin 
if first then 
begin 
first := false; L:=1; gotoZLl 
end else im := 0 
end else 


After these changes MINIT can handle problems, for which equal- 
ity constraints cause all the current values of e[i, ind[k]] to be 
negative at some stage in phase 1. For such cases the variables im 
and jmin in the old version either were left undefined or remained 


unchanged before entering the procedure rowtrans. An example 
of this is the trivial problem 
max xX, , when 


Nt, X2 > 0, 
Ag| < 1, 
2 > I, 


where the original procedure completely failed. 


we" 


nd 
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ALGORITHM 334 

NORMAL RANDOM DEVIATES [G5] 

James R. Bett (Recd. 13 Dec. 1965, 29 Nov. 1967, 
and 23 Jan. 1968) 

Stanford Research Institute, Menlo Park, Calif. 


KEY WORDS AND PHRASES: normal deviates, ndrmal distribution, random 
number, random number generator, simulation, probability distribution, 
frequency distribution, random 

CR CATEGORIES: 5.5, 5.13 


procedure norm (D1, D2); 

real D1, D2; 

comment This procedure generates pairs of independent 

normal random deviates with mean zero and standard deviation 
one. The output parameters D1 and D2 are normally distributed 
on the interval (— 0, +), The method is exact even in the 
tails. . 

This algorithm is one of a class of normal deviate generators, 
which we shall call ‘‘chi-squared projections” [1, 2]. An al- 
gorithm of this class has two stages. The first stage selects a 
random number’ LZ from a x:’-distribution. The second stage 
calculates the sine and cosine of a random angle 6. The generated 
normal deviates are given.by L sin (@) and L cos (6). 

The two stages can be altered independently. In particular, 
as better x2? random generators are developed, they can replace 
the first stage. (The negative exponential distribution is the 
same as that of x?.) 

The fastest exact method previously published is Algorithm 
267 [4], which includes a comparison with earlier algorithms. 
It is a straight chi-squared projection. Our algorithm differs 
from it by using von Neumann rejection td generate sin (¢) and 
cos (¢), [6 = 26], without generating ¢ explicitly [8]. This 
significantly enhances speed by eliminating the calls to the 
sin and cos functions. 

The author wishes to express his gratitude to Professor 
George Forsythe for his help in developing the algorithm. 

REFERENCES 

1. Box, G., anp Mutter, M. A note on the generation of normal 
deviates. Ann. Math. Stat. 28, (1958), 610. 

2. Mutter, M. E. A comparison of methods for generating 
normal deviates on digital computers. J. ACM, 6 (July 
1959), 376-383. 

3. von Neumann, J. Various techniques used in connection with 
random digits. In Nat. Bur. of Standards Appl. Math. Ser. 
12, 1959, p. 36. 

4. Pixr, M. C. Algorithm 267, Random Normal Deviate. 
Comm. ACM, 8 (Oct. 1965), 606.; 

comment f is any parameterless procedure returning a random 

number uniformly distributed on the interval from zero to one. 
A suitable procedure is given by Algorithm 266, Pseudo-Random 
Numbers [Comm. ACM, 8 (Oct. 1965), 605] if one chooses 
a=0, 6b = 1, and initializes y to some large odd number, such 
as y = 13421773.; 
begin 

real X, Y, XX, YY, S, L; 

comment von Neumann rejection for choosing a random 

angle @ = 26,6 = tan”? (Y/X); 

A: X:=R; Y:=2XR-1; 

XX := XT 2; YY s= ¥ fT 2 
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S:= XX + YY; 
if S >1then goto A; 
comment chooses L randomly from a x.?-distribution and 
normalizes with S; 
L := sqrt (—2XIn(R))/S; 
comment computes deviates as L X sin (¢) and L X cos (4); 


= (XX-YY)xL; 
D2:=2XXXYXL; 
end norm; 
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NORMAL RANDOM DEVIATES [James R. Bell, 
Comm. ACM 11 (July 1968), 498] 

R. Knop* (Recd. 5 Aug. 1968 and 8 Nov. 1968) 

Physics Dept., University of Maryland, College Park, 
MD 20742 


This work was supported in part by an Atomic Energy Commission 
contract. 

* Present address: 
Brunswick, NJ 08903 


KEY WORDS AND PHRASES: normal deviates, normal dis- 
tribution, random number, random number generator, simula- 
tion, probability distribution, frequency distribution, random 

CR CATEGORIES: 5.13, 5.5 


Physics Dept., Rutgers University, New 


Algorithm 334 produces pairs of normally distributed random 
deviates with zero mean and unit variance by the method of Box 
and Muller [1]. The sine and cosine required by the Box-Muller 
method are calculated by the von Neumann rejection technique 
[2]. This technique allows the calculation of the sine and cosine of 
an angle uniformly distributed over the interval (0, 27) without 
referencing the sine, cosine, or square root functions. We note 
however, that Algorithm 334 require as square root calculation in 
inverting the distribution function of the radius ‘(equal to L x S 
in the notation of the algorithm). 

We suggest that since the square root calculation seems un- 
avoidable, it can be used to obtain the required sine and cosine by 
more conventional means. Thus we propose sampling points from 
a density uniform over the unit disk in the X, Y-plane and cal- 
culating the sine and cosine from their definition in terms of the 
legs and hypotenuse of a right triangle. The following changes in 
Algorithm 334 are then necessary: 

a. Replace X := Rby X :=2XR-—-1 

b. Replace L := sqri(—2XIn(R))/S by 

L := sqri(—2XIn(R)/S) 
c. Replace Dl := (XX—YY) X Lby Dl := XXL 
d. Replace D2 := 2X XX YX Lby D2:= YXL 


Acknowledgment. The author thanks B. Kehoe for comments 
concerning this algorithm. 
REFERENCES: 


1. Box, G., AND MuLLER, M. A note on the generation of normal 
deviates. Ann. Math. Stat. 28 (1958), 610. 

2. Von NEUMANN, J. Various techniques used in connection with 
random digits. In Nat. Bur. Standards Appl. Math. Ser. 12, 
US Govt. Printing Off., Washington, D. C., 1959, p. 36. 
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REMARK ON ALGORITHM 334 


Normal Random Deviates 
[James R. Bell (with modifications due to R. Knop), Commun. ACM 12, 5 (May 
1969), 281.] . 


Allen E. Tracht [Received 12 December 1981; revised 16 December 1981; accepted 
16 December 1981] 

Biomedical Engineering Department, Case Western Reserve University, Cleve- 
land, OH 44106. 


As modified by Knop, Algorithm 334 produces pairs of normally distributed 
random deviates with zero mean and unit variance by a modification of the 
“polar” method due to Box, Muller, and Marsaglia [2]. The following change 
converts Algorithm 334, as modified by Knop, to the “polar” method: 


Replace: L := sgrt(—2 x In(R)/S) 
byL := sqrt(—2 x In(S)/S). 


Note that this modification eliminates one invocation of the uniform random 
number generator R. Using timing information given by Brent [1] in Algorithm 
488, the “polar” method would be expected to take (83 + 1.27U) microseconds 
rather than (83 + 1.77U) microseconds per call. This is faster than the (91 + 
1.38U) microseconds given by Brent for Algorithm 488. 


REFERENCES 
1. BRENT, R.P. Algorithm 488. A Gaussian pseudorandom number generator, Collected Algorithms 
of the ACM., Vol. 2, ACM, New York, 1978. 
2. Knutu, D.E. The Art of Computer Programming, Vol. 2, Addison-Wesley, Reading, Mass., 
1981, pp. 117-118 or 1969, pp. 104-105. 
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ALGORITHM 334 

NORMAL RANDOM DEVIATES [G5] | 

James R. Bett (Recd. 13 Dec. 1965, 29 Nov. 1967, 
and 23 Jan. 1968) 

Stanford Research Institute, Menlo Park, Calif. 


KEY WORDS AND PHRASES: normal deviates, normal distribution, random 
number, random number generator, simulation, prokability distribution, 
frequency distribution, random 

CR CATEGORIES: 5.5, 5.13 


procedure norm (D1, D2); 

real D1, D2; 

comment This procedure generates pairs of independent 

normal random deviates with mean zero and standard deviation 
one. The output parameters D1 and D2 are normally distributed 
on the interval (—, +). The method is exact even in the 
tails. 

This algorithm is one of a class of normal deviate generators, 
which we shall call ‘chi-squared projections’? [1, 2]. An al- 
gorithm of this class has two stages. The first stage selects a 
random number L from a x,?-distribution. The second stage 
calculates the sine and cosine of a random angle #. The generated 
normal deviates are given-by L sin (6) and L cosi(@). 

The two stages can be altered independently. In particular, 
as better x2? random generators are developed, they can replace 
the first stage. (The negative exponential distribution is the 
same as that of x.) 

The fastest exact method previously published is Algorithm 
267 [4], which includes a comparison with earlier algorithms. 
It is a straight chi-squared projection. Our algorithm differs 
from it by using von Neumann rejection to generate sin (¢) and 
cos (¢), [6 = 26], without generating ¢ explicitly [3]. This 
significantly enhances speed by eliminating the calls to the 
sin and cos functions. 

The author wishes to express his gratitude to Professor 
George Forsythe for his help in developing the algorithm. 

REFERENCES 

1. Box, G., anp Muturr, M. A note on the generation of normal 
deviates. Ann. Math. Stat. 28, (1958), 610. _ 

2. Mutter, M. E. A comparison of methods, for generating 
normal deviates on digital computers. J. .ACM, 6 (July 
1959), 376-383. 

3. von NEUMANN, J. Various techniques used in connection with 
random digits. In Nat. Bur. of Standards Appl. Math. Ser. 
12, 1959, p. 36. | . 

4. Pike, M. C. Algorithm 267, Random Normal Deviate. 
Comm. ACM, 8 (Oct. 1965), 606.; 

comment £ is any parameterless procedure returning a random 

number uniformly distributed on the interval from zero to one. 
A suitable procedure is given by Algorithm 266, Pseudo-Random 
Numbers [Comm. ACM, 8 (Oct. 1965), 605] if one chooses 
a=0, b =1, and initializes y to some large odd number, such 
as y = 13421773.; 
begin 

veal X, Y, XX, YY, S, L; 

comment von Neumann rejection for choosing a random 

angle ¢ = 26,6 = tan? (Y/X); 

A: X:=R; Y:=2xXR-1; 

XX :=X f2; YY := Y fT 2; 
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S:= XX + YY; 
if S >1then goto A; 
comment chooses L randomly from a x,?-distribution and 
normalizes with S; 
L := sgrt (—2XIn(R))/S; 
comment computes deviates as L X sin (¢) and L X cos (4); 
D1 := (XX-YY) XL; 
D2:=2XXxXYXL; 
end norm; 
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Algorithm 334 produces pairs of normally distributed random 
deviates with zero mean and unit variance by the method of Box 
and Muller [1]. The sine and cosine required by the Box-Muller 
method are calculated by the von Neumann rejection technique 
{2]. This technique allows the calculation of the sine and cosine of 
an angle uniformly distributed over the interval (0, 27) without 
referencing the sine, cosine, or square root functions. We note 
however, that Algorithm 334 require as square root calculation in 
inverting the distribution function of the radius (equal to L X S 
in the notation of the algorithm). 

We suggest that since the square root calculation seems un- 
avoidable, it can be used to obtain the required sine and cosine by 
more conventional means. Thus we propose sampling points from 
a density uniform over the unit disk in the X, Y-plane and cal- 
culating the sine and cosine from their definition in terms of the 
legs and hypotenuse of a right triangle. The following changes in 
Algorithm 334 are then necessary: 

a. Replace X := Rby X :=2XR-—-1 

b. Replace LZ := sqgrt(—2XIn(R))/S by 

L := sqrt(—2XIn(R)/S) 
ce. Replace D1 := (XX-—YY) X Lby Dl := XXL 
d. Replace D2 := 2X XX YX Lby D2:= YL 


Acknowledgment. The author thanks B. Kehoe for comments 
concerning this algorithm. 
REFERENCES: 


1. Box, G., anp Mutter, M. A note on the generation of normal 
deviates. Ann. Math. Stat. 28 (1958), 610. 

2. Von NEuMANN, J. Various techniques used in connection with 
random digits. In Nat. Bur. Standards Appl. Math. Ser. 12, 
US Govt. Printing Off., Washington, D. C., 1959, p. 36. 


COLLECTED ALGORITHMS FROM CACM 


ALGORITHM 335 

A SET OF BASIC INPUT-OUTPUT PROCEDURES 
[15] 

R. DE VoGELaERE (Recd. 8 Sept. 1966 and 18 Nov. 1966; 
description revised 2 Nov. 1967) 

Department of Mathematics and Computer Center, Uni- 
versity of California, Berkeley, CA. 94720 


By means of the primitives insymbol, ouftsymbol and length, 
as requested by this journal’s Algorithms Policy [Comm. ACM 
10 (Nov. 67), 729] a basic set of input-output procedures is 
defined aiming at quality and flexibility. outrea!, for in- 
stance, is written as a derived procedure; it outputs using the 
fixed point or the floating point representation, and rounds 
properly. Variants can easily be written because of the explicit 
call of the procedures decompose integer and decompose real. 
The highly recommended practice of echoing input is made 
easy with one subset of derived procedures (ioi, jor, iob, 
ioa). The documentation of output in the form of equivalent 
ALGOL statements is also provided when use is made of the 
subset ofi, ofr, otb, ota. The Berkeley style of providing in- 
formation on the form of output using prior calls of procedures 
such as real format is defined. A use of the parameter ouf- 
channel to provide information for simultaneous output to 
several channels is suggested. Interrelationship between the 
declared procedures is furnished in tabular form. 


KEY WORDS AND PHRASES: input output, transput, input output pro- 
cedures, input echo, quality output, decompose integer, decompose real, 
style, Berkeley style, procedures relationship, output documentation, equiva- 
lent ALGOL statements, ALGOL, ALGOL 60, integer format, real format, 
out integer, read real, input output Boolean, input output array, fixed point 
representation, floating point representation, output channel interpretation 

CR CATEGORIES: 4.0, 4.41 


l. Introduction 

The reader will find below a set of basic input-output proce- 
dures. Let me state first some of the purposes for writing this set 
and give a general description and specific information about 
the procedures and their interrelationship. 

In the October 1964 issue of the Communications of the ACM 
[1], a report on input-output procedures for ALGoL 60 was pub- 
lished. This report was prepared by a working group (WG 2.1) 
of the International Federation for Information Processing 
(IFIP/TC2) and approved by its Council. 

The approved primitives were: 


insymbol, outsymbol, length, inreal, outreal, inarray, outarray 
In the examples the following derived procedures were defined: 


outboolean, outstring, ininteger. 
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It is stated therein that ‘‘one needs, in practice, a fuller set of 
input-output procedures” and it is observed also that ‘different 
scheme of I/O procedures can be defined in it, largely by means of 
these primitives.” 

Since then, a few procedures have been published (see for in- 
stance [2, 3]) and the Algorithms Policy of this journal has re- 
quested [6] the use of the primitives of [1] and the use of out- 
boolean, outstring, ininteger and outinteger for input-output. 

The purpose of this algorithm is to present part of a consistent 
scheme of input-output procedures. The set uses as primitives, 
insymbol, outsymbol, and outstring (or equivalently length). 

First in integer, out integer, in real, out real, in Boolean, out 
Boolean are derived. in real is related to [2]; out integer and out real 
call the more basic procedures decompose integer and decompose 
real. out real allows not only for floating point representation [3] 
but also for fixed point representation and for correct rounding. 

Several sets of procedures, which point in several directions 
and which call the more basic ones, are then introduced. One set 
consists of parameterless input function designators akin to the 
procedure read of the Amsterdam Mathematisch Centrum. One 
set provides for echo of input to insure that the correct numbers 
have been read in—a practice which I recommend highly; it also 
provides for easy documentation of the output in the form of 
equivalent ALGOL statements. Another set with the same docu- 
mentation feature is for output only; the last set outputs num- 
bers, but no text. 

It is not suggested that the set of procedures of this algorithm 
be used for quantity output. Its main purpose is for quality output. 


2. General Description 

2.1. The only primitives used are insymbol, outsymbol, and 
length (through outstring). insymbol and outsymbol assume that 
the value —1 is associated with the symbol carriage return-line 
feed (or new card), which is not a basic symbol of Aucot 60. 
This is done in accordance with the convention of [1, Sec. 3]. 
outstring could have been avoided with some loss of clarity in the 
description of the procedures. insymbol, outsymbol, and outstring 
are defined in [1]. 

inreal and outreal are defined as in [2, 3] in terms of insymbol, 
outsymbol, and outstring. I do not believe that inreal and outreal 
should be primitives, firstly, because these procedures can be 
defined in terms of other primitives, and secondly, because many 
definitions will satisfy the requirements of [1]. On the other hand, 
the requirements set forth in [1] are most desirable. 

in channel and out channel must be declared as integers and as- 
signed a value in accordance with the requirements of insymbol 
and outsymbol [1]. 

I would like to observe in passing that the integer out channel 
cannot only be interpreted as identifying a single channel, but 
can also be interpreted as identifying a set of channels to all of 
which the output is to be sent. (If the binary representation of out 
channel is >> alt] X 21 7, the output is sent to channel ¢ if a[i] = 1 
and is not sent if a [7] = 0.) Although this is not yet implemented 
at Berkeley in this fashion, all output going to a terminal is now 
also sent to the printer. When time-sharing becomes widespread 
this interpretation will, I hope, be increasingly popular. 

2.2. The more basic input-output procedures are in integer, 
in real, and in Boolean; the first two use in symbol only through 
the integer procedure symbol. 

symbol recognizes only the following basic symbols: 
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and carriage return-line feed (or new card). 

in integer associates to the second parameter, which is of type 
integer, the next integer read from channel (the first parameter). 
Any number of consecutive spaces are ignoréd kefore the first 
digit; after the first digit, termination occurs wit 1 two consecu- 
tive spaces, ® comma, or a carriage return-line jeed. A comma 
before the first digit or sign, a period, (10), or any other illegal 
symbol will call the procedure error. 

in real associates to the second parameter, which i is of type real, 
the next real number read from channel (the! first parameter). 
Any number of consecutive spaces are ignored before the first 
digit, period, or (io); after that, termination occurs with two 
consecutive spaces, a comma, or a carriage return-line feed. A 
comma before the first digit, sign, period, or (10), or any other 
illegal symbol will call the procedure error. CO6mmunication be- 
tween in integer, in real, and in symbol to take care of separation 
between integers or reals requires the nonlocals 281000 and 
28100bc. 

in Boolean associates to the second parameter, which is of type 
Boolean the next Boolean read from channel (the first parameter) ; 
any number of leading spaces or carriage returns-line feed are 
ignored; any illegal symbol will call the procedure error. 

The procedure error has one parameter of type integer. It can 
be written according to the wishes of a user or of'a group of users. 
An example with diagnostics in full is given below. 

2.3. The more basic output procedures are out integer, out real, 
and out Boolean. The information on the form of the output can be 
given in various ways; the style used for these output procedures 
is what I will call the Berkeley style by contrast with the style 
used for output procedures at, for instance, the Amsterdam’s 
Mathematisch Centrum or at Copenhagen’s Regnecentralen. 
Call of these output procedures must be preceded by a call of 
corresponding procedures integer format, real format and Boolean 
format. 

The only parameter of integer format determines the field width 
of any integer sent to the output channel. The parameters of 
real format are a Boolean, which determines when the value is 
true that fixed point representation is desired for the output of 
real numbers and when the value is false that floating point repre- 
sentation is desired. The second parameter determines the field 
width, the third parameter determines the number of decimal 
places and affects also the rounding of the number. The only 
parameter of Boolean format determines the field width. 

The following decisions were made for out integer, out real, and 
out Boolean: If the field parameter is less than required, it is re- 
placed by 20. The sign is outputed before the most significant 
digit if the number is negative. In floating point form, the first 
significant digit is immediately to the left of the decimal point. 
The exponent is replaced by four spaces if it is zero; otherwise the 
sign of the exponent is always outputed and ne exponent is 
restricted to the interval —99 to 99. 

If the user wishes to write variants of the Bariieléy style, for 
instance if he wishes always to print the sign, or if he wishes to 
output it as the first character of the field, or if he wishes to out- 
put a space between every third or fifth digit, his task will be 
greatly eased by the introduction of the procedures decompose 
integer and decompose real which provide the basic information 
about an integer (its sign, the number of significant decimal 
digits, and the digits) or about a real (its sign, its size, the scale 
factor such that the scaled number ‘has its first significant digit 
immediately to the left of the decimal point and the digits). 

In decompose real, the size information determines if the num- 
ber is too small; an integer declaration has been chosen instead 
of a Boolean to provide for the possibility of another test, which 
would determine if the number is too large. The rounding for reals 
is taken care of in decompose real. 
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Correct rounding is essential for a set of input-output pro- 
cedures of quality. Although the point may be argued, I consider 
incorrect the output of 2 to two decimals as 1.99 unless computer or 
computations have only that precision. Examples: 


real format (true, 5,3); out real (1, 0.99099); 

real format (false, 10, 2); out real (1, —0.99099) ; 
will output 

0.991-9.9110 — 1. 


2.4. Four more sets of input-output procedures follow; these 
procedures do not require explicit calls of the format procedures: 

read i, read r, read b are function designators without parameters 
which can be used to input respectively an integer, a real or a 
Boolean. 

101, tor, tob are function designators and ioa is a procedure to 
input respectively an integer, a real, a Boolean or a real array and 
to output an equivalent ALGoL statement. 

This style, which I have introduced to give the output in the 
form of parts of an ALGOL program in connection with the genera- 
tion of the nonlinear equations satisfied by Runge-Kutta type 
methods (to be published elsewhere), can also be used to describe 
input and output within the conventions of the ALGor language. 

For 707, ior, iob, the second parameter gives the string to be 
outputted; the others give the parameters corresponding to those 
of the format procedures. For toa, the second and third parameters 
are the first and last subscript of the element of the one dimen- 
sional array to be read and the last parameters give the string to 
be outputted as well as the format information. Examples: 

tor(r, ‘timeuinuminutes’, true, 5, 2); 
toala, 1, 3, ‘hippopotamus’, true, 4, 1) 
would output with appropriate input: 
time in minutes := 21.05; 
= 1; for hippopotamus |i) := 15.1,6.2,7.0 doi := 7+ 1; 

The next four procedures of, oir, otb, and ota are for output 
only; the form of output is identical to that of 701, ior, iob, and 
toa. 

The last four procedures out?, outr, outb, and outa are for output 
only. They output an integer, a real, a Boolean, or a sequence of 
reals, the format information being provided by the parameters 
of these procedures. 


3. Specific Information About Procedures, Their 
Relationship, and the Nonlocal Parameters 

To ease the local exchange of procedures and nonlocal iden- 
tifiers of procedures between people at Berkeley, conventions 
have been introduced which are examplified in the procedures of 
this algorithm. All appropriate nonlocal identifiers are formed 
using as first symbols the letter z followed by a digit associated 
to the writer (I use 8) followed by 3 digits corresponding to the 
number of the procedure in which the nonlocal identifier is first 
used (my procedure symbol is number 100, in integer is number 101, 
etc.) followed by an ordinary identifier. 

The following declarations must be made in the same block as 
that of this algorithm or in an outer block: 


integer in channel, out channel, 28106n, 28107n, 28107d, 28108n; 

Boolean 28100b, 28100bc, 281078; 

procedure in symbol (channel, string, destination); (see Comm. 
ACM 7 (Oct. 1964), 628-630) 

procedure out symbol (channel, string, destination); 

procedure out string (channel, string); (Idem) 


in channel and out channel must be assigned an appropriate value 
before a call of many of the input-output procedures (see Table I). 

Table I indicates the relationship between the procedures 
and the nonlocal variables. Moreover, an explicit call of out integer, 
out real, and out Boolean requires a preceding call of the corre- 
sponding format procedure integer formal, real format, and Boolean 
format. 
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In Table I, each of the procedures is identified by a number. 
An X indicates that the procedure corresponding to the number 
in the same column or the nonlocal identifier on top of the same 
column is used explicitly (and perhaps also implicitly); + in- 
dicates that the corresponding procedure or identifier is used 
implicitly; 0 is placed in the column corresponding to the number 
of the procedure. Related procedures are grouped together in a 
file whose name appears in the first column. This inf ormation will 
be used in further publications. 

The following declaration can be used for the ptocedure error: 


procedure error (i); value; integer 7; 
begin procedure nlcr;  outsymbol (channel, ‘’, —1); 
nler; 
if ¢ = 8100 then out string scat nacre ‘uu 
, ~U-+Ut(epace)Licarriagelreturn-linettfeed’ ) else 
if t = §10100 then out string (1,‘whileLireadingUantinteger, LantuitlepalLey mbolu 
tsLireadLibeforeLitheLifirstLidigit’) else 
if t = 810101 then out string (1,‘whileUreadingtanUinteger, LantillepalLieymbolt 
teLlreadUafterLitheLifirstLidigit’) else 
if 1 = 810200 them out string (1, ‘whileUlreadingLaLireal,anLiillegaltisy mboltt 
tsLireadLiwhiletreadingltheLUdectmalUfraction’) else : 
if ¢ = 810201 then oué string (1,‘whileUreadingUaUreal, UariLiillegalUsymbolLisut 
readUbeforeLitheLlfirstUdigitliperiodUorts10’) else 
if ¢ = 810202 then vut string (1,'whileltreadingUaUreal UanLkllegallsymbolUisli 
readUwhiletlreadingLtheLler ponentUpart’) else 
if « = 810203 then out string (1,‘ alrealLinumberLisLlim@jroperlyier minated) 
else 
out string (1, ‘whilerendingLJaUBooleanLiaLisymbolt wohichLisunotuitrueLloruifalse, 
tsLreadLibeforellter mination’); 
nler 
end error 
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integer procedure. symbol(s); integer s; 

comment symbol := s := the integer represehtation of the 
next symbol read, 0 to 9 for the integers, 10 for ‘-’, 11 for 
‘—? 12 for ‘+’, 18 for ‘0’, and 14 for ‘,’ or for carriage 
return (or new card) represented by —1 when processed by 
in symbol or for two consecutive spaces when the nonlocal 
Boolean 28100) is false. When 281000 is true any number of 
consecutive spaces are ignored. Any other symbol will call a 
nonlocal procedure error with parameter equal to 8100; 
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begin 

read: in symbol(in channel, ‘0123456789.—-++10u,’, 8); 
if s = —1 A 28100bc then go to read; _ 
if s = 15 then 
begin 

if z8100b then go to read 
else in symbol(in channel, ‘(0123456789.—-+iou,’, s) 

end; 
ifs=-—-1Vs= 
else 
begin if s < 0 then error(8100); 

end symbol; 

procedure in integer(channel, 1); 
integer channel, 7; 

comment 7 := the next integer read from channel, any number of 
consecutive spaces are ignored before the first digit, after the 
digit termination occurs with two consecutive spaces, a comma 
or @ carriage return, any illegal symbol will call a nonlocal 
procedure error with parameter equal to 8100 or 810100 or 
810101; 


16 then symbol := s := 14 
symbol := s := s — lend 


value channel; 


begin 
integer s; Boolean negative; 
negative := false; 28100b := 28100bc := true; 


in channel := channel; 
symbol(t); 28100bc := false; 
if 7 = 12 then symbol(i) 
else if 7 = 11 then begin negative := true; 
if 7 = 10 then error(810100); 
28100b := false; 
Li: if symbol (s) < 10 then begin? := 1° X 1+ 8; 
if s ~ 14 then error(810101); 
if negative then 7 := —7 
end in integer; 
procedure in real(channel, r); value channel; 
integer channel; real r; 
comment r := the next real number read from channel, any num- 
ber of consecutive spaces are ignored before the first digit. 
After the first digit termination occurs with two consecutive 
spaces, a comma or a carriage return. Any illegal symbol will 
call a non local procedure error with paramater equal to 8100 
or 810200 or 810201 or 810202 or 810203. The main differences 
with ALGORITHM 239 of W. M. McKeeman [2] are the substi- 
tution of his integer procedure CH AR by symbol, the introduc- 
tion of the Boolean 28100b, the introduction of a parameter in 
the nonlocal procedure error and the change of type of a few 
declarations; 
begin 
real sig, fp, d, ep, ip; imteger esig, ch; 
real procedure unsigned integer; 
begin 
real u; 
u:= ch; 
K: if symbol(ch) < 10 then begin u:= u X 10 + ch; 
unsigned integer := u 
end unsigned integer, 
sig := 10; ep := fp := 
in channel := channel; 


symbol(i) end; 


goto Ll end; 


go to K end; 


0; 28100b := 28100bc := true; 


symbol(ch); 28100bc := false; 

if ch = 12 then symbol(ch) 

else if ch = 11 then begin sig := —1.0; symbol(ch) end; 
28100b := false; 

if ch S 10 then 

begin 


tp := if ch < 10 then unsigned integer else 0; 
if ch = 10 then 


begin 
if symbol(ch) = 10 then error(810200) ; 
fo := 0; d:= 0.1; 
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M: fp:=fp+ehXd; d:=dX0.1; 
if symbol(ch) < 10 then go to M 
end decimal fraction 
end decimal number 
else if ch = 13 then ip := 1 
else begin error(810201); ip := 1 end; 
if ch = 13 then 
begin esig := 1; 
if symbol(ch) = 12 then symbol(ch) 
else if ch = 11 then begin esig := --1; symbol(ch) end; 
if ch < 10 then ep := unsigned integer X esig 
else begin error(810202); ep := 0 end 
end exponent part, 
if ch # 14 then error (810203); 
r:= sig X (ipt+fp) X 10.0 T ep 
end in real; 
procedure in Boolean(channel, b); value channel; 
integer channel; Boolean b; 
comment 6 := the next Boolean read from channel, any number 
of spaces or carriage returns are ignored, any other symbol will 
call a nonlocal procedure error with parameter equal to 8103; 
begin 
integer 7; 
L:in symbol(channel, ‘true falseu’, i); 
ifi = 3 \/i = —1 then go to L; 
if 7 < 0 then error(8103); 
b:=i=1 
end in Boolean; 
procedure decompose integer(i, negative, n of digits, digit); 
value 71; integer 1, n of digits; Boolean negative; 
integer array digit; 
comment negative := 1 < 0, n of digits := the number of decimal 
digits of ¢ (if 7 = 0 then n of digits := 0), digit [0: n of digits — 1] 
:= the decimal digits of ¢ starting from the right; 
begin 
integer 7; 
if i < 0 then begin negative := true; 1 := —i end 
else negative := false; 
n of digits := 0; 


if 7 > 0-then 
begin 
j= t+ 10; digitln of digits] := i — 7 X 10; 
n of digits := n of digits +1; t:= 7; goto L 
end 
end decompose integer; 
procedure decompose real(r, maz n of digits, negative, size, exponent, 


digit); 
value 7; integer max n of digits, size, exponent; real r; 
Boolean negative; integer array digit; 

comment negative := r <0, size := —1 if r is too small, i.e. is 
such that when abs(r) is multiplied repeatedly by 10 it does 
not become eventually larger than one, size := 0 otherwise, 


exponent := the power of 10 by which r is to be divided to ob- 
tain a number whose first significant digit is immediately to 
the left of the decimal point, digit (0: mux n of digits — 1] := 
the decimal digits of r starting with the first significant digit 
to the left; 

begin 
integer 7, k, m; 
Boolean procedure too small(r);_ real r; 

too small ;:= abs(r) < 2 f (—127); 

comment this procedure should be replaced appropriately; 
negative := false; 
if too small (r) then 
begin size := 1; go to end decompose end 
else size := 0; 
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if r < 0 then begin negative := true; r := —r end; 
if r < 1 then 
begin | 

exponent := —1; 


scale up: r := r X 10; 
if r < 1 then 


begin exponent := exponent — 1; go to scale up end 
end 
else 
begin 
exponent := 0; 
lest: 


if r = 10 then 
begin exponent := exponent + 1; 
go to lest end 
end; 
m := max n of digits; 
rs=r+5x0.1 f m; 
t := entier(r); 
if 7 = 10 then 
begin 
t#:=1; 
end 
else if 7 = 0 then? := 1; 
digit(0] := 7; 
for k := 1 step 1 until m — 1 do 
begin 
r:= (r—t) X 10; 7 := entier(r); 
4 := digit(k] := if 7 S 0 then 0 else if 7 = 10 then 9 else 7 
end; 
end decompose: 
end decompose real; 
procedure integer format(n); 
procedure real format(B, n, 4); 
begin 
28107B := B; 
end real format; 
procedure Boolean formai(n); 
procedure out integer(channel, 7); 
integer channel, 1; 
comment the style of this procedure and of the out real and out 
Boolean procedures given below is what I will call the Berkeley 
style by contrast with that used for output procedures at the 
Amsterdam Mathematisch Centrum or at the Copenhagen 
Regnecentralen, for instance. It is characterized by the use of 
a field width parameter n and for real numbers, by the use of a 
parameter B which decides if the fixed point (value true) 
or the floating point representation (value false) is requested 
and by the number of digits d after the decimal point. The 
sign is outputed just before the most significant digit, if the 
number is negative. In floating point form the first significant 
digit is immediately to the left of the decimal point. If the 
field parameter is less than required, it is replaced by 20. These 
procedures pair with the corresponding input procedures if the 
field width is at least two units greater than required; 
begin 
integer n of digits, j,k; Boolean negative; 
integer array digit[0: 19]; 
decompose integer(1, negative, n of digits, digit); 
if n of digits = 0 then 
begin n of digits := 1; digii[0] := 0 end; 
j := n of digits + (if negative then 1 else 0); 
for k := (if 7>z2z8106n then 19 else 28106n—1) 
step —1 until j do out string(channel, ‘u’); 
if negative then out string(channel, ‘—’); 
for k := n of digits —1 step —1 until 0 do 
out symbol(channel, 0123456789’, digit{k]+1) 
end out integer; 


rs=rX 0.1; 


exponent := exponent-+1; m:=m+1; r:=r/10 


integer n; 28106n := n; 
integer n, d; Boolean B; 
28107n := n; 28107d := d 

integer n; 281087 := n; 
value channel, 7; 
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procedure out real(channel, r); 
integer channel; realr; 
comment this procedure outputs r properly rounded to channel 
using the Berkeley style. In this variant, the exponent part 
in the floating point form is replaced by 4 spaces if the exponent 
is zero. The sign of the exponent is always outputed, for com- 
patibility with in real. The exponent is restricted to the interval 
—99 to 99; 
begin 
integer j, k, size, exponent; Boolean negative; 
integer array digit(0: 28107d+1+ (if 28107B then 
entier (In (abs(r) +1) X0.4343) else 0)]; 
procedure out digit(d); integer d; 
begin 
out symbol(channel, ‘0123456789’, d+1) 
end out digit; 
if z28107B then 
begin 
decompose real(r, if z8107d+exponentS0 then 1 else 1+ 
28107d-+ exponent, negative, size, exponent, digit); 


value channel, 7; 


if size = —1 then 

begin 
exponent := if 28107d = 0 then 0 else —28107d — 1; 
digit(0) := 0 

end 


else if 28107d = 0 A exponent < 0 then 
begin exponent := 0; digii[0] := end; 
j := (if negative then 3 else 2) + 
(if 28107d = 0 then —1 else 28107d) + 
(if exponent 2 0 then exponent else —1); 
for k := (if j>28107n then 19 else 28107n--1) step —1 
until j do out string(channel, ‘u’); 
if negative then out string (channel, ‘—’); 
for k := 0 step 1 until exponent do 
out digit(digit{k]) ; 
if 28107d > 0 then 
begin 
out string(channel, ‘-’); 
for k := exponent + 1 step 1 until exponent + 28107d do 
if k < 0 then out string(channel, ‘0’) else out digit(digit{k]) 
end 
end fixed point representation 
else 
begin 
decompose real(r, 28107d+1, negative, size, exponent, digit); 
if size = —1 then 
begin 
exponent := 0; 
for k := 0 step 1 until 28107d do digit[k] := 0 
end; 
j := 6+ (if 28107d=0 then —1 else 28107d)-+ 
(if negative then 1 else 0); 
for k := (if 7>28107n then 19 else 28107n—1) 
step —1 until j do 
out string(channel, ‘u’); 
if negative then out string(channel, ‘-’); 
out digit (digit (0}); 
if 28107d ~ 0 then owt string(channel, ‘-’); 
for k := 1 step 1 until 28107d do out digit(digit{k}) ; 
if exponent = 0 then out string(channel, ‘uuuu’) 
else 
begin 
out string(channel, ‘10’); 
comment This procedure assumes that 10 takes one space, 
if not, the preceding statement should be modified; 
if exponent = 0 then out string(channel, ‘-+’) 
else 
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begin out string(channel, ‘—’); 
exponent := —exponent 

end; 

j := exponent + 10; 


if 7 = 0 then out string(channel, ‘u’) 
else out digit(j); 
out digit(exponent—jX 10) 
end 
end floating point representation 
end out real; 
procedure out Boolean(channel, b); 
integer channel; Boolean b; 
begin 
integer k, j; 
j := if b then 4 else 5; 
comment this procedure assumes that true and false take 
respectively 4 and 5 spaces, if not the preceding statement 
should be modified; 
for k := (if 7>28108n then 19 else z8108n—1) step —1 until 
j do out string(channel, ‘u’); 
out symbol{channel, ‘true false’, j—3) 
end out Boolean; 
integer procedure read 7; 
begin 
integer 7; 
in integer(in channel, 1); 
end read 1; 
real procedure read r; 
begin 
real r; 
in real(in channel, r); 
end read r; 
Boolean procedure read b; 
begin 
Boolean b; 
in Booleantin channel, b); 
end read b; 
integer procedure ioi(i,s,n);_\ string s; integer 7,7; 
comment this and the next 3 procedures input respectively an 
integer, a real number, a Boolean or a one dimensional array, 
they output an equivalent Algol statement; 


value channel; 


read t i= 1 


readr := 7 


read b := 6b 


begin 
out string(out channel, 8); out string(out channel, ‘u := u’); 
in integer(in channel, 1); tot := 7; 


integer format(n); out integer(out channel, 2); 
out string(out channel, ‘;u’) 

end 707; 

real procedure ‘or(r, s, B, n, d); 


real 7; string s; Boolean B; integer n, d; 
begin 

out string(out channel, s); 

out string(out channel, ‘u := wu’); 


in real(in channel, r); ior := 7; 
real format(B, n,d); out real(out channel, r); 
out siring(out channel, ‘;u’) 
end ‘or; 
Boolean procedure i0b(B, s, n); 
integer 7; 
begin 
out string(out channel, s); 
out string(out channel, ‘u := wu’); 
in Boolean(in channel, B); tob := B; 
Boolean format(n); out Boolean(out channel, B); 
out string(out channel, ‘;u’) 
end 70b; 
procedure ioa(a, |, u, 8, B, n, d); 
integer 1, u, n, d; array a; 


Boolean b; string s; 


string s; Boolean B; 
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begin 
integer 1; 
if 1 > wthen go to end toa; 
real format(B,n,d); otz(l, ’, 3); 
out string(out channel, ‘uforu’); 
out string(out channel, s); 
out string(out channel, ‘[t]u := 
for 7 := | step 1 until u do 
begin 
in reai(in channel, a[i]s- out real(out channel, a[z]); 
if 7 < u then out string(out channel, ‘,u’) 
else out string (out channel, ‘udouiu := uiu-+ul;u’) 
end; — 
end toa: 
end 10a; 
procedure oti(i,s,n); value i, n; integer i, n; string s; 
comment this and the following 3 procedures output Algol 
statements compatible with those of the input output procedures 
tot, tor, tob, 10a; 
begin 
out string(out channel, s); 
out string(out channel, ‘u := wu’); 
integer format(n); out integer(out channel, 1); 
out string(out channel, ‘;u’) 


u’); 


end ott; 
procedure oir(r, s, B, n, d); 

realy; strings; Boolean B; integer n, d; 
begin 


out string(out channel, s); 
out string(out channel, ‘u := u’); 
real format(B, n,d); out real(out channel, r); 
out string(out channel, ‘;u’) 
end otr; 
procedure o1b(B,s,n); 
begin 
out string(out channel, s); 
out string(out channel, ‘u := wu’); 
Boolean format(n); out Boolean(out channel, B); 
out string (out channel, ‘; u’) 
end otb; 
procedure ola(a, 1, u, s, B, n, d); 
integer |, u, n,d; array a; 
begin 
integer 7; 
if | > wu then go to end ota; 
real format(B, n, da); oti(l, ‘’, 3); 
out string(out channel, ‘uforu’); 
out string(out channel, s); 
out string(out channel, ‘[i]u := wu’); 
for 7 := | step 1 until u do 
begin 
out real(out channel, a[t]); 
if i < u then out string(out channel, ‘,’) 
else out string (oul channel, ‘udoutu:=uiu-+ul;u’) 
end; 
end ota: 
end ota; 
procedure outi(i, n); integer 7, n; 
comment this and the following 3 procedures output integers, 
real numbers, Booleans or one dimensional] arrays using format 
as indicated in out integer; 
begin 
integer format(n); 
out integer(out channel, 7) 
end ouiz; 
procedure outr(r,B,n,d); realr; Boolean B; integer 7n, d; 


Boolean B; string s; integer 7; 


string s; Boolean B; 
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begin 
real format(B,n, d); 
out real(out channel, r) 
end outr; 
procedure outb(B, n); 
begin 
Boolean format(n); 
out Boolean(out channel, B) 
end outb; 
procedure outa(a,l,u, B,n,d); 
Boolean B; 
begin 
integer 7; 
if | > u then go to end outa; 
real format(B, n, d); 
for 7 := | step 1 until u do out real(out channel, a[z}); 
end outa: 
end outa 


Boolean 6; integer n; 


integer 1, u, n, d; 


array @; 
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ALGORITHM 336 

NETFLOW [H] 

T. A. Bray anp C. WiITzGaLu 
(Reed. 2 Oct. 1967 and 20 May 1968) 

Boeing Scientific Research Laboratories, Seattle, WA 
98124 


KEY WORDS AND PHRASES: capacitated network, linear pro- 
gramming, minimum-cost flow, network flow, out-of-kilter 
CR CATEGORIES: 5.32, 5.41 


procedure NETFLOW (nodes, arcs, I, J, cost, hi, lo, flow, pi, 

INFEAS); 

value nodes, arcs; integer nodes, arcs; 

integer array I, J, cost, hi, lo, flow, pt; label INFEAS; 

comment This procedure determines the least-cost flow over an 
upper and lower bound capacitated flow network. 

Each directed network arc a is defined by nodes J{a] and J[a], 
has upper and lower flow bounds hz[a] and lo[a], and cost per unit 
of flow cosét[a]. Costs and flow bounds may be any positive or 
negative integers. An upper flow bound must be greater than or 
equal to its corresponding lower flow bound for a feasible solu- 
tion to exist. There may be any number of parallel arcs connect- 
ing any two nodes. 

The procedure returns vectors flow and pt. flow[a] is the com- 
puted optimal flow over network are a. pi[n] is a number—the 
dual variable—which represents the relative value of injecting 
one unit of flow into the network of node n. NETFLOW may be 
entered with any values in vectors flow and pi (such as those 
from a previous or a guessed solution) feasible or not. If the 
initial contents of flow do not conserve flow at any node, the 
solution values will also not conserve flow at that node, by the 
same’ amount. 

This procedure is a revision (see remark by T. A. Bray and C. 
Witzgall [1]) of Algorithm 248 [2]. Like the original, it follows 
the out-of-kilter algorithm described by D. R. Fulkerson [3] 
and elsewhere. It follows the RAND code by R. J\ Clasen (For- 
TRAN) in three instances, using a single set of labels na, which 
correspond to the nb of Algorithm 248, avoiding superfluous 
tests in the part following BACK (for instance, c > 0 (A flow[a] < 
lo[a] is equivalent to c > 0 at this point of the program), and 
taking advantage of the fact that arcs remain in kilter and need 
not be rechecked again. In addition, the convention inf = —1 
is adopted in order to permit costs and bounds of value around 
99999999 without their interfering with the initiation of mini- 
mum search. 

REFERENCES: 

1. Bray, T. A., anD WitTzG@aLu, C. Remark on Algorithm 248, 
NETFLOW. Comm. ACM 11 (Sept. 1968), 633. 

2. Bricas, Wiittam A. Algorithm 248, NETFLOW. Comm. 
ACM 8 (Feb. 1965), 103. 

3. Futxerson, D.R. An out-of-kilte method for minimal-cost 
flow problems. J. Soc. Ind. Appl. Math. 9 (Mar. 1961), 
18-27; 

begin 

integer a, aok, c, cok, del, eps, inf, lab, m, n, src, snk; 

integer array na[l: nodes]; 

integer procedure minp(z,y); value x,y; integer 7,y; 

begin 

ifxe <y A 22 Othen minp := x else minp := y 
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end minp; 
comment check feasibility of formulation; 
for a := 1 step 1 until arcs do 
if lola] > hila] then go to INFEAS; 
inf := —1; 
comment find out-of-kilter arc; 
for aok := 1 step 1 until arcs do 
begin 
cok := cost{aok] + pi[I[aok}] — pi[J[aok}]; 
TEST: if flowlaok] < lo[aok] V (cok<0/A flow[aok] <hi{aok]) then 
begin 
sre := J{aok]; snk := I[aok]; nal[src] := + aok,; 
go to LABL 
end; 
if flowlaok] > hilaok] V (cok>0A flow[aok]>lolaok]) then 
begin 
sre := I[aok]; 
go to LABL 
end; 
comment arc aok is in kilter; 
go to NEXT; 
comment arc aok is out-of-kilter, clear all labels but source 
label, start new labeling; 
LABL: for n := 1 step 1 until src — 1, sre + 1 step 1 until 
nodes do na[n] := 0; 
LOOP: lab := 0; 
comment switch set for determining whether a pass thru 
the list of ares yields a new label; 


snk := J[aok]; na[src] := —aok; 


for a := 1 step 1 until arcs do 
begin 
if (na[T[a]]=O0/Ana[J[a}]=0) V (na[Z[a}]~0/Anal[J[a]] +0) then 
go to XC; 


e := cosi{a] + pr[Z[a]] — pi[J[a]]; 
if nalI[a]] = O then go to XA; 
if flow[a] = hifa] V (flow[a]2lolaJAc>0) then 


go to XC; 
na[J[a]] := +a; goto XB; 
XA: if flowla] S lola] V (flowfa] <hila]Ac<0) then 
go to XC; 
na[I{a]] := —a; 
XB: lab :=1; 


comment node labeled, test for breakthru; 
if na[snk] ~ 0 then go to INCR; 
XC: end no breakthru; 
if lab ~ 0 then go to LOOP; 
comment nonbreakthru, determine change to p2 vector; 
del := inf; 
for a := 1 step 1 until arcs do 
begin 
if (na[I[a]]=0/na[J[a]]=0) V (na[Z[a]] ~0/Ana[J[a]]~0) then 
go to XD; 
e := costla] + pi[Z[a]] — pilJ[al]; 
if na[J[a]] = 0 A flowla] < hi[a] then 
del := minp(del,c); 
if na[J[a]] +~ 0 A flow[a] > lofa] then 
del := minp(del,—c); 
XD: end; 
if del = inf then 
begin 
if flowlaok] = hifaok] V flow[aok] = lo[aok] then 


COLLECTED ALGORITHMS (cont.) 


del := abs(cok) 
else go to INFEAS 
end exit, no feasible flow; 
comment change pz vector by computed del; 
for n := 1 step 1 until nodes do 
if na[n] = 0 then pi[n] := pi[n] + del; 
comment test whether aok is now in kilter; 
if del = abs(cok) /\ flow{aok] > lofaok] /\ flow[aok| 
< hilaok] then 
go to NEXT; 
cok := costlaok] + pi[I{aok]] — pi[J[aok]]; 
go to LOOP; 
comment breakthru, compute incremental flow; 
INCR: eps := inf; n := sre; 
BACK: a:= na[n]; 
if a > 0 then 
begin 
m := Ia}; 
if cost{a] + pi[m] — pi[n] > 0 then 
eps := minp(eps, lola]—flow[a]) 
else eps := minp(eps, hila]—flow|a]) 
end 
else 
begin 
m:= J[—a]; 
if cost{—a] + pi [n] — pi |m] < 0 then 


eps := minp(eps,flow[—a]—h7|—a]) 
else eps := minp(eps,flow|—a]—lo[—a]) 
end; 
n:i= m; ifn # src then go to BACK; 


comment change flow: by eps; 
BACK2: a := nan; 
ifa > 0 then 
begin 
m:= Tal; flowla] := flowla] + eps 
end 
else 
begin 
m:= J[—al; flowl[—a] := flow[—a] — eps 
end; : 
n:=m; if n # sre then goto BACK?2; 
comment test whether aok is now in kilter; 
go to TEST; 
NEXT: 
end find next out-of-kilter are 
end VNETEFLOW with a feasible, optimal flow 


REMARIx ON ALGORITHM 336 [H] 
NETIFLOW [T. A. Bray and C. Witzgall, Comm. ACM 11 
(Sept. 1968), 631-632] 
T. A. Bray anp C. WiTzGauu (Reed. 20 Oct. 1969) 
Boeing Scientific Research Laboratories, Seattle, WA 
98124 


KEY WORDS AND PIIRASES: capacitated network, linear 
programming, minimum-cost flow, network flow, out-of-kilter 
CR CATEGORIES: 5.32, 5.41 
The algorithm as published contains an error on the 11th line 
following the line labled XD, which reads: 
if del = abs(cok) (A ... 
This line should read 
if del > abs(cok) A... 
Fortunately, this error does not invalidate the algorithm but may 
in some cases lead to additional operations. 
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ALGORITHM 337 Li= 1; 
CALCULATION OF A POLYNOMIAL AND ITS ee ee 
DERIVATIVE VALUES BY HORNER SCHEME [C1] Tie 
W. Panxrewicz (Recd. 28 Mar. 1968 and 16 May 1968) rf] = r[é] X l 
Warszawa - 1, Al. 3-go Maja 2/68, Poland. ra | 

en 


KEY WORDS AND PHRASES: function evaluation, polynom- end horner 
ial evaluation, Algol procedure, Horner’s scheme 
CR CATEGORIES: 5.12, 4.22 


procedure horner(n,a,k,r,20,b); value n,k,20,b; 
integer n,k; real 20; Boolean 6; array 4@,r; REMARK ON ALGORITHM 337 [C1] 
comment If b is true the procedure calculates and storesinr[t] CALCULATION OF A POLYNOMIAL AND ITS 
eperverterot : DERIVATIVE VALUES BY HORNER SCHEME 
ae one [W. Pankiewicz, Comm. ACM 11 (Sept. 1968), 633] 
é o> alj]Xz T 9)/de Outver K. Smiru (Recd. 27 Sept. 1968) 
and z = 20 for? = 0,1,+--,k. If } is false it calculates and ee si greeny aN 
stores in the array r the values of the first k-+-1 coefficients of Bes Paso Bis ane eoens 


the expansion of the polynomial in a power seriés in the neigh- KEY WORDS AND PHRASES: function evaluation, polyno- 
borhood of 20, i.e. mial evaluation, ALGoL procedure, Horner’s scheme 
CR CATEGORIES: 4,22, 5.12 


n Tt : 
2a xaTtj= Pe) xX (@—20) Tt. ' The definition of the given polynomial is incorrect in the com- 
ment. In both the third line and the eighth line of the comment, 
Here n is the degree of the polynomial whose coefficients are _a[j] should be replaced by a[n — j]. Also the first word ‘‘and’’ of the 
given by a[0:n]. It is assumed that O< k< n..If k = 0 only — fourthline of the comment should be changed to ‘‘at’’. 
the value of the polynomial is calculated. If b is false the choice 
k = n would be most useful. 
This algorithm is essentially equivalent in Algorithm 29 
[Comm. ACM 8 (Nov. 1960), 604] in terms of quantities com- 
puted, but the application of Horner’s scheme significantly 
reduces the number of operations. 
Example 1. For the polynomial of degree n'= 5: w(x) = 
efTf5+2X2x27T4-3X2<eT3+8xX2zT 2-7X«#+l1l, 
k = 2,20 = 2 andb = true, the following was obtained: r(0) = 
69, r{l] = 133, r[2] = 236, ie. w(2) = 69, w'(2) = 133 and 
w" (2) = 236. 
Example 2. For the polynomial of degree n = 7: w(x) = 
at7—T7TXeT5+6Xet4+4xXzr7F 3-27 2- 
2xXa2—9, k=7, 20 = 2 and b =false the following vector r 
was obtained: 15, 122, 279, 332, 216, 77, 14, 1, ji.e., the given 
polynomial can be expressed in the form: w(x) = 15 + 122 X 
(a—2) + 279 X (x—-2) T 2+ 332 X (@—2) T 3 + 216 X (x—2) 
tT 4+77 X (c—2) f 5+14X @—2) fT 64+ 7) T 7; 


begin 
integer i,j, 1; real rr; 
rr := a[0]; 
for 7 := 0 step 1 until k do 
r[z] := rr; 
for j := 1 step 1 until n do 
begin 


—r[0] := r[0] X 20 + aly]; 
l:= ifn —j>kthenk else n — j; 


for 7 := 1 step 1 until 1 do 
r[t] := r[t] X 20 + r[¢—1] 
end; 
if b then 


begin 
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ALGORITHM 338 
ALGOL PROCEDURES OR THE FAST FOURIER 
TRANSFORM [C6] 
Ricuarp C. SINGLETON* 

(Reed. 21 Nov. 1966, 2 Aug. 1967 and 18 July 1968) 
Stanford Research Institute, Menlo Park, CA 94025 


KEY WORDS AND PHRASES: fast Fourier transform, complex 
Fourier transform, multivariate Fourier transform, Fourier 
series, harmonic analysis, spectral analysis, orthogonal poly- 
nomials, orthogonal transformation, virtual core memory, per- 
mutation 

CR CATEGORIES: 3.15, 3.88, 5.12, 5.14 


The following procedures are based on the Cooley-Tukey algo- 
rithm [1] for computing the finite Fourier transform of a complex 
data vector; the dimension of the data vector is assumed here to 
be a power of two. Procedure COMPLEX TRANSFORM computes 
either the complex Fourier transform or its inverse. Procedure 
REALTRANSFORM computes either the Fourier coefficients of a 
sequence of real data points or evaluates a Fourier series with 
given cosine and sine coefficients. The number of arithmetic opera- 
tions for either procedure is proportional to n logs n, where n is 
the number of data points. 

Procedures FFT2, REVFFT2, REORDER, and REALTRAN are 
building blocks, and are used in the two complete procedures men- 
tioned above. The fast transform can be coraputed in a number of 
different ways, and these building block procedures were written 
so as to make practical the computing of large transforms on a sys- 
tem with virtual memory. Using a method proposed by Singleton 
[2], data is accessed in sub-sequences of consecutive array ele- 
ments, and as much computing as possible is done in one section 
of the data before moving on to another. Procedure FFT2 com- 
putes the Fourier transform of data in norrnal order, giving a re- 
sult in reverse binary order. Procedure REVFFT2 computes the 
Fourier transform of data in reverse binary order and leaves the 
result in normal binary order. Procedure REORDER permutes a 
complex vector from binary to reverse binary order or from reverse 
binary to binary order; this procedure also permutes real data in 
preparation for efficient use of the complex Fourier transform. 
Procedures FFT2, REVFFT2, and REORDER may also be used 
to compute multivariate Fourier transforms. The procedure 
REALTRAN is used to unscramble and combine the complex 
transforms of the even and odd numbered elements of a sequence 
of real data points. This procedure is not restricted to powers of 
two and can be used whenever the number of data points is even. 

REFERENCES: 

1. Coorey, J. W., and Tuxey, J. W. An algorithm for the 
machine calculation of complex Fourier series. Math. Com- 
put. 19, 90, (Apr. 1965), 297-301. 

2. SincueToN, R. C. On computing the fast Fourier transform. 
Comm, ACM 10 (Oct. 1967), 647-654; 


* This work was supported by Stanford Research Institute out of Research and 
Development funds. 


338-P 1- 0 


procedure COMPLEXTRANSFORM (A, B, m, inverse); 
value m, inverse; integer m; 
Boolean inverse; array A, B; 
comment Computes the Fourier transform of 2” complex data 
values. The arrays A{0: n—1] and B[O: n—1], where n = 2”, 
initially contain the real and imaginary components of the data, 
and on exit contain the corresponding Fourier coefficient values. 
If inverse is false, the Fourier transform 
1 n-l 
—= >) (ax + iby) exp (i2rjk/n) 
N k=0 
is computed. The transform followed by the inverse transform 
(or the inverse transform followed by the transform) gives an 
identity transformation. Procedures FFT2 and REORDER are 
used by this procedure and must also be declared; 
begin integer n, 7; real p, q; 
ni=2>m; p:= ¢:= 1.0/sqrt(n); 
if inverse then 
begin 
q:= —-4q; 
for j := n — 1 step —1 until 0 do Bij] := —Bfj] 
end; 
FFT2(A, B,n, m,n); REORDER(A, B, n,m, n, false); 
for 7 := n — 1 step —1 until 0 do 
begin A[j] := Alj] X p; Bly) := Bij] X q end 
end COMPLEXTRANSFORM; 


procedure REALTRANSFORM (A, B, m, inverse); 
value ™, inverse; integer m; 
Boolean inverse; array. A, B; 

comment Computes the finite Fourier transform of 27+! > 4 
real data points. If inverse is false, the arrays A[0: n] and 
B[O: nj, where n = 2”, initially contain the first 2” real data 
points % ,21,°** ,%n-1 a8 A[O],--- , A[n—1] and the remaining 
2” real data points @n ,%nz1,°°* ,Zen-1 a8 B[O], Bil], --- ,Bln—1]. 
On completion of the transform the arrays A and B contain 
respectively the Fourier cosine and sine coefficients a, and by, , 
computed according to the relations 


2n—-1 


a, =- >> x; cos (ajk/n) for k =0,1,-:-,n, 
nm ym0 
and 
1 2n—-1 
b, = - D) ajsin (wjk/n) for k=0,1,-++,n. 
nN k=0 


If inverse is true, the arrays A and B initially contain n + 1 
cosine coefficients a) , a1, -** , dn and n +1 sine coefficients 
bo , 61 , +++ , bn , where by = 6, = 0. The procedure evaluates the 
corresponding time series Xo ,%1,°** , Z2n-1, Where 

n-1 

a n 

Xj = . + >> [ax cos (xjk/n) + by sin (xjk/n)] + . cos (77), 

k=1 
and leaves the first n values as A[0], A{l], --- , A[mn—1] and the 
remaining n values as B[0], B[1], --- , B[n—1]. The procedures 
FFT2, REVFFT2, REORDER, and REALTRAN are used by 
this procedure, and must also be declared; 
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begin integer n, 7; real p; 


ni=2 Tm; 
if inverse then 
begin 
REALTRAN(A, B, n, true); 
for 7 := n — 1 step —1 until 0 do Bij) := — Bly); 


FFT2(A, B,n, m,n); 
for j := n — 1 step —1 until 0 do 


begin A[j] := 0.5 X Alj]; Bly] := —0.5 X B[j] end; 
REORDER(A, B,n, m, n, true) 
end 
else 
begin 


REORDER(A, B, n,m, n, true); 

REVFFT2(A, B,n,m,1); p:= 0.5/n; 

for j := n — 1 step —1 until 0 do 

begin A[j] := p X Aly]; Bij] := p X Bly] end; 

REALTRAN (A, B, n, false) 
end 
end REALTRANSFORM; 
procedure FFT2(A, B,n,m,ks); value n, m, ks; 
integer n, m,ks; array A, B; 
comment Computes the fast Fourier transform for one variable 
of dimension 2” in a multivariate transform. n is the number of 
data points, i.e.,n = m1 X m2 X +++ X np for a p-variate trans- 
form, and ks = ng X nny X +++ KX Mp, where ny, = 2” is the 
dimension of the current variable. Arrays A[0 :n—1] and 
B(0: n—1] originally contain the real and imaginary components 
of the data in normal order. Multivariate data is stored accord- 
ing to the usual convention, e.g., @;<: is in A[j7Xn2XnstkXn3tl] 
for j=0, 1, °¢--,m—1, k=0,1, +--+ ,nm2.-—1, andl=0, 
1, --- , 73 — 1. On exit, the real and imaginary components of 
the resulting Fourier coefficients for the current'variable are in 
reverse binary order. Continuing the above example, if the 
“ceolumn’’ variable nz is the current one, column 


k= Km—12"—* + Kkn—22”? + cS + ky12 + ko 
is permuted to position 
koa} + ky2m-2 + pees + Kim—22 + kin—1 . 


A separate procedure may be used to permute the results to 
normal order between transform steps or all at once at the end. 
If n = ks = 2”, the single-variate transform 

n~1 


(23 + ty;) = DS (ae + iby) exp (2rjk/n) 
k=0 


for j = 0, -:- ,m — 1 is computed, where (a + 7b) represent the 
initial values and (x + ty) represent the transformed values; 
begin integer k0, k1, k2, k3, span, j, 77, k, kb, kn, mm, mk; 
real rad, cl, c2, c8, sl, s2, 88, ck, sk, sq; 
real AO, Al, A2, A3, BO, B1, B2, B3; 
integer array C[0: m]; 
sq := 0.707106781187 ; 
sk := 0.382683432366 ; 
ck := 0.92387953251 ; 
C{m] := ks; mm := (m+2) XK 2; kn := 0; 
for k := m — 1 step —1 until 0 do C[k] := C[k+1] + 2; 
rad := 6,28318530718/(C[O]Xks); mk := m — 5; 
L: kb:=kn; kn := kn + ks; 
if mm ~ m then 
begin 
k2 := kn; k0 := Clmm] + kb; 
£2: k2:=k2—1; kO:=kO—-1; 
AO := A[k2]; BO := B[k2]; 
A[k2] := A[k0] — AO; A[kO] := A[kO] + AO; 
Bik2] := B[k0] — BO; B[k0] := B[kO] + BO; 
if k0 > kb then go to L2 
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end; 

cl := 1.0; sl :=0; 

jj := 0; k:= mm —2; 7 := 38; 

if k 2 0 then go to L4 else go to LG; 


13: if Clj] < jj then 


begin 
i 77 Cl; gy 1; 
if Clj] S jj then 
begin 
g:= 97 - Cll; gre j—-l; kim kh + 2; 
go to L3 
end 
end; 
jo = Cl + 99; J = 3; 


TA: span := Clk}; 


if 77 ~ 0 then 
begin 
c2 := 97 X span X rad; cl := cos(c2); sl := sin(c2); 


L5: c2:=cl f 2—sl1 f 2; s2:= 2.00 X cl X si; 


c3 := c2 X cl — s2X& sl; s8:= c2X sl +s2X cl 
end; 
for k0 := kb + span — 1 step —1 until kb do 
begin 
kl := kO+ span; k2:= kl + span; k3 := k2 + span; 
AO := A[k0]; BO := B[k0]; 
if sl = 0 then 


begin 
Al := A[k1]; Bl := B[k1]; 
A2 := A[k2];  B2 := B[k2]; 
A3 := A[k3]; B3 := Blk3] 
end 
else 
begin 
Al := A[kl] * cl — B[k1] X s1; 
Bl := A[k1] & sl + B[kl1] X cl; 


A2:= A[k2] X c2 — Bl[k2] X s2; 
B2 := A[k2] X s2 + B[k2] X 2; 
A3 := A[k3] X c3 — B[k3] X 83; 
B3 := A[k3] X s3 + Blk3] XK 3 
end; 
A[k0] := AO+ A2+ 414 A3; B[kO}] := BO+ B2+ Bl + B3; 


Alk1]:= AO+A2— A1~— A8; Bilkl] := BO+ B2— Bl — Bo; 
A[k2]:= AO— A2— B1+ B38; B[k2]:= BO- B2+ Al — A3; 
A(k3]:= AO — A2+ Bil — B38; B[k38):= BO-— B2— A1l1+ A3 
end; 
if k.> 0 then begin k := k — 2; goto [4 end; 
kb := k3 + span; 
if kb < kn then 
begin 
if 7 = O then begin k := 2; j:= mk; goto L3 end; 
j:= 97-1; c2:= cl; 
if 7 = 1 then 
begin cl :=cl X ck +81 X sk; sl :=s1X ck —c2 X skend 
else begin cl := (cl—sl) X sq; sl := (c2 + 81) X sq end; 
go to L5 
end; 


L6: if kn <n then go to L 
end FFT2; 


procedure REVFFT2(A, B,n,m,ks); valuen, m, ks; 


integer n,m, ks; array A, B; 


comment Computes the fast Fourier transform for one variable 


of dimension 2” in a multivariate transform. n is the number of 
data points, i.e.,n = mi X no X +--+ X np for a p-variate trans- 
form, and ks = nisi ®% nepe KX +++ K Np, where nz = 2” is the 
dimension of the current variable. Arrays A[0:n—1] and 
B(0: n—1) originally contain the real and imaginary components 
of the data with the indices of each variable in reverse binary 
order, e.g., @jx1 18 in Alj/Xn2Xns+k’Xnzt+l’] for j = 0,1, ---, 
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my —1,k =0, 1, ---,n.—1, andl =0, --- , ns; —1, where 
j’, k’, and l’ are the bit-reversed values of 7, k, and 1. On comple- 
tion of the multivariate transform, the real and imaginary com- 
ponents of the resulting Fourier coefficients are in A and B in 
normal order. If n = 2” and ks = 1, a single-variate transform 
is computed; 

begin 
integer k0, kl, k2,k3, k4, span, nn, j, jj, k, ib, nt, kn, mk; 
real rad, cl, c2, c3, sl, s2, 83, ck, sk, sq; 
real AO, Al, A2, A3, BO, B1, B2, B83, re, 2m; 
integer array C[0: m|]; 
sq := 0.707106781187; 


sk := 0.3882683432366; 
ck := 0.92387953251 ; 
C(O] := ks; kn :=0; k4:= 4X ks; mk :=m— 4; 
for k := 1 step 1 until m do C[k] := ks := ks + ks: 


rad := 3.14159265359/(C[0|Xks) ; 
L: kb:= kn +kh4; kn :=kn + ks; 
if m = 1 then go to L5; 
:= 97 := 0; gp := mk; nt := 3; 
1:= 1.0; sl :=0; 
£2: span := C{kJ; 
if jj = 0 then 
begin 
c2 := 97 X span X rad; cl := cos(c2); sl := sin(c2); 
Ls: c2:= cl f 2— sl f 2; s2:= 2.0 cl X sl; 
638 := c2 X cl — s2X sl; s8:= c2 X sit+s2xX el 
end else sl := 0; 
k3 := kb — span; 


L4: k2:= k38 — span; kl := k2 — span; kO := kl — span; 
AO := AfkO0]; BO := B[RO0]; 
Al:= Af{kl]; Bl := B[k1]; 
A2:= A[k2]; B2 := B[k2]; 
A3 := A[k3]; B3 := B[k3]; 


A[k0] := AO+ Al+ A2+4 A3; 
if sl = 0 then 
begin 
A[klj := AO — Al— B2+ B3; Bfkl]:= BO— B1+ A2— A3; 
A[k2]:= AO+ Al— A2— A3; B[k2]:= BO+ Bl — B2- B83; 
A[k3] := AO — Al + B2 — B3; _Bik3] := BO— Bl — A2+ A3 
end 
else 
ssh 
= AO — Al — B2 + B3: mM i= 
"Albel -= rexXcl—imX si; Bik]: fae) inhi SC che 
e:= AO0O+ Al — A2— A3; im:= BO+ Bl — B2 — BB; 
A(k2] := re X c2 — im * 82; B[k2] := re X s2 + im X c2; 
e:= AO — Al + B2 — B38; im:= BO — Bl — A2+4+ AB; 
A[k8] := re * c38 — im X& 83; Bik38] := re X s838 + im X c3 
end; 
k3 := kb +1; 
nt := nt —1; 
if nt = 0 then 
begin | 
c2 := cl; 
if nf = 1 then 
begin cl := cl X ch + sl X& sk; sl := 81 X ck —c2 X skend 
else begin cl := (cl—sl) X sq; sl := (c2+sl) X sq end; 


Bl(k0] := BO+ B1+ B2 + Ba; 


BO — Bl+ A2— AB; 


if k3 < kb then go to L4; 


kb := kb + k4; if kb S kn then goto L3 else go to L5 
end; 
if nt = —1 then begin k := 2; goto L2 end; 
if Clj] S j7 then 
begin 


i= 97 - Cl; G= 9-1; 
if Clj] S jj then 
begin jj := jj — Cljl} g:= 3-1; b:=k+2 end 
else begin jj := Clj] + 77; j := mk end 
end 


else begin 77 := C[j] + 37; j := mk end; 
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if 7 < mk then goto L2; k:=0; nt := 3; 
kb := kb + k4; if kb S kn then go to L2; 
L5: k := (m+2) X 2; 
if k #~m then 
begin 
k2 := kn; kO:= j := kn — C[k]; 
6: k2:= k2—1; kO:=kO—-1; 
AO := A[k2]; BO := B[k2]; 
A[k2] := A[kO0] — AO; A{kO] := A[kO] + AO; 


Blk2] := B(kO}] — BO; B[kO] := 
if k2 > 7 then go to L6 
end; 
if kn <n then goto L 
end REVFFT2, 


B{kO] + BO; 


procedure REORDER(A, B, n,m, ks, reel); 
value n,m,ks, reel; integer n, m, ks; 
Boolean reel; array A, B; — 
comment Permutes data from normal to reverse binary order 
or from reverse binary to normai order. If reel is false, data for 
one variate of dimension 2” in a multivariate data set of size n 
is permuted. In a p-variate transform with n = n: X nm X 
- X np, ks has the value ks = ni X ney KX +++ KX Np, where 
ne = 2” is the dimension of the current variable. For a single- 
variate transform, n = ks = 2”. If reel is true, A[2X7+1] and 
B(2Xj] are exchanged for j = 0,1, ---, (w—2)/2, then adjacent 
pairs of entries in A and B are permuted to reverse-binary order. 
This option is used when transforming 2n real data values, with 
the first n stored in A and the second n in B. After permutation, 
the even-numbered entries are in A and the odd- numberd entries 
are in B, each in reverse-binary order. 
Calling REORDER twice with the same parameter values gives 
an identity transformation; 


begin integer 2, j, jj, k, kk, kb, k2, ku, lim, p; 


real (; 
integer array C, LST|0: ml]; 
Clm| := ks; 


for k := m step —1 until 1 do C{k — 1] := Clk] + 2: 
pi=j:=m—1; t:=kb:=0; 
if reel then 
begin 

ku := nn — 2; 

for k := 0 step 2 until ku do 
Afk +1]; Afk+1]:= Blk]; Blk] := tend 


begin / := 
end else m := m—1; 
lim := (m+ 2) +2; if p S Othen go to L4; 
L: ku := k2 := Clj] + kb; gj := Clm — J]; ae = kb + jj; 
22: k:= kk + 97; 
L3: t:= A[kk]; Alkk] := A[k2]; A[k2] := 


= Blkk]; Blkk] := Blk2]; Blk2] := 
kk := kk +1; k2:= k2+1; 
if kk < k then go to L3; 
kk := kk + 97; k2 := k2 + 9); 
if kk < ku then go to L2; 
if 7 > lim then 


begin 
gi=egj-1; t:=7t4+1; 
LST] := 7; goto L 
end; 
kb := k2; 
if 7 > 0 then 


go to L end; 
p; go to L end; 


begin j := LST]; «:=2-1; 
if kb < n then begin 7 := 


LA: 
end REORDER; 


procedure REALTRAN (A, B, n, evaluate); 


value n, evaluate; integer n; 
Boolean evaluate; array A, B; 
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comment If evaluate is false, this procedure unscrambles the 
single-variate complex transform of the n even-numbered and 
n-odd-numbered elements of a real sequence of length 2n, where 
the even-numbered elements were originally in A and the odd- 
numbered elements in B. Then it combines the two real trans- 
forms to give the Fourier cosine coefficients A[0], A[1],--- , A[n] 
and sine coefficients B[0], B[1], --- , Bln] for the full sequence 
of 2n elements. If evaluate is true, the process is reversed, and a 
set of Fourier cosine and sine coefficients is made ready for 
evaluation of the corresponding Fourier series by means of 
the inverse complex transform. Going in either direction, 
REALTRAN scales by a factor of two, which should be taken 
into account in determining the appropriate’ overall scaling; 
begin integer k, nk, nh; 
real aa, ab, ba, bb, re, im, ck, sk, de, ds, 7; 
nh :=n + 2; 7 := 3,14159265359/n; 
ds := sin(r); r:= —(2Xsin(0.5Xr)) T 2; 
de := —0.5X* r; ckh:= 1.0; sk :=0; 
if evaluate then . 
begin ck := —1.0; dc := —dc end 
else begin A[n] := A[0]; Bln] := B[0] end; | 
for k := 0 step | until nh do 
begin 
nk :=n—k; 
aa := A[k] + A[nk]; ab := A[k] — Al[nk]; 
ba := Bik] + B[nk]; 6b := Blk] — Blnk]; 
re:= ck X ba + sk X ab; im := sk X ba — ck X ab; 
B[nk} := im — 6b; Blk] := im + bd; 
A[nk] := aa ~ re; A[k] := aa-+ re; 
de:=rXck+de; ck := ck+de; 
ds :=rX sk+ds; sk := sk+ds 
end 


end REALTRAN 
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procedure FFT (A, B, n, nv, ks); 
integer n, nv, ks; array A, B; 

comment This procedure computes the finite Fourier transform 
for one variate of dimension nv within a multivariate transform 
of m complex data values. The real and imaginary components 


value n, nv, ks; 


*This research was supported by Stanford Research Institute out of Research 
and Development funds. 

of the data are stored in arrays A[0:n—1] and B[0: n—1], follow- 
ing the usual arrangement for indexing multivariate data in 
a single-dimensional array, e.g., aja is stored in location 
AljXnoXnstkXnst+]j for j = 0,1,---,m—-1, k=0,1,---, 
n,—1, andl =0,1, --: , m3 — 1. The value of ks for the kth 
variate of a p-variate transform is 


ks = ne X egy X +++ K Np 


where nv = nx and n = m1 X nz X «++ X np. On completion of 
the transform, the real and imaginary components of the result- 
ing Fourier coefficients are in A and B respectively. For a single 
variable, n = nv = ks, and the transform 


nl 
» (ax + iby) exp (i2rjk/n) 
=) 

is computed for 7 = 0,1,-:: ,n—1. 

For a single-variate transform of 2n real-valued points, the 
amount of computing can be reduced by approximately one-half 
by using procedure REALTRAN (3] together with FFT. The 
even-numbered data points are stored initially in array A, the 
odd-numbered data points in array B, the transform is computed 
with 


FFT(A, B,n, n,n), 
and the result is unscrambled with 


REALTRAN (A, B, n, false) 


and then scaled by 1/2n to give the cosine coefficients as A(0], 
A{l], --- , Aln] and the sine coefficients as B[1], B[2], --- , 
Bin—1], with B[0] = Bin) = 0. The inverse operation, evaluat- 
ing the Fourier series with cosine coefficients A and sine coeffi- 
cients B, is computed by 


REALTRAN(A, B, n, true) 
followed by 
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FFT(A, B, Nn, nN; n), 


then scaling by 1/2, yielding the even-numbered time domain 
values in array A and the odd-numbered values in array B. 
Note that the upper bounds of array A and B must be increased 
to m when procedure REALTRAN is used. 

The method is based on an algorithm due to Cooley and 
Tukey [1], with modifications proposed by Singleton [2], to 
allow computing of large transforms on a system with virtual 
memory. The dimension nv is first decomposed into its prime 
factors nu, Nv2, °° , NVm , and then nv/nv; transforms of di- 
mension nv; are computed for 7 = 1, 2, --- , m. The resulting 
transformed values are then permuted to normal order in a final 
step. Computing times, to a first approximation, should be 
proportional to n(nvitnoe+-+: + nv,). The dimension of array 
FACTOR must be increased if nv has more than 20 factors. 

In factoring nv at the beginning of the procedure, factors that 
are squares of primes are first removed, then the square-free 
portion is factored. The two factors of each square are placed 
symmetrically about the square-free factors. For example, 
nv = 72 is factored as 2 X 3 X 2 X 3 X 2. This arrangement is 
used to simplify the final reordering in place. One symmetric 
permutation step is done for each square factor, and the reorder- 
ing is completed by following the permutation cycles of the 
square-free portion. 

In the transform phase of the procedure, special coding for 
factors of 2 and 3 is included for efficiency. Adjacent factors of 
2 are also paired, and the results stored as for factors of 2 rather 
than 4. The remaining factors are handled by an odd-factor 
routine, using trigonometric function symmetries and smaller 
real transforms to reduce the number of multiplications by one- 
half as compared with a straightforward complex transform of 
an odd factor. The approximate number of complex multiplica- 
tions is n/2 for a factor of 2, 3n/4 for a factor of 4, and 
(p—1)(p+3)n/4p for an odd factor p. 

In both the transform and reordering phases, data is accessed 
in subsequences of consecutive array elements, and as much 
computing as possible is done in one section of the data before 
moving on to another. This is done to reduce the number of 
memory overlay operations in a system with virtual memory. 
After the first transform or symmetric permutation step, the 
remaining steps can be performed independently on each of 
nv; spans of data. We complete all remaining steps on the first 
span before beginning with the second. Similarly, after the 
second step the first span is subdivided in nv, independent spans. 
This subdivision process is continued through the remaining 
steps. 

A number of working storage arrays are declared within this 
procedure. For large n, the total working storage is small in 
comparison with the 2n locations for data arrays A and B, ex- 
cept in a couple of cases. In the transform phase, approximately 
6q working storage locations are used, where q is the largest 
prime factor in the transform. This requirement is minor except 
in a single-variate transform with n a prime number. During the 
reordering phase, the worst case occurs when doing a single- 
variate transform with n a product of two or more primes with 
no square factors. In this case, approximately n working storage 
locations are required. 

This program was tested on the Burroughs B5500 computer 
and compared with another program computing a single n-by-n 
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complex Fourier transform. Whenever n had two or more prime 
factors, procedure FFT was much faster. The B5500 ALGOL 
system limits single-dimension arrays to 1023 words, but larger 
transforms can be computed by declaring 


array A, B[O: (n—1) + 512, 0: 511], 


storing the data 512 entries per row, and using partial word 

indexing A[J.[380:9], J.[89:9]] instead of A[J] wherever A and B 

appear in procedure FFT. 
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begin integer array FACTOR(0: 20]; Boolean zero; 
real AO, Al, A2, A3, BO, B1, B2, B38, cm, sm, 
cl, c2, c8, sl, s2, s8, c30, rad; 
integer k0, kl, k2, k3, 9k, kf, kh, 7f, mm, 
4,9, J), k, kb, m, span, kt, kn; 

comment Determine the square factors of nv; 

k:= nv; m:=0; jg := 2; 9 := 4; Of = 0; 

FACTOR [0] : =1; 

L: fort := k + jj whilez X jj = k do 
begin m := m+ 1; FACTOR(m] := j; k:=iend; 

if 7 = 2then 7] := 3elsej :=j + 2; 

:=39X5; if77 Skthen gotoL; kt :=m; 

comment Determine the remaining factors of nv; 

for j := 2,3 step 2 until k do 

fori :=k + jwhilet Xj =k do 

begin m :=m-+1; FACTOR[m] := 7; k:= tend; 

if FACTOR|kt] > FACTOR([m] then k := FACTOR{[kt| 

else k := FACTOR([m]); 

for 7 := kt step —1 until 1 do 

begin m := m+1; FACTOR[m] := FACTOR[j] end; 
begin integer array C,D[0: ml]; 
begin array CK, SK, CF, SF[0:k—1]; 
array AP, BP, AM, BM[0:(k—1) +2]; 
array RD, CC, SS[0:m]; 
Booiean array BB[0:m+1]; 
rad := 6.28318530718; ¢30 := 0.866025403784; 
for 7 := m step —1 until 2 do 
begin 
BB{j] := (FACTOR{j—1]+F ACTOR[j]) = 4; 
if BB [j] then 
begin j := j —1; BB{j] := false end 
end; _ 
BB{m+1] := BB[i] := false; 
ClO] := ks + nv; kn := 0; D[0] := ks; 
for j := 1 step 1 until m do . 
begin 
k := FACTOR[j); Cly] := Clj-1] X k; 
D{j] := Dij-1] + hk; RD{j] := rad/C{y]; 


cl := rad/k; 
if k > 2 then ; 
begin CC[j] := cos(cl); SS[j] := sin(cl) end 
end; 
mm := if BB[m] then m—1 else m; 
if mm > 1 then 
begin 
sm := C[lmm—2] * RD[m]); 
cm := cos(sm); sm := sin(sm) 
end; 
Ti: kb := kn; kn := kn +ks; 77 := 0; @ := 1; 
cl := 1.0; sl := 0; zero := true; 


L2: 


D3: 
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if BB(1+1] then 


begin kf := 4; «+1:=1+1end 
else kf := FACTOR{t]; 
span := Dit]; 
if — zero then 
begin 
sl := jj X RD{t]; cl := cos(sl); sl := sin(sl) 
end; 
comment Factors of 2, 3, and 4 are handled 
separately to gain efficiency; 
if kf = 4 then 


begin 
if — zero then 
begin 
c2:=cl f 2-— sl f 2; s2:=2.0X cl X s1; 
c38 := c2 X cl — s2X sl; 88 := c2 X sl + 82 X cl 
end; 


for kO0 := kb + span —1 step —1 until kb do 
begin 


kl := kO+ span; k2:= kl + span; k3 := k2 + span; 


AO := A[k0]; BO := B[k0]; 
if zero then 


begin 
Al := A[k1]; Bl := B[k1]; 
A2 := A[k2]; B2 := B[k2]; 
A3 := A[k3]; B3 := B[k3] 
end 
else 
begin 
Al := A[k1] X cl — B[k1] X s1; 
Bi := Afki1] * sl + B[ki] X cl; 


A2 := A{k2] KX c2 — B[k2] X 82; 

B2 := A[k2] X s2 + B[k2] X c2; 

A3 := A[k3] X c3 — B[k3] & 38; 

B3 := A[k3] X s3 + B[k38] K ¢3 
end; 


A[k0] := AO + A2+ Al + A8; B[kO] := BO+ B2+ 


Bl + B38; 
A[k1] := AO + A2— Al — A3; B{k1] := BO+ B2—- 
Bi — Bs; 
A[k2] := AO — A2 — Bl + B3; B[k2] := BO — B2+ 
Al — A3; 
A[k3] := AO — A2+ Bl — B83; Blk3] := BO — B2— 
Al + A3 
end 
end 
else if kf = 3 then 
begin 


if — zero then 


0 


begin c2 := cl f 2— sl f 2; s2:=2.0X cl X slend; 


for k0 := kb + span — 1 step —1 until kb do 
begin 
kl := kO+ span; k2 := kl + span; 
AO := A[k0]; BO := B[k0]; 
if zero then 
begin 
Al: 
A2: 
end 
else 
begin 
Al := A[kl] X cl — B[k1] X s1; 
Bl := A[kl}) X sl + Bfk1] X cl; 
A2 := A[k2] K c2 — B[k2] X s2; 
B2 := A[k2] XK s2 + B{k2] X c2 


A[kl]; Bl: 
A{k2]; B2: 


Bk]; 
B[k2] 


iil 


end; 
A[k0] := AO + Al + A2; B[kO] := BO + Bl + B2 
A0 := — 0.5 X (A1+A2) + AO; Al := (A1—A2) X 


c30; 


’ 


COLLECTED ALGORITHMS (cont.) 


BO := — 0.5 X (B1+B2) + BO; #81 := (BI—B2) x 
c30; ’ 
A[kll := AO — Bl; Bfkl] := BO+ Al; 
A[{k2] := AO+ Bl; B[k2] := BO — Al 
end 
end 
else if kf = 2 then 
begin 
kO := kb + span; k2 := kO0 + span; 
if zero then 
begin 
for k0 := k0 — 1 while kO = kb do 
begin 


~k2:=k2-—1; AO:= 


A[k2] : 


end 
end 
else 


for k0 := k0 — 1 while k0 


begin 
k2 : 
AO: 


k2 — 1; 
A[k2] X cl — 


= A[k0] — AO; A[kO] :== 
Blk2] := BlkO] — Bo; 


A[k2]; BO := B{k2]; 
A[k0] + AO; 
B[k0] := B[k0] + BO 


= kb do 


Bik2] x 81; 


BO: ae) X sl + Bik2] X cl; 


A (k2} 
Blk2] : 


= A[k0] — AO; 


kO := kb + span; 
if — zero then 


= Blk0] — BO; B[k0] := 


A[k0] := A[k0] + AO; 


B[kO] + BO 


= jk +2; 3 := D [i—1]; 


begin 
k:= jk —1; CF(1) := cl; SF{1] := s1; 
for j := 1 step 1 until k do 
begin 
CF(j+1] := CF{j] x cl — SF[j] K s1; 
SFij+1] := CF(j] X sl + SF[j] x cl 
end 
end; 
if kf + jf then 
begin 
CK [jk] := CK{1] := c2 := CC{[t];| 
SK[1] := s2 := SS[t]; SK{jk] := —s2; 
for j := 1 step 1 until kh do 
begin 
ko:= gk — 9; 
CK{k] := CK{j+1] := CK[j] X c2 — SK[j] X 82; 
SK{j+1] := CK{(j] X s2 + SK[j] * 2; 
K(k] := —SKij-+1) 
end 
end; 
LA: mace meu ee k2 := kO + ka; 
A3 := AO := A[k0]; B3 := BO := B[k0]; 


for j := 1 step 1 until kh do 


kl := kl + span; k2 := 


if zero then 


begin 
Al := Afkl]; Bl := 
A2 := A[k2]; B2:= 
end 
else 
begin 
= kf — j; 


Al := A[kl] X CF{[j] — 


k2 — spon; 
Eb: 
Bik]; IB: 
Blk2] Li: 


B(kl] X SF{j]; 
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B1 := A[kl] X SF[j] + Blk1] x CFIJ]; 
A2 := A[k2] X CF[k] — B[k2] x SF[k]; 
B2 := A[k2] X SF[k] + B[k2] X CFIk] 
end; 
AP{j] := Al + A2; AM[j] := Al — A2; 
BP{j] := Bl + B2; BM{j] := Bl — B2; 
A3 := AP{j] + A3; B3 := BP[j] + B3 
end; 
A[k0] := A8; B{kO] := 
kl := kO; k2 := kO + k8; 
for j := 1 step 1 until kh do 
begin 
kl := kl + span; k2 := k2 — span; jk := j; 
Al := a. Bi := BO; A2:= B2 := 0; 
for k := 1 step 1 until kh do 
begin 


Al := AP{k] X CK{jk] + Al; 
A2 := AM[k] X SK[jk] + A2; 
B\ := BP\k] X CK{jk] + Bl; 
B2 := BM{k] X SK{jk) + B2; 


jk := jk +4; if jk = kf then jk := jk — bf 


end; 
A[kl] := Al — B2; A[k2] := Al + B2; 
B(k1) := B1 + A2; B[k2] := Bl — A2 
end; 
if k0 > kb then goto JA; jf := kf 
end; 
if 7 < mm then 
begin 7 :=71+1; goto L2 end; 
1:= mm; zero := false; 
kb := D{t — 1] + kb; 
if kb < kn then 
begin 
for jj := Cl{i—2] + jj while j7 2 C[i—1] do 
begin 7 :=7—1; jj := jj — C{t] end; 
if 7 = mm then 


begin 
c2 := cl; cl := cm X cl — sm X s1; 
sl := sm X c2 + cm X sl; goto L3 
end; 
if BB[t] then? := 7+ 1; goto L2 
end; 
if kn < n then goto [1 
end; 
t:= 1; 
for 7 := kt — 1 step —1 until 1 do 
begin 
FACTOR{[j] := FACTOR(|j] — 1; 7 := 
end; 


comment We now permute the result to normal order; 


FACTOR{j] + 1 


0 


comment The following if statement does the complete re- 
ordering if the square-free portion of n has at most one 
prime factor. Otherwise it does a partial reordering, leaving 


each entry in its correct section of length n + c[kt], 
where c[ki] T 2 is the product of the square factors; 


if kt > 0 then 
begin rage array ze 2]; 


gj: 1; = kb: 

k3 : es = Dii] a ue jk := jj := Clj—-1); 

kO := kb + 9j; span := Clj] — Jj; 

k := k0 + 9; 

AO := A[k0]; A[kO] := A[k2]; A[k2] := AO; 
BO := Bik0]; Blk0) := Bik2]; B[k2] := BO; 


kO := kO+1; k2 := k2 +1; 

if k0O < k then go to L7; 

kO := kO0 + span; k2 := k2 + span; 
if kO < k3 then go to [4; 

if KO < (k3+span) then 


COLLECTED ALGORITHMS (cont.) 


begin k0 := k0 — D{j] + jj; go to L6 end; 


k3 := Dj] + k3; 

if (k83—kb) < D[j—1] then 

begin 
k2 := k3 + jk; gk := jk + 97; 
kO0 := k3 — D{j] + jk;. go to L6 

end; 

if 7 < kt then 

begin 


k := FACTOR[] +74; j:= 7 +1; 
L8: t:=t+1; Si] := 5; if7 ee ahed go to L8; 
go to L5 
end; 
kb := kd; 
if 7 > 0 then 
begin j := S[i]; 7:=71—1; goto L5 end; 
if kb < n then begin j := 1; go to L5 end 
ae 
= Clkt]; span := D{kt]; m := m — kt; 
kb := span + jk —2; 
comment . The following if statement completes the reorder- 
ing if the square-free portion of n has two or more prime 
factors; 
if kt < m — 1 then 
begin integer array R[0:k5]; 
array TA, TB[0:jk—1]; 


for j := ki step 1 until m do D[j] := D[j] + jk; 
ig 0; : 
for j := 1 step.1 until kb do 
begin 
k := kt; 
for 77 := D[k+1] + 77 while jj 2 D{k] do 
begin jj := jj — Dik]; k:=k+1 end; 
if jj = j then R[j] := — j else Rij] := 37 
end; 
comment Determine the permutation cycles of length 
a2; 
for j := 1 step 1 until kb do if R[j] > 0 then 
begin 
k2 := j; | 
for k2 := abs (R[k2]) while k2 # j do Flr := —R[k2] 
end; 
comment - Reorder A and B following the permutation 
ee 
kn := 74 := 7 := 0; 


LA: kb := ia: kn := kn + ks; 
LB: j:=j+1; if R{j] < 0 then go to LB; 
k := Rij]; kO:= jk Xk + kb; 
LC: TA{i] := A[k0+7]; TB{i] := BlkO+7]; 


i:=t+1; ifi < je then goto LC; 7 := 0; 
LD: k:= —R{k]; gj := O; kO:= gk Xk+ kb; 
LE: Aljj+i] := A[kO0+7]; Bljj+7] := BlkO+7];: 


i:=i+1; ifi < je then goto LE; 7 := 0; 
if k ~ j then go to LD; 

LF: A[kO+i] := TA[t]; B[kO+7] := TB{2]; 
i:=i+1; ifi < jkthen goto LF; 7 := 0; 


if 7 < k2 then goto LB; j := 0; 
b := kb + span; if kb < kn then go to LB; 
if kn <n then goto LA 
end 
end 


end FFT 
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REMARK ON ALGORITH 339 [C6] | 

AN ALGOL PROCEDURE FOR THE FAST FOURIER 

TRANSFORM WITH ARBITRARY FACTORS 
[Richard C. Singleton, Comm. ACM 11 (Nov. 1968), 
776] 

Ricuarp C. SINGLETON (Recd. 27 Nov. 1968) 

Stanford Research Institute, Menlo Park, CA 94025 


KEY WORDS AND PHRASES: fast Fourier transform, complex 
Fourier transform, multivariate Fourier transform, Fourier 
series, harmonic analysis, spectral analysis, orthogonal poly- 
nomials, orthogonal transformation, virtual core memory, 
permutation 

CR CATEGORIES. 3.15, 3.88, 5.12, 5.14 


On page 778, column 2, the 7th and 6th lines from the bottom 
should be corrected to read: 
LJ: jj += Cit—2) + jj; if 77 2 C[¢—1] then 
begin? :=7i—1; jj := jj — Cit]; goto LJ end; 
On page 779, column 1, the 9th and 8th lines from the bottom 
should be corrected to read: 
LX: jj := D{k+1) + jj; if jj 2 Dk] then 
begin jj := jj — Dik]; k:=k+1; goto LX end; 
In both cases jj was originally. used as the controlled variable of 
a for clause and thus was undefined after exit; the corrections 
preserve the value of jj for later use. 
If the user prefers to compute constants with library functions, 
line 5 in column 2 on page 777 may be replaced by: 
rad := 8.0 X arctan(1.0); c80 := sgrt(0.75); 
Algorithms 338 [Comm. ACM 11 (Nov. 1968), 773] and 339 were 
punched from the printed page and tested on the CDC 6400 


ALGOL compiler. After changing a colon to a semicolon at the end 


of line 37 in column 2 on page 775, the test results agreed with 
those obtained earlier with this compiler. 

When computing a single-variate Fourier transform of real 
data, procedure REALTRAN may be used with procedure FFT 
(Algorithm 339) to reduce computing time. Two versions of 
REALTRAN have been given (Algorithms 338 and 345 [Comm. 
ACM 12 (Mar. 1969), 179-184]); the first version is the faster of 
the two, but the second should be used if arithmetic results for 
real quantities are truncated rather than rounded. 

In describing the evaluation of a real Fourier series, in the 
middle of column 2 on page 776, the necessary steps of reversing 
the signs of the B array values both before and after calling FFT 
were omitted. The correct steps, including scaling, are as follows: 

REALTRAN(A B, n, true); 

for7 := n—1 step —1 until 0 do Bij] := —Bfy]; 

FFT (A, B,n, n,n); 

for j := n— 1 step —1 until 0 do 

begin A[j] := 0.5 X A[j]; BLj] := —0.5 X Bij] end; 


COLLECTED ALGORITHMS FROM CACM 
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ALGORITHM 340 
ROOTS OF POLYNOMIALS BY A ROOT-SQUARING 
AND RESULTANT ROUTINE [C2] 
ALBERT NOLTEMEIER 
(Reed. 2 Nov. 1967, 25 Jan. 1968 and 16 July 1968) 


2X7 = 1.000 000 0000 zs = —1.000 000 0000 
(ii) Po(z) = 2 + 7xt + 5a8 + 62? + 38x 4 2 
zi = —6.3509936102 
22,3 = 1.3506884657 &K 107° + 7 & 7.7014185283 XK 1071 
wa,5 = —4.5957204142 K 10°! + 2 X 5.5126354891 & 107! 
(iii) P3(x) = 28 — 225 + 224 + #3 + 62? — 62 + 8 


Technische Universitat. Hannover, Rechenzentrum, 212 = — 9.9999999974 x 107! + 7 X 1.0000000002 
Hannover, Germany aa =  4.9999999999 x 107! + 7 X 8.6602540377 X 107} 
; Zs, =  1.4999999997 -E i X 1.3228756548 


KEY WORDS AND PHRASES: rootfinders, roots of poly- 
nomial equations, polynomial zeros, root-squaring operations, 
Graeffe method, resultant procedure, subresultant procedure, 
testing of roots, acceptance criteria 

CR CATEGORIES: 5.15 

procedure AG4(n, c, mm, delta, epsilon, range) Result: (re, im, 
mu, rt, gc,m,1, t) Exit: (fail); 
value n,.mm, delta, epsilon, range; 
integer n,m,1,mm; real delta, epsilon, range; 
integer array mu; 
array c, re,im, rt, gc, t; 
label fail; 

comment AG4 finds simultaneously zeros of a polynomial of 
degree n with real coefficients by a root-squaring and resultant 
routine. 

This procedure supersedes Algorithm 59 [2]. The. following 


(iv) Pa(x) = 2? — 4.012 + 4.02 
The procedure fails to compute any zero in this case (parameter 
m = 0). After changing the parameter epsilon to 105, AG4 
evaluates the zero x = 2.0049937655 with multiplicity 2 and re- 
mainder term 2.5 X 10°; 
Parameters: 
n degree of the polynomial 
c real coefficients of the polynomial 
c[j](j=0,--+,n), where cin] is the constant term 
delta, epsilon parameters for acceptence criteria 
practical input delia = 0.2, epsilon = 10 fT (—7) 
range upper bound of the range of real constants 
(for the cDec 1604 -A range = 10 7 307) 
mm number of root-squaring iterations 
practical input mm = 10 


changes were made: re real part of each zero relj](j=1, ---, m) 
(a) In the procedure heading, the meaning of the old formal im imaginary part of each zero im[j](j=1, --- , m) 
parameter alpha is shared by the three new parameters mm, mu corresponding multiplicity mu[j]G=1, --- , m) 
delta, and epsilon, and range, m, 1, t, fail are added to the formal rt remainder term rt[j](j=1, ---, m) 
parameter list. gc coefficients of the polynomial generated from these zeros 
(b) In the beginning of the procedure body the polynomial is gclj|(j=0, +++ n—7) 


tested for 0 as a zero (label ZROTEST). Although the modulus 
p = 0 can be found by squaring operations, the procedure 
usually will not find the root 0 without that test. 

(c) In the program section labeled SQUARING OPERATION 
the iteratively squared coefficient is tested whether it will re- 
main in’ the allowed range of numbers (forntal parameter 
range) for a particular machine after another squaring opera- 
tion. 

(d) If there is a complex zero with a real part of 0, the resultant 


m number of distinct zeros found by the routine 

4 degree of the residual polynomial 

t coefficients of the residual polynomial 
t(j7](7=0, --- ,t), where ¢{z] is the constant term 

fail a zero with multiplicity greater than n found, change 
parameters for acceptance criteria. 


REFERENCES: 


1. Bareiss, E. H. Resultant procedure and the mechaniza- 
tion of the Graeffe process, J. ACM 7 (Oct, 1960), 346-386. 

2. Baretss, E. H. ann FisHeRKeELLER, M. A. Algorithm 59, 
Zeros of a real polynomial by resultant procedure, 
Comm. ACM 4 (May 1961), 236-237. 

3. THacner, H. C. Certification of algorithm 3, Comm. ACM 
3 (June 1960), 354. 

4. Grau, A. A. Algorithm 256, Modified Graeffe method, 


R(p) is a polynomial of degree n with the coefficients r,z1 = 
rn = 0. Computing the moduli of the zeros of this polynomial 
in the program section labeled SQUARING OPERATION 
and testing for pivotal coefficients, one would have to divide 
by 0. This case has been excluded by testing the divisor. 

(e) If the acceptance criteria epsilon and delia are chosen too 
large, the sum of the multiplicities of the already found zeros 


may be greater than the degree n of the polynomial. In the 

program sections labeled IT and D, the test for the degree of 

the residual polynomial, the number of zeros, and the sum of 
the multiplicities of zeros in order to end the procedure has 
been improved. 

Tests: The procedure AG4 has been tested on the CDC 
1604-A computer at the Rechenzentrum, Technische Universitat 
Hannover. The following results were obtained in a few repre- 
sentative cases. The parameters of acceptance criteria are 
delta = 0.2, epsilon = 107-7, and mm = 10. 

(i) Piz) = x28 — 302° + 27324 — 820z? + 576 
x1 = 4.000 000 0010 = x. = —4.000 000 0010 
ts = 2.999 999 9990 %4 = —2.999 999 9990 
Xs 2.000 000 0000 Xe = —2.000 000 0000 


Comm. ACM 8 (June 1965), 379; 
begin 
integer d,numzro; 
Boolean zero; 


numzro := 0; zero := false; d:=n; 
ZROTEST: 

if c[d] = 0 then 

begin 
zero := true; d:=d—1; numzro := numzro+1; 
go to ZROTEST 

end; 

begin 


integer ct, nu, nuc, beta, j, jc, k, p, em, 1, mme, ll, me, sm; 
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Boolean root; T1: rh[ct] := rp; 2 := rp + epstlon X rp; 

real z, y, gz, rp, h; y := « + epsilon X rp; 

array a, ac{0: d, 0: mm], rr, rc[0: d], s[—1: d], for k := 0 step 1 until d do [k] := abs(clk]); 
ag[0: d+1, —1: d+], rh, g, 9, f[1: 2 X dl]; flet] := synd(—y, 0.0, d, t) — synd(—z, 0.0, d, t); 

switch ss := S1, 82; g{ct] := synd(—rh{ct], 0.0, d, c); 

switch tf := T1, T2; if abs(f[ct]) > g[ct] then 

switch vv := V1, V2; begin 


root := true; g[ct] := 0; 


cL := ct +1; f{ct] := f[ct—1] 


integer procedure min(u, v); integer u, 0; 
min := if u S v then wu else 1; 


real procedure synd(ww, qq, 11, tt); a a Shee 
integer 77; real ww, qq; array tt; im ‘ 
get] := synd(—rh[ct], 0.0, d, c); 
a ec if abs(f[ct]) > glct] then 
begin 
s[—1] := 0; [0] °= «[0); root := true; g[ct] := 
for em := 1 step 1 until 77 do ct:= ct +1; fflet] := Paer 
s[em] := ttlem] — ww X s[em—1] — gq X slem—2]; end; 
if gq = 0 then synd := abs(s[ztz]) if nu = 1 then go to M2; 
nd a = abs(s[ti—1] X sqrt(abs(qq))). + abs(s[z7]) q(ct] := rp T 2; nuc:= nu; je := 79; 
end syn mmc := me; : 
= beta := 1; for j := 0 step | until d do 
Sau nine OPERATION: begin 
me := mm; re[j] := rr[j]; ae[j, me] := alj, me] 
begin end; 
for m := 1 step 1 until mm do RESULTANT: 
begin begin 
for j := 0 step 1 until d do array b[—1:d+1, —1:d+1], aa[0:d], 
begin r(O:d, O:d], cb[—1:d+1]; 
h:= 0; cb[—1] := cb[d+ 1] := 0 
gH ll := 1 step 1 until min(d—J,j) do ae 7 oe ae Pesilads 
=h+(=1) 1 WX alj—U, m—1) X alf-+Ul, m=; ee 
lim a = (-1) T 7X @fj,m—1] T 242 xh) for k := 0 step 1 until d do 
~ begin 
a 0 step 1 until d do blk, —1] := 0; bfk—1, k] := 
é for j := 0 step 1 until k do 
ee ee b(R+1, j] := bfk, j-1] — alet] X bIk—1, JI; 
egin me := m; goto end blk+1, k+1] := 1; h := 0; 
a for j := d — k step —1 until 0 do 
’ h :=h + (cblj] X eb[k+j] — eblj—-1] 
ene x eb{kt+j+1]) X gle] T @—k-9); 


Wi: aalk] := (-1) T kXh; 


for j := 0 step 1 until d do for j := 0 step 1 until k — 1 do 


rr[j) :=if alj, me] = 0 then 0 else r[k, j] := r{k—1, j] + aalk] x d[k, 9); 
(—1) fT 7 X alg, me—1] T 2/a[j, me); r{k, k] := aalk] 
ll := 0; i 
for 7 := d step —1 until 0 do ore 2; 
begin for j : =0s step | until d do 
if a[j, me] = 0 then alj, 0] := r{d, d—J]/r{d, d] 
begin tl := 11 +1; rrlj] := Ul end end; 
else go to W2 go to SQUARING OPERATION; 
end; T2: 
W2: if (rp/2) f 2 > q[ct] then go to M3; 
jJs=1; nu:=1; rh{ct] := rp; 
RD: gict] := synd(—rh{ct], g[ct], d, c); 
if (l—deltaSrr{j]) A (rr[j]S1+delia) then if abs(f[ct]) > g{ct] then 
begin begin 
rp := abs(alj, mel/alj—nu, me]) 7 (/(2T meXnu)); et := ct +1; filet] := flct—1] 
go to ti[beta] g{ct] := g[ct—1] 
end; end; 
M1: rh{ct] := —rp; 
nu:= nut; g{ct] := synd(—rh{ct], g[ct], d, c); 
M2: if abs(f[et]) > g[ct] then 
jr= jt; begin 
if 7 = d+ 1 then go to ss([beia] else go to RD; ct:= ct +1; flct] := flct—1]; 
M3: glet] := g[ct—1] 
nu:= 1; goto M2; end; 


go to M3; 


COLLECTED ALGORITHMS (cont.) 


S82: 
me := mmc; 
for j := 0 step 1 until d do 
begin 
alj, me] := aclj, me]; rr[j] := re[j] 
end; 
j:= je; beta := 1; 
if root then go to M3 else nu := nuc; 
go to M1; 
Si: 
for j := 0 step 1 until d do ag{j, 0] := 1; 
for 7 := —1, 1 step 1 until d do 
for m := 0 step 1 until d do 
ag[m, j] := 0; 
k:= 1; t:=d; m:= 1; i:=0; 
for j := 0 step 1 until d do ¢{j| := clj]; 


MULT: 

mu|[m] := 0; 

p := if g{k] = 0 then 1 else 2; 
IT: 


gz := synd(—rh{k], q[k], 2, t); 
if abs(f{[k]) > gx then 
begin 
W:= UW + p; 
for j := 1 step 1 until Il do 
ag(ll, j] := ag(ll—p, 7] — rh{k] X ag(ll—p, j—-1) + glk] x 
ag(ll—p, j—21; 
mu[m| := mulm| + p} t:= 14 — p; 
if 1 < 0 then go to fail; 
if 1 = 0 then go to Fl; 
for j := 0 step 1 until z do {[j] := s{j]; 


go to IT 
end 
else if mu[m] ~ 0 then 
El: 
begin 
ri[m] := g[k]; go to v[p]; 
end 
else go to D1; 
Vi: 
re[m] := rh[k]; tm[m] :=0; goto EF; 
V2; 


re[m] := rh{k]/2; 
im(m] := sgrt(q[k] — re[m] T 2); 


m:=m+i; 
Di: 
k:=k+1; 
sm := 0; 
if m #1 then 
for 7 := 1 step 1 until m — 1 do sm := sm + muf{j]; 
ifk sect A sm Sd /fi>Othen goto MULT; 
for 7 := 0 step | until d do gc[j] := ag{ll, 7]; 
m:=m—1; 
if zero then 
begin 
for j := d+ 1 step 1 until d + numzro do gc[j] := 0; 
m:=m+i; 
re(m| := 0; tm[m]:= 0; mul[m] := numzro; ri[m] := 0 
end 
end 
end AG4 
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REMARK ON ALGORITHM 340 [C2] 

ROOTS OF POLYNOMIALS BY A ROOT-SQUARING 
AND RESULTANT ROUTINE [Albert Noltemeier, 
Comm. ACM 11 (Nov. 1968), 779] 

ALBERT NOLTEMEIER (Recd. 6 Jan. 1969) 

Technische Universitat Hannover, Rechenzentrum, Han- 
nover, Germany 


KEY WORDS AND PHRASES: rootfinders, roots of polynomial 
equations, polynomial zeros, root-squaring operations, Graeffe 
method, resultant procedure, subresultant procedure, testing 


of roots, acceptance criteria 
CR CATEGORIES: 5.15 


The following misprints were found in the algorithm and should 
be corrected as indicated: 

1. In the comment, in the first column on page 780, the last line 
before the paragraph beginning with the word ‘‘Parameters’’ ends 
with a semicolon; it should end with a period. 

2. In the seventh line following the word ‘‘Parameters’’ the ab- 
breviation CDC should appear in capital letters. 

3. In the procedure body, in the second column on page 780, 
the line before the label SQUARING OPERATION is missing. It 
should read as follows: 

for j := Ostep 1 until d do a{j, 0] := cy]; 
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ALGORITHM 341 | 
SOLUTION OF LINEAR PROGRAMS IN 0-1 
VARIABLES BY IMPLICIT ENUMERATION [H] 
J. L. Byrne ann L. G. Prouy 
(Recd. 8 Nov. 1967 and 17 June 1968) 
Department of Mathematics, University of Southampton, 
Hampshire, England 


KEY WORDS AND PHRASES: 
variables, partial enumeration 
CR CATEGORIES: 5.41 


linear programming, zero-one 


procedure IMPLEN (m,n, A, x, api, nosoln, count, inf); 
value m,n, inf; integer m,n, count; real inf; 
Boolean api, nosoln; real array A; integer array 7; 
comment This procedure solves the integer linéar program, 
minimize  A(0, 1] X [1] + --- + A[0, n] X a[n] 
subject to A[z, 1] X x[1] +--+ + Aft, n] X a[n] 
+ Aft, 0] 20 (¢=1, 2, +++ , m) 
and xfj]=Oorl (j=1,2,---,n). 
It is assumed that A[0,j] 20 (j=1, 2, +++ ,). The algorithm 
used is that of Geoffrion (SIAM Rev. 9, No. 2). On entry, inf 
is the largest positive real number available and api is set to 
true if a priori information concerning the solution is supplied 
in the form of a binary vector z[1: n] and its! associated cost 
A[0, 0]. On exit nosoln is true if no feasible solution to the con- 
straints has been found, otherwise it is false and x contains the 
optimal solution, A[0, 0] contains the optimal value of the ob- 
jective function and A[z, 0] contains the values of the slack 
variables. In either case count contains the Number of iterations 
performed; 
begin 
integer 7,j,k,ta,e,d; realz,q,maz,r; Boolean null; 
integer array s, v[1: n]; 
comment _s holds the current partial solution in order of as- 
signment, v is a state vector associated with $3 
if api then 
begin 
for j := 1 step 1 until n do 
if x[j] = 0 then begin s[{j] := —j; 
else 
begin 
sj) = 4; oli] = 3; 
for 1 := 1 step 1 until m do 
Alt, O] := Alt, 0] + Ale, J] 
end; 
€:= 0; 
end; 


v[7] := 2 end 


z:= A[0, 0]; goto L0 
for j := 1 step 1 until n do s[j] := o[j] := 0; 
z2:= 0.0; e:=0; 
LO: nosoln := true; count :=0; A[0, 0] := inf; 
comment all relevant variables are now initialized; 
START: count := count + 1; 
for 1 := 1 step 1 until m do 
if A[z, 0] < 0.0 then go to FORMT; 
comment best completion of s is feasible; 
go to INCUMBENT; 
FORMT: null := true; 
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comment formset T of free variables to which 1 may be profit- 
ably assigned ; 
for j := 1 step 1 until n do 
begin 
if = (v[j] = 0 A Ald, j] + 2 < A[0, 0]) then go to LI; 
for k := 7 step 1 until m do 
if A[k, 0] < 0.0 A A[k, j] > 0.0 then 
begin null := false; v{j]:= 1; goto Ll end; 
D1: end; . 
if null then go to NEWS; 
comment if 7 is empty then s is fathomed; 
for k := i step 1 until m do 
begin 
if A[k, 0] = 0.0 then go to L2; 
q := Alk, 0]; 
for j := 1 step 1 until n do 
if v[fj] = 1A Alk, j] > 0.0 then gq := q+ Alk, J); 
if g < 0.0 then go to NEWS; 
comment if g is negative s is fathomed; 
L2: end; 
max := —inf; 
for j := 1 step | until n do 
begin 
if o[j] ~ 1 then goto L3; gq := 0.0; 
for 7 := 1 step 1 until m do 
begin 
ifr <00then g:=q+r 
end; 
if mac S q then 
begin max := q; d:=j end; 
D3: end; 
e:=e+1; sfle]:=d; vfd]:=3; wa :=1; 
comment Augments by assigning 1 to z[d]; 
RESET: forj := 1 step 1 until n do 
if v[j] = 1 then o[7] := 0; 
comment clear 7; 
for 7 := 1 step 1 until m do 
Aft, 0] := Af[z, 0] + 2a & Alfz,.d]; 
z:=z+%aX A[0, dj; 
comment Recalculate slacks and objective function; 
go to START; 
INCUMBENT: nosoln := false; 
if z 2 A[0, 0] then go to NEWS; 
A[0, 0] := 2; 
if apt then begin api := false; go to L4 end; 
for j := 1 step 1 until n do 
xz[j] := if v[7] = 3 then 1 else 0; 
NEWS: if e = 0 then go to RESULT; 
[4: d:= sf[e]; 
ifd > 0 then go to UNDERLINE; 
v[—d] := 0; e:=e¢e—1; comment backtrack; 
go to NEWS; 
UNDERLINE: s{e] := —d; 
comment Assign 0 to z[d]; 
go to RESET; 
RESULT: 


end 


vid} := 2; ia:= —1; 
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REMARIX ON ALGORITHM 341 (H] 

SOLUTION OF LINEAR PROGRAMS IN 0-1 

VARIABLES BY IMPLICIT ENUMERATION 
[J. L. Byrne and L. G. Proll, Comm. ACA 11 (Nov. 
1968), 782] 

L. G. Proun (Reed. 5 Dec. 1968 and 18 Aug. 1969) 

University of Southampton, Department of Mathematics, 
Hampshire, England 

KEY WORDS AND PHRASES: 


variables, partial enumeration 
CR CATEGORIES: 5.41 


linear programming, zcro-one 


The published algorithm contains an error in the assembly of 
the initial partial solution, s, if a priori information is given. In 
certain cases this can cause premature termination of the algo- 
rithm. The error may be corrected by replacing the following lines 
of the procedure body, from 

begin 

for j := 1 step 1 until n do 


to 
e:=n; 2:= Al0,0]; goto JA; 
by 
begin 
e:= 0; 
for j7 := 1 step 1 until n do 
if r[j] = O then v{j] := 0 
else 
begin 
e:= e+]; sfe] := 9; vf7| := 3; 
for 7 := lL step 1] until m do 
Afi, Ol = Ali, O] + Ali, A; 
end; 
z:= A[0, 0]; go to 10; 
and by deleting the line 
if api then begin api := false; go to L4 end; 
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SOLUTION OF LINEAR PROGRAMS IN 0-1 

VARIABLES BY IMPLICIT ENUMERATION 
[J. L. Bryne and L. G. Proll Comm. ACM 11 (Nov. 
1968), 782] 

M. M. Guienarp (Reed. 21 Mar. 1969 and 17 Nov. 1969) 

Laboratoire de Calcul, 13 Place Philippe Lebon, Lille, 
France 

KEY WORDS AND PHRASES: 


variables, partial enumeration 
CR CATEGORIES: 5.41 


linear programming, zero-one 


There is an error in the procedure; the slack variables are de- 
stroyed during computation. It is necessary then to declare an ar- 


ray slacks local to the procedure, and to return the final slacks in 
Aft, 0], @=1,2,+++,m, 


One could correct the program as follows. Add before second 
comment: 


real array slacks [1:m]; 


add before NEWS: 
for 7 := 1 step 1 until m do 
slacks [i] := Ald, 0]; 
add after RESULT: 
if — nosoln then 
for 7 := 1 step 1 until m do 


Alt, 0] := slacks |]; 
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ALGORITHM 342 
GENERATOR OF RANDOM NUM BERS SATIS- 
FYING THE POISSON DISTRIBUTION [G5] 
Ricuarp H. Snow (Recd. 20 Dec. 1966, 24 Aug. 1967, 

5 Feb. 1968, 26 Mar. 1968, 5 June 1968 and 9 Sept. 1968) 
I[T Research Institute, Chicago, Il. 60616. 


KEY WORDS AND PHRASES: Poisson distr ibution, random 
number generator, Monte Carlo 
CR CATEGORIES: 5.12, 5.5 


integer procedure poisson carlo (npx, npx1, random); value npz, 

random, real npx, npx1, random; 

comment The Poisson distribution gives the probability that 

px events will occur in a certain interval or volume, where the 
expected or mean value of events is npx. Applications are de~ 
scribed by B. W. Lindgren and G. W. McElrath [1]. For a Monte 
Carlo calculation we wish to generate numbers px that satisfy 
the Poisson distribution, that is to find the inverse of the Poisson 
function. To do this we generate a pseudo-random number in 
the interval 0, 1 and find the number pz such that random < 
(probability that the number is pz or less) and random > (the 
probability that the number is px — 1 or less). 

poisson carlo returns the value —1 to signal that the pro- 
cedure was called with a value of npx < 0 or too large for the 
precision of the computer. It is the responsibility of the user to 
test the calculated value if there is any possibility of the occur- 
rence of the error condition. 

In order to save computing time, values of the Poisson dis- 
tribution computed at a previous entry for the same value, of 
npx are stored in the own array pson. The prévious value of 
npx is nprl. The actual parameter corresponding to npx1 must 
be a real identifier, not a constant or an expression. Before the 
first call of poisson carlo the calling program must set npzl to a 
value ~ npx. The number of pson elements that were previously 
computed and stored is computed. If it is desired ‘to save storage 
space at the expense of computing time, the upper bound 84 of 
pson may be reduced, but then the limit of computed near the 
end of the procedure must also be decreased acc ordingly. 

The procedure which generates random is preferably algorithm 
266 [3] or 294 [2]. It can be called as the actual ac in the 
procedure call of pozsson carlo. 

The author thanks Mr. I. D. Hill for numerous suggestions 
and corrections which greatly improved the algorithm. 
REFERENCES: 

1. LinpGren, B. W., anp McEvratu, G. W. introduction to Prob- 
ability and Statistics; 2 ed. Macmillan, New York, 1966, pp. 
64-68. 

2. Pixe, M. C., ano Hint, I. D. Algorithm 266, pseudo-random 
numbers. Comm: ACM 8 (Oct. 1965), 605. | 

3. Srrome, W. M. Algorithm 294, uniform random. Comm. ACM 
10 (Jan. 1967), 40, 

begin 

own integer computed; own real pnc; 
own real array pson [0:84];. 

integer n; real ps; 

if npr < 0 then go to error; 

if npz # npxl then 

begin 
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computed := 0; 
pnce := pson [0] := exp (—npx); 
if pnc = 0 then go to error; 
comment pson [0] is the probability that poisson carlo = 0. 
It cannot be zero unless —npzx underflows the argument 
range of procedure exp. For most computers this sets an 
upper limit of 85 for npx; 
npxl := npr 
end new npz; 
ps := pson [computed]; 
if random S ps then 
begin 
integer nmin, nmax; 
comment The probability term can be found by searching 
the stored values; 


nmin := 0; nmaz := computed + 1; 
for n := (nmaztnmin—1) + 2 while nmac — nmin > 1 do 
if random > pson[n] then nmin := n+ lelsenmaz :=n-+1; 
poisson carlo := nmin 
end search 
else 
begin 
real psc, pn; pn := pnc; 


comment Additional probability terms must be computed; 
for n := computed + 1,n + 1 while random > ps do 
begin 

pn := pn X npx/n; 

psc := pS; ps := ps + pn; 

comment ps = cumulative probability of terms up to n, 

and pn = probability of nth term; 
if ps = psc then go to error; 


a n < 84 then begin pson[n] := ps; 
pne := pn; computed := n end; 
sect carlo :=n 
end 
end more; 
go to fin; 
error: poisson carlo := —1; 
jin: 


end poisson carlo; 

comment The following is an example of a calling program for 
the case where poisson carlo is compiled within the calling 
program rather than separately. Instead of own variables, 
non-local variables may then be used. The program is within 
the IFIP subset if this change is made, and if the expression 
(nmaz-+nmin—1) + 2 is replaced by the less efficient expression 
.501X (nmax+nmin—2); 

begin 
integer x, computed; 
real pnec, npx, npxl; 
real procedure random (x); 
comment Procedure body random is inserted here; 
integer procedure poisson carlo (npz, npx1, random); 
comment Procedure body of poisson carlo is inserted here 

after deleting declarations of own variables; 

ininteger (2,2); npxl := —1; 

inl: inreal (2, npx); 
outinteger (1, poisson carlo (npx, npzl, random (x))); 
go to inl 

end 


real array pson [0:84]; 
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EIGENVALUES AND EIGENVECTORS OF A 

REAL GENERAL MATRIX [F2] 

J. GRAD AND M. A. BREBNER 

(Recd. 12 Oct. 1967, 1 July 1968 and 8 July 1968) 

Computer Services, University of Birmingham, Birming- 
ham 15, England 

KEY WORDS AND PHRASES: eigenvalues, 
latent roots, latent vectors, Householder’s 


algorithm, inverse iteration 
CR CATEGORIES: 5.14 


elgenvectors, 
method, QR 


ABSTRACT: 

Purpose. This subroutine finds all the eigenvalues and eigen- 
vectors of a real general matrix. The eigenvalues are computed 
by the QR double-step method and the eigenvectors by inverse 
iteration. 

Method. Firstly the following preliminary modifications are 
carried out to improve the accuracy of the computed results. (i) 
The matrix is scaled by a sequence of similarity transformations 
so that the absolute sums of corresponding rows and columns are 
roughly equal. (ii) The scaled matrix is normalized so that the 
value of the Euclidean norm is equal to one. 

The main part of the process commences with the reduction of 
the matrix to an upper-Hessenberg form by means of similarity 
transformations (Householder’s method). Then the QR double- 
step iterative process is performed on the Hessenberg matrix until 
all elements of the subdiagonal that converge to zero are in modu- 
lus less than 2~ || H||z , where t is the number of significant digits 
in the mantissa of a binary floating-point number. The eigenvalues 
are then extracted from this reduced form. 

Inverse iteration is performed on the upper-Hessenberg matrix 
until the absolute value of the largest cornponent of the right- 
hand side vector is greater than the bound 2'/(100 N), where N 
is the order of the matrix. Normally after this bound is achieved, 
one step more is performed to obtain the computed eigenvector, 
but at each step the residuals are computed, and if the residuals 
of one particular step are greater in absolute value than the 
residuals of the previous step, then the vector of the previous 
step is accepted as the computed eigenvector. 

Program. The subroutine EIGENP is completely self-con- 
tained (composed of five subroutines 

EIGENP, SCALE, HESQR, REALVE, and COMPVE) 
and communication to it is solely through the argument list. The 
entrance to the subroutine is achieved by 
CALL EIGENP (N, NM, A, T, EVR, EVI, VECR, VECI, INDIC) 
The meaning of the parameters is described in the comments at 
the beginning of the subroutine EIGENP. 

REFERENCES: 

1. Witxinson, J. H. The Algebraic Eigenvalue Problem. Clarendon 
Press, Oxford, 1965, pp. 347-353, 485-867, 619-633. 

Test results. All tests have been performed on a KDF9 computer 
(t = 39). No breakdown of the method has cecurred and in general 
very accurate computed eigenvalues and eigenvectors have been 
obtained.. 
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Some examples: 
(i) The matrix 


-5 ~1 -—1 —5 —-1 
1 0 0 0 0 
0 1 0 0 0 
0 0 1 0 0 
0 0 0 1 0 


has all eigenvalues with modulus equal to one. The computed 
eigenvalues are 
— 1.00000 0000, —.25000 00000 + 7.96824 58366, .59000 00000 + 
1.86602 54038. 

The computed eigenvectors are 


T1 X2, £3 %4, 45 
- 447213595 1. 000000000 — .500000000 =F 7. 866025404 
— .447213595 — . 250000000 =F 7.968245837 —1. 000000000 = 7. 16-10 
447213595 — .875000000 + 7. 484122918 — .500000000 +: 7. 866025404 
— .447213595 .687500000 + ¢.726184377 . 500000000 + 7. 866025404 
«447213595 . 531250000 =F 7. 847215107 1..000000000 
and the computed residuals are in modulus less than .3E — 10. 


(ii) The matrix 


| 0 -1 -3 -2 

—1 0-1 0 

has the eigenvalues 

—4 + 12 and —-1 + v/2. 
The computed eigenvalues are 

—4.000000000 + 72.000000000, —2.414213562, 4142135624. 
The computed eigenvectors are 


Li, 22 x3 Ly 
— .2000000000 =F z.4000000000 .60F-12 — .12F-11 
1 .000000000 — .7941044878 .4759631495 
. 2000000000 + 72.4000000000 .§615166683 . 3365567706 
.14E-10 + 1.63E-11 .2325878195 — .8125199201 


and the computed residuals are in modulus less than .7F -- 10. 
(iii) The matrix A 


1 O 0.01 
A=|01 1 O 
0 1 1 


is transformed by the process of scaling into the form B 


574423 0 066333 
B =| .053454 .574423 0 
0 053454 .574423 


with the elements given to six decimal places. The obtained 
matrix B is essentially invariant under the QR double-step proc- 
ess. This kind of trouble was overcome by introducing the state- 
ments 


R = DABS(X) + DABS(Y) 
IF(R.EQ.0.0)SHIFT = A(M,M—1) 
IF(R.EQ.0.0)GO TO 21 


in the subroutine HESQR. 
The exact eigenvalues of A are 
1.1, 0.95 + 70.5/0.03. 
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The computed eigenvalues are 


1.100000000, 0.9500000000 + 70.0866025404. 


Acknowledgments. The authors wish to thank Dr. K. A. Redish, 


the former director of Computer Services at the University of 
Birmingham, and Dr. 8. H. Hollingdale, the present director of 
Computer Services, for their encouragement. Finally, the authors 
are indebted to Dr. J. H. Wilkinson, National Physical Labora- 
tory, Teddington, for useful consultations and suggestions. 
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SUBROUTINE EIGENP (No NMsAoToEVReEVI sVECReVECI pINDIC) 
DOUBLE PRECISION DlsD2sD3sPRFACT : 

INTEGER ToT VECsJoK 9K1 s9KONoL hl oMoNoyNM : 

REAL ENORMsEPSsEnoRoRi oT ; 

DIMENSION A(NMo1) sVECR(NM91) sVECI(NMs]i) 9 
1LEVRO(NM) sEVI (NM) » INDIC(NM) i 

DIMENSION IWORK(100) sLOCAL{ 100) sPRFACTI(100) 
1»sSUBDIA(100) sWORK1 (190) sWORK2(100) sWORK(100) 


THIS SUBROUTINE FINDS ALL THE EIGENVALUES ‘AND THE 
EIGENVECTORS OF A REAL GENERAL MATRIX OF ORDER Ne 


FIRST IN THE SUBROUTINE SCALE THE MATRIX IS SCALED SO THAT 
THE CORRESPONDING ROWS AND COLUMNS ARE APPROXIMATELY 
BALANCED AND THEN THE MATRIX IS NORMALISED! SO THAT THE 
VALUE OF THE EUCLIDIAN NORM OF THE MATRIX IS EQUAL TO ONEe 


THE EIGENVALUES ARE COMPUTED BY THE QR DOUBLE=STEP METHOD 
IN THE SUBROUTINE HESQRe 

THE EIGENVECTORS ARE COMPUTED BY INVERSE ITERATION IN 

THE SUBROUTINE REALVEsFOR THE REAL EIGENVALUESsOR IN THE 
SUBROUTINE COMPVE»FOR THE COMPLEX EIGENVALUES. 


THE ELEMENTS OF THE MATRIX ARE TO BE STORED IN THE FIRST N 
ROWS AND COLUMNS OF THE TWO DIMENSIONAL ARRAY As THE 
ORIGINAL MATRIX IS DESTROYED BY THE SUBROUTINE. 
N IS THE ORDER OF THE MATRIXe 
NM DEFINES THE FIRST DIMENSION OF THE TWO DIMENSIONAL 
ARRAYS AsVECR»VECI AND THE DIMENSION OF THE ONE 
DIMENSIONAL ARRAYS EVR»sEVI AND INDICe THEREFORE THE 
CALLING PROGRAM SHOULD CONTAIN THE FOLLOWING DECLARATION 

DIMENSION A(NM»NN) »VECR(NMSNN) sVECE(NMINN) 6 

LEVRONM) sEVI(NM) » INDIC(NM) 
WHERE NM AND NN ARE ANY NUMBERS EQUAL TO OR GREATER THAN N 
THE UPPER LIMIT FOR NM IS EQUAL TO 100 BUT MAY BE 
INCREASED TO THE VALUE MAX BY REPLACING THE DIMENSION 
STATEMENT 

DIMENSION IWORK(100) »LOCAL(100)» sos »sWORK(100) 
IN THE SUBROUTINE EIGENP WITH 

DIMENSION IWORK (MAX) sLOCAL(MAX)s eee »WORK (MAX) 
NM AND NN ARE OF COURSE BOUNDED BY THE SIZE OF THE STORE. 


THE REAL PARAMETER T MUST BE SET EQUAL TO THE NUMBER OF 
BINARY DIGITS IN THE MANTISSA OF A SINGLE PRECISION 
FLOATING=POINT NUMBERe 


THE REAL PARTS OF THE N COMPUTED EIGENVALUES WILL BE FOUND 
IN THE FIRST N PLACES OF THE ARRAY EVR AND THE IMAGINARY 
PARTS IN THE FIRST N PLACES OF THE ARRAY EVI. 

THE REAL. COMPONENTS OF THE NORMALISED EIGENVECTOR I 
(T2le2%eeexN) CORRESPONDING TO THE EIGENVALUG STORED IN 
EVR(IT) AND EVICT) WILL BE FOUND IN THE FIRST N PLACES OF 
THE COLUMN I OF THE TWO DIMENSIONAL ARRAY VECR AND THE 
IMAGINARY COMPONENTS IN THE FIRST N PLACES OF THE COLUMN I 
OF THE TWO DIMENSIONAL ARRAY VECI.« 


THE REAL EIGENVECTOR IS NORMALISED SO THAT JHE SUM OF THE 
SQUARES OF THE COMPONENTS IS EQUAL TO ONE. | 

THE COMPLEX EIGENVECTOR IS NORMALISED SO THAT THE 
COMPONENT WITH THE LARGEST VALUE IN MODULUS !HAS ITS REAL 
PART EQUAL TO ONE AND THE IMAGINARY PART EQUAL TO ZEROs> 


THE ARRAY INDIC INDICATES THE SUCCESS OF THE SUBROUTINE 
EIGENP AS FOLLOWS ‘ 
VALUE OF INDIC(T) 


EIGENVALUE I EIGENVECTOR I 


0 NOT FOUND NOT FOUND 
1 FOUND NOT FOUND 
2 FOUND 


FOUND 


IF(NeNE*1)GO TO 1 
EVR(1) = All) 
EVI(1) = 0«0 


VECR( 191) = 160 
VECI¢(1l91) = 000 
INDIC(1) = 2 

GO TO 25 


1 CALL SCALE (N»®NMeAsVECI sPRFACT sENORM) 
THE COMPUTATION OF THE EIGENVALUES OF THE NORMALISED 
MATRIX. 
EX = EXP(=T¥ALOG(200)) i 
CALL HESQRIN»sNMsAsVECI sEVReEVI sSUBDIAs INDICsEPS EX) 


ANANANAN 


NnAN aaAN 


NANAANNA 


nan 


Cc 
Cc 
Cc 
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POSSIBLE DECOMPOSITION OF THE UPPER-HESSENBERG MATRIX 
INTO THE SUBMATRICES OF LOWER ORDER IS INDICATED IN THE 
ARRAY LOCALs THE DECOMPOSITION OCCURS WHEN SOME 
SUBDIAGONAL ELEMENTS ARE IN MODULUS LESS THAN A SMALL 
POSITIVE NUMBER EPS DEFINED IN THE SUBROUTINE HESQR e THE 
AMOUNT OF WORK IN THE EIGENVECTOR PROBLEM MAY BE 
DIMINISHED IN THIS WAYe 

Jun 

Iz ]1 

LOCAL(1) = 1 

IF (JeEQel)GO TO 4 

2 IF (ABS(SUBDIA(J-1)) eGTeEPS)IGO TO 3 

I = I+l 

LOCAL (1)=0 

3 9 = Jol 
LOCAL (1)=LOCAL(1)+1 
IF (JeNE21)GO TO 2 


EIGENVECTOR PROBLEMe 
K =] 
KON = 0 
L = LOCAL(1) 
MaN 
DO 10 ImlsN 
IVEC = Ne-I+1 
IF(IeLEeL)GO TO 5 
K = K+l 
Me NeL 
L ® L+LOCAL(K) 
5 IF(INDICCIVEC) eEQe0)GO TO 10 
IF(EVICIVEC) eNEe000)GO TO 8 


TRANSFER OF AN UPPER-HESSENBERG MATRIX OF THE ORDER M FROM 
THE ARRAYS VECI AND SUBDIA INTO THE ARRAY Ac 
DO 7 K1l=l5M 
DO 6 Li=Kl»M 
6 A(Klst1) = VECI(KisL1) 
IF(K1eEQe1)GO TO 7 
A(K1sKl-1) = SUBDIA(K1=-1) 
7 CONTINUE 


THE COMPUTATION OF THE REAL EIGENVECTOR IVEC OF THE UPPER- 
HESSENBERG MATRIX CORRESPONDING TO THE REAL EIGENVALUE 
EVR(IVEC)« 
CALL REALVE(N»NMsMeIVEC As VECR » EVR 2 EVI s IWORK 9 
1 WORK» INDICsEPS»EX) 
GO TO 10 


THE COMPUTATION OF THE COMPLEX EIGENVECTOR IVEC OF THE 
UPPER~HESSENBERG MATRIX CORRESPONDING TO THE COMPLEX 
EIGENVALUE EVR(IVEC) + I*EVICIVEC)s IF THE VALUE OF KON IS 
NOT EQUAL TO ZERO THEN THIS COMPLEX EIGENVECTOR HAS 
ALREADY BEEN FOUND FROM ITS CONJUGATE. 
8 IF(KONeNE20)GO TO 9 
KON = 1 
CALL COMPVE(NeNMoMeIVEC sAs VECRoVECI sEVReEVISINDICs 
1 IWORKsSUBDIA»WORK1 »WORK2»WORK sEPS EX) 


GO TO 10 
9 KON = 0 
10 CONTINUE 


THE RECONSTRUCTION OF THE MATRIX USED IN THE REDUCTION OF 
MATRIX A TO AN UPPER=HESSENBERG FORM BY HOUSEHOLDER METHOD 
DO 12 I19N 
DO 11 JelIsN 
A(ToJ) # 060 
A(Jol) 2 000 
A(IsI) # 120 
IF(NeLE22)GO TO 15 
M = N-2 
DO 14 K2loM 
L = K+l1 
DO 14 Je2eN 
D1 = 000 
DO 13 IalsN 
D2 = VECI (19K) 
Ol = D1lt D2*A(JsI) 
DO 14 IzboN 
A(Jol) # AlJsTI-VECI(I 9K) #D1 


11 
12 


13 
14 


THE COMPUTATION OF THE EIGENVECTORS OF THE ORIGINAL NON~ 
SCALED MATRIX. 
15 KON = 1 
DO 24 I#leN 
L 2 O 
IFCEVI(1)eEQ2020)GO TO 16 
Li 
IF(KONeEQ20)GO TO 16 
KON = 0 
GO TO 24 
DO 18 J#l»N 
Dl = 0640 
D2 = 060 
DO 17 KmloN 
03 = AlJsK) 
D1 = D1+D3*VECRIKsI1) 
IF(LeEQe0}GO TO 17 


16 


COLLECTED ALGORITHMS (cont.) 


D2 = D2+D3*VECR(KsI-1) 
17 CONTINUE 
WORK(J) = D1/PRFACT (J) 
IF(LeEQe0IGO TO 18 
SUBDIA(J)=D2/PRFACT (J) 
18 CONTINUE 


: THE NORMALISATION OF THE EIGENVECTORS AND THE COMPUTATION 
C OF THE EIGENVALUES OF THE ORIGINAL NON=NORMALISED MATRIXe 
IF(LeEQol)GO TO 21 
Ol = 020 
DO 19 M@loeN 
19 - O01 # D1+WORK (M) *¥#2 
O1 = OSQRT(D1) 
DO 20 MzloN 
VECI(MeT) = 000 
20 VECR(MsI) = WORK(M)/D1 
EVR(1) = EVR(1)*#ENORM 
GO TO 24 


21 KON = 1 
EVR(1) = EVR(1)*#ENORM 
EVR(I-1) = EVR(T) 
EVI(1) = EVICI)*#ENORM 
EVI(I-1) =s-EVI(1) 
R = 0.0 
DO 22 J=#loN 
R1 = WORK(J)##2 + SUBDIA( J) ##2 
IF(ReGEeR1)GO TO 22 
R=# Rl 
Lowe J 
22 CONTINUE 
D3 = WORK(L) 
R1l = SUBDIA(L) 
DO 23 JeleN 
D1 = WORK(J) 
D2 = SUBDIA(J) 
VECR(Je9T) # (014#03402*R1)/R 
VECH(JsT) = (02*D3-D1*R1)/R 
VECR(JsI-1) © VECR(JeI) 
23 VECI(JeI-1) e-VECI(JsI1) 
24 CONTINUE 


25 RETURN 
END 


SUBROUTINE SCALE (NoNMeA cH» PRFACT sENORM) 

DOUBLE PRECISION COLUMN s FACTOR sFNORM sPRFACT 9Q9ROW 
INTEGER IsJe ITER sNsNCOUNT oNM 

REAL BOUND1 »8OUND2 »ENORM 

DIMENSION A(NMo1) sH(NMol ) sPRFACT(NM) 


THIS SUBROUTINE STORES THE MATRIX OF THE ORDER N FROM THE 
ARRAY A INTO THE ARRAY He AFTERWARD THE MATRIX IN THE 
ARRAY A IS SCALEO SO THAT THE QUOTIENT OF THE ABSOLUTE SUM 
OF THE OFF-DIAGONAL ELEMENTS OF COLUMN I AND THE ABSOLUTE 
SUM OF THE OFF~OIAGONAL ELEMENTS OF ROW I LIES WITHIN THE 
VALUES OF 30UND1 AND BOUND2. 

THE COMPONENT I OF THE EIGENVECTOR OBTAINED BY USING THE 
SCALED MATRIX MUST BE DIVIDED 8Y THE VALUE FOUND IN THE 


PRFACT(I) OF THE ARRAY PRFACT. IN THIS WAY THE EIGENVECTOR 
OF THE NON-SCALED MATRIX IS OBTAINED. 


AFTER THE MATRIX IS SCALED IT IS NORMALISED SO THAT THE 
VALUE OF THE EUCLIDIAN NORM IS EQUAL TO ONEes 

1F THE PROCESS OF SCALING WAS NOT SUCCESSFUL THE ORIGINAL 
MATRIX FROM THE ARRAY H WOULD BE STORED BACK INTO A AND 
THE EIGENPROBLEM WOULD BE SOLVED BY USING THIS MATRIXe 

NM DEFINES THE FIRST DIMENSION OF THE ARRAYS A AND He NM 
MUST BE GREATER OR EQUAL TO No 

THE EIGENVALUES OF THE NORMALISED MATRIX MUST BE 
MULTIPLIED BY THE SCALAR ENORM IN ORDER THAT THEY BECOME 
THE EIGENVALUES OF THE NON-NORMALISED MATRIXe 


NNNANNAADANANANANN ANNADADAAAN 


OO 2 I=loN 
DO 1 J#loN 
1 H(ITsJ) = AlTosJ) 
2 PRFACT(I1)= 140 
BOUND] = 0.75 
BOUND2 = 1.33 
ITER = 0 
3 NCOUNT = 0 
DO 8 I=l»N 
COLUMN ® 0-20 
ROW = 0.0 
DO 4 J=loN 
IFCTeEQeJ)IGO TO 4 
COLUMN = COLUMN+ ABSCA(JsI)) 
ROW = ROW + ABS(ACIsJ)) 
4 CONTINUE 
TF (COLUMNCEQ+¢020)1GO TO 5 
TF (CROWeEQe000)GO TO 5 
Q = COLUMN/ROW 
IF (QeLTeBOUNDI )GO TO 6 
IF (QeGTeBOUND2)GO TO 6 


DANA DANADNDNNDDNDADNNANNDDANNANAANAAN 
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5 NCOUNT = NCOUNT + 1 
GO TO 8 
6 FACTOR = DSQRT(Q) 
DO 7 J#loN 
IF(TeEQeJ)GO TO 7 
A(IoJ) = AlIeJ)*FACTOR 
A(Jol) = AlJsI)/FACTOR 
7 CONTINUE 
PRFACT(I) = PRFACT(I)#FACTOR 
8 CONTINUE 
ITER = ITER+1 
IFC ITER-GT230)GO TO 11 
IF (NCOUNTeLTeNIGO TO 3 


FNORM = 020 
DO 9 I#1leN 
DO 9 J#leN 
Q = A(IeJ) 

9 FNORM #* FNORM+Q#Q 
FNORM = DSQRT(FNORM) 
DO 10 I=#leN 

DO 10 J=leN 
10 AlCl oJ) ACT 9J) /FNORM 
ENORM = FNORM 
GO TO 13 


11 DO 12 I#19N 
DO 12 JelsN 
12 A(IeJ) = H(I9J) 
ENORM = 120 


13 RETURN 
END 


SUBROUTINE HESQR(NsNMeAsHsEVReEVI sSUBDIAsINDIC sEPS EX) 
DOUBLE PRECISION S#SR»oSR29XvVoZ 

INTEGER IsJsKoL sMoMAXST oM1 sNeNMoNS 

REAL EPSsEXsRoSHIFTsT 

DIMENSION A(NMo1) sH(NMol ) sEVRO(NM) 9 EVI (NM) »SUBDIA( NM) 
DIMENSION INDIC(NM) 


THIS SUBROUTINE FINDS ALL THE EIGENVALUES OF A REAL 
GENERAL MATRIXe THE ORIGINAL MATRIX A OF ORDER N IS 
REDUCED TO THE UPPER-HESSENBERG FORM H BY MEANS OF 
SIMILARITY TRANSFORMATIONS (HOUSEHOLDER METHOD)« THE MATRIX 
H 1S PRESERVED IN THE UPPER HALF OF THE ARRAY H AND IN THE 
ARRAY SUBDIAs THE SPECIAL VECTORS USED IN THE DEFINITION 
OF THE HOUSEHOLDER TRANSFORMATION MATRICES ARE STORED IN 
THE LOWER PART OF THE ARRAY He 

NM IS THE FIRST DIMENSION OF THE ARRAYS A AND He NM MUST 
BE EQUAL .TO OR GREATER THAN Ne 

THE REAL PARTS OF THE N EIGENVALUES WILL BE FOUND IN THE 
FIRST N PLACES OF THE ARRAY EVRsAND 

THE IMAGINARY PARTS IN THE FIRST N PLACES OF THE ARRAY EVI 
THE ARRAY INDIC INDICATES THE SUCCESS OF THE ROUTINE AS 
FOLLOWS 


VALUE OF INDIC(I) EIGENVALUE 1] 
0 NOT FOUND 
i FOUND 


EPS IS A SMALL POSITIVE NUMBER THAT NUMERICALLY REPRESENTS 
ZERO IN THE PROGRAM. EPS ® (EUCLIDIAN NORM OF H)*EX »sWHERE 
EX = 2**(-T)e T IS THE NUMBER OF BINARY DIGITS IN THE 
MANTISSA OF A FLOATING POINT NUMBER» 


REDUCTION OF THE MATRIX A TO AN UPPER-HESSENBERG FORM He 
THERE ARE N-2 STEPS. 
IF (N-2) 149192 
1 SUBDIA(1) = A291) 
GO TO 14 
2M = N-2 
DO 12 K2leM 
L = K+l 
S = 0.0 
DO 3 I=LeN 
H(I9K) # ALT9K) 
3 S = S+ABS(A(19K)) 
IF (SeNEsABS(A(K+19K)))GO TO 4 
SUBDIA(K) = A(K+l9K) 
H(K+19K) = 000 
GO TO 12 
4 $§R2 = 020 
DO 5 I=LeN 
SR = ACIsK) 
SR = SR/S 
A(IoK) = SR 
5 SR2 = SR2+SR#*SR 
SR = DSQRT(SR2) 
IF(ACL 9K) el T00001GO TO 6 
SR = -SR 
6 SR2 = SR2-SR#¥A(L9K) 
A(L>K) = A(LsK)-SR 
H(L»K) = H(LeK)-SR#S 
SUBDIA(K) = SR#S 
X = S*DSQRTISR2) 


COLLECTED ALGORITHMS (cont.) 


DO 7 IetoNn 
H(IoK) = H(IsK)/X 
7 SUBDIA(I) = AlIoK)/SR2 
C PREMULTIPLICATION-BY THE MATRIX PRe 
DO 9 J=zLoN 
SR = 0-0 
00 8 Ie#tsN 
8 SR = SRtA( IT sK)#AC I 9J) 
DO 9 ImLoN 
9 Al(IsJ) = A(IsJ)—-SUBDIA(1)#SR 
C POSTMULTIPLICATION BY THE MATRIX PRe 
00 11 JelsN 
SR#020 
DO 10 I=#leN 
10 SR ® SRA IoT) ACI OK) 
DO 11 I2leNn 
11 A(Jol) «& AlJoT)-SUBDIA(T)*SR 


12 CONTINUE 
OO 13 KeloM 
13 A(K+1eK) = SUBDIA(K) , 
TRANSFER OF THE UPPER HALF OF THE MATRIX A INTO THE 
ARRAY H AND THE CALCULATION OF THE SMALL POSITIVE NUMBER 
EPS. 
SUBDIA(N-1)} = A(NsN=1) 
14 EPS = 0.0 
DO 15 KeloN 
INDIC(K) = 0 
IF(KeNESN)EPS © EPS+SUBDIA(K) ¥*2 
DO 15 I=K»N 
H(Kel) # A(Koel) 
15 EPS = EPS + A(Kol) ##*2 
EPS = EX#SQRT(EPS) 


NAN 


THE QR ITERATIVE PROCESS»s THE UPPER- -HESSENBERG MATRIX H IS 
REDUCED TO THE UPPER=MODIFIED TRIANGULAR FORMe 


DETERMINATION OF THE SHIFT OF ORIGIN FOR THE. FIRST STEP OF 
THE QR ITERATIVE PROCESS. 

SHIFT @ A(N»sN~1) 

IF(NeLEo2)SHIFT = 020 

TFCACNSN) oNESOCO)SHIFT = 020 

TFCA(N*19N) eNECOCO)SHIFT * 040 

IF CACN@19N~1) eNE®OeO) SHIFT = 060 

M#2N 

NS 0 ; 

MAXST = N#10 : 


NNAADAN 


TESTING IF THE UPPER HALF OF THE MATRIX IS EQUAL TO ZEkKOe 
IF IT IS EQUAL TO ZERO THE QR PROCESS IS NOT: ‘NECESSARY « 
DO 16 I#2e9N 
DO 16 KaIeN 
IF (ACT -19K) eNE0060)GO TO 18 
16 CONTINUE 
DO 17 Iml»N 
INOIC(1) #1 
EVR(I) = A(IsTI) 
17 EVI¢(1) # 020 
GO TO 37 


AND 


an 


START THE MAIN LOOP OF THE QR PROCESS. 
18 K=M~1 
M1lek 
a . 
FIND ANY DECOMPOSITIONS OF THE MATRIX 
JUMP TO 34 IF THE LAST SUBMATRIX OF THE DECOMPOSITION 1S 
OF THE ORDER ONE. 
JUMP TO 35 IF THE LAST SUBMATRIX OF THE DECOMPOSITION IS 
OF THE ORDER TWO. 
1F(K) 37934919 
19 IF(ABS(A(M9K) )oLEeEPS)GO TO 34 
IF (M-2eEFQ.290)GO TO 35 
20 1 = I-1 
IF(ABS(A(K91)) eLEsEPS)GO TO 21 
Ks I 
IF(KeGT.1)GO TO 20 
21 IF (KeEQsM1)GO TO 35 
C TRANSFORMATION OF THE MATRIX OF THE ORDER GREATER THAN TWO 
S = A(MoM)+A(M] ML) +SHIFT 
SR= A(MoM) #A(M1] oM1)— A(MsM1)¥A(M1 9M) +0025 *SHIFT#X2 
A(K+#29K) = 000 
C CALCULATE X1sY1sZ19FOR THE SUBMATRIX OBTAINED BY THE 
C DECOMPOSITION. 
X = ACK eK FCA(K eo KIWS)FA(K sK4+1) HA(CKH1 9K 45 SR 
Y = A(K+1 KJ #(ACK OKI FA(K#1 9KO1)=S) 
R = DABS(X)+DABS(Y) 
IF(ReEQeOeOISHIFT = A(MsM~1) 
IF (RsEQe020)GO TO 21 
Z = A(K4+Z 9K41IHA(KF1 9K) 
SHIFT = 0.0 
NS = NS#) 


NANDAN 


C THE LOOP FOR ONE STEP OF THE QR PROCESS« 
DO 33 I=K»oM1 
IF(TeEQeK)GO TO 22 
€ CALCULATE XRoYRoZRe 
X = AlIeI-1) 
Y = A(I4loI-l) 
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Z & 020 
IFC I+2e¢GTeM)GO TO 22 
Z = A(1+2sI~1) 
22 SR2 = DABS(X)+DABS(Y)+DABS(Z) 
IF(SR2eEQe0¢0)GO TO 23 
X = X/SR2 
Y = Y/SR2 
Z ® 2Z/SR2 
23 S = DSQRT(X*X + Y#Y + Z#Z) 
IF (XeLT2020)GO TO 24 
S$ = -S 
24 IF(TeEQeK)GO TO 25 
A(IsI-1) = S*SR2 
25 IF (SR2eNE0020)GO TO 26 
IF(14+3eGTeM)IGO TO 33 
GO TO 32 
26 SR = 160-X/S 
S = x=-S 
xX ®= Y/S 
Y = Z/S 
C PREMULTIPLICATION BY THE MATRIX PRe 
DO 28 J=IeM 
S = A(T sJIFAC1T +195) #X 
IF(1+2eGTeM)GO TO 27 
S @ StA(T+2sJ)#Y 
27 S = S*SR 
AC(I9J) = AlITeJ)=-S 
A(I+ly9J) = Al(I+19J)—-S*X 
IF(1+2eGTeM)GO TO 28 
AlI+29J) = AlI1+29J5)-S#Y 


28 CONTINUE 
C POSTMULTIPLICATION BY THE MATRIX PRe 
L = +2 
IFC IleLTeM1)GO TO 29 
L =M 


29 OO 31 JeKolt 
S = AlJoIT)+A( Jo T +1) #X 
IF(I+2eGTeM)GO TO 30 
S = § + AlJoI+2)#Y 
30 S = S*SR 
A(JeT) = Al(JsT)-S 
A(JsT+1)*A(JsT+1)-S#X 
IF(I+2eGTeM)GO TO 31 
AC JeT4+2)=A(J9T1+2)—-S#Y 
31 CONTINUE 
IF(1+3e¢GTeM)GO TO 33 
S = -ACI+39T+2)*Y*#SR 
32 A(I+3s1) = S$ 
A(I43514+1) = S#X 
A(14+3914+2) = S#Y + A(I+39IT+2) 
33 CONTINUE 


¢ 
IF(NSeGTeMAXST)GO TO 37 
GO TO 18 

c 

C COMPUTE THE LAST EIGENVALUEes 


34 EVR(M) = A(Mo»M) 
EVI(M) = 0-0 
INDIC(M) = 1 
MezkK 
GO TO 18 

c 


0 


C COMPUTE THE EIGENVALUES OF THE LAST 2X2 MATRIX OBTAINED BY 


C THE DECOMPOSITION. 

35 R = 0o5*(A(K9K)+A(M9M) ) 
S = 0e5*(A(MeM)-A(KoK)) 
S = S¥S + A(KeM)#A(M9K) 
INDIC(K) 2 1 
INDIC(M) = 1 
IF (SeLTe000)GO TO 36 
T = OSQRT(S) 
EVR(K) = R-T 
EVR(M) © R+T 
EVI(K) = 060 
EVI(M) = 060 
M = M=2 
GO TO 18 

36 T = DSQRT(-S) 
EVR(K) = R 
EVI(K) = T 
EVR(M) = R 
EVI(M) = -T 
M = M=2 
GO TO 18 


37 RETURN 
: END 


SUBROUTINE REALVE (NsNMoMs I VEC sAnVECR s EVR» EVI » 
1I WORK sWORK s INDIC sEPSsEX) 

DOUBLE PRECISION SoSR 

INTEGER IsIVECsITERsJoK ok sMoNsNMeNS 

REAL BOUND sEPSeEVALUE 9 EX sPREVISoReR1 oT 
OIMENSION A(NMo1) sVECRINMo1) sEVR(NM) 

DIMENSION EVI(NM) » IWORK (NM) »WORK (NM) sINDIC(NM) 


COLLECTED ALGORITHMS (cont.) 


THIS SUBROUTINE FINOS THE REAL EIGENVECTOR OF THE REAL 
UPPER-HESSENBERG MATRIX IN THE ARRAY AsCORRESPONDING TO 
THE REAL EIGENVALUE STORED IN EVR(IVEC)e THE INVERSE 
ITERATION METHOD IS USED. 

NOTE THE MATRIX IN A IS DESTROYED BY THE SUBROUTINE. 

N IS THE ORDER OF THE UPPER=HESSENBERG MATRIX. 

NM DEFINES THE FIRST OIMENSION OF THE TWO DIMENSIONAL 
ARRAYS A AND VECRe NM MUST BE EQUAL TO OR GREATER THAN Ne 
M IS THE ORDER OF THE SUBMATRIX OBTAINED BY A SUITABLE 
DECOMPOSITION OF THE UPPER-HESSENBERG MATRIX IF SOME 
SUBDIAGONAL ELEMENTS ARE EQUAL TO ZERO. THE VALUE OF M IS 
CHOSEN SO THAT THE LAST N-M COMPONENTS OF THE EIGENVECTOR 
ARE ZEROs 

IVEC GIVES THE POSITION OF THE EIGENVALUE IN THE ARRAY EVR 
FOR WHICH THE CORRESPONDING EIGENVECTOR IS COMPUTED. 

THE ARRAY EVI WOULD CONTAIN THE IMAGINARY PARTS OF THE N 
EIGENVALUES IF THEY EXISTED. 


THE M COMPONENTS OF THE COMPUTED REAL EIGENVECTOR WILL BE 
FOUND IN THE FIRST M PLACES OF THE COLUMN IVEC OF THE TwO 
DIMENSTONAL ARRAY VECRe 


IWORK ANDO WORK ARE THE WORKING STORES USED DURING THE 
GAUSSIAN ELIMINATION AND BACKSUBSTITUTION PROCESS» 


THE ARRAY INOIC INDICATES THE SUCCESS OF THE ROUTINE AS 
FOLLOWS 


VALUE OF INDIC(!) EIGENVECTOR I 
1 NOT FOUND 
2 FOUND 


EPS 15 A SMALL POSITIVE NUMBER THAT NUMERICALLY REPRESENTS 
ZERO IN THE PROGRAM. EPS # (EUCLIDIAN NORM OF A) #*EXeWHERE 
EX = 2*#(-T). T IS THE NUMBER OF BINARY DIGITS IN THE 
MANTISSA OF A FLOATING POINT NUMBERe 
VECR(1sIVEC) = 120 
IF (MsEQo1)GO TO 24 
SMALL PERTURBATION OF EQUAL EIGENVALUES TO OBTAIN A FULL 
SET OF EIGENVECTORS. 
EVALUE = EVR(IVEC) 
IF ( EVECeEQeM)GO TO 2 
K = IVEC+1 
R = 0.0 
DO 1 I12KsM 
IFCEVALUESNEeEVR(I))GO TO 1 
IFCEVICI) eNE6060)GO TO 1 
R = R+320 
1 CONTINUE 
EVALUE = EVALUE+R*#EX 
2 DO 3 KzloM 
3 A(KsK) & Al(KsK )-EVALUE 


NANNANBNNABAANNANNHDHNANNANANNNANNANANNAAAN 


aN 


Cc 
C GAUSSIAN ELIMINATION OF THE UPPER-HESSENBERG MATRIX Ae ALL 
C ROW INTERCHANGES ARE INDICATED IN THE ARRAY IWORKeALL THE 
C MULTIPLIERS ARE STORED AS THE SUBDIAGONAL ELEMENTS OF Ac 
K = M=-) 
DO 8 IelsKk 
L = I+] 
IWORK(I) = 0 
IFCACEs 1) eNE*020)GO TO 8 
A(IeI) = EPS 
GO TO 8 
4 IF CABSCACT 91) ) eGEeABS(A(I+1919))GO TO 6 
IWORK(I) = 1 
DO 5 JuloM 
R = A(IoeJ) 
A(IsJ) = A(T+1lsJ) 
5 A(I+lseJ) = R 
6 R = -ACI41lsI)/ACI 91) 
A(I+leoI) = R 
DO 7 Jak»M 
7 A(T+loJ) = ACT+leJItR¥ACT oJ) 
8 CONTINUE 
IF (A(MeM) eNE*020)GO TO 9 
A(MoM) = EPS 


aan 


THE VECTOR (1solseeesl) IS STORED IN THE PLACE OF THE RIGHT 
HAND SIDE COLUMN VECTOR. 
9 DO 11 I#10N 
IF(1-GTeM)GO TO 10 
WORK(T) = lev 
GO TO 11 
10 WORK(I) = 020 
11 CONTINUE 


THE INVERSE ITERATION IS PERFORMED ON THE MATRIX UNTIL THE 
INFINITE NORM OF THE RIGHT-HAND SIDE VECTOR IS GREATER 
THAN THE BOUND DEFINED AS O.O1/(N#EX) 4 

BOUND = O0.O1/(EX * FLOATI(N)) 

NS = O 

ITER = 1 


NAAN 


C THE BACKSUBSTITUTION. 
12 R = 0.0 
DO 15 IsleM 
J = M-I+1 
S = WORK(J) 
IF (JeEQ.M)GO TO 14 


nan 


NANNAAN 


C 
Cc 


AN NABNANANNANARDANDNADANDANAANANAN 
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L = J+] 
DO 13 K=LoM 
SR = WORK(K) 
13 S = § - SR¥A(JU5K) 
14 WORK(J) = S/A( Jed} 


T = ABS(WORK(J)) 
IF(ReGE»T)GO TO 15 
R= T 

15 CONTINUE 


THE COMPUTATION OF THE RIGHT-HAND SIDE VECTOR FOR THE NEW 
ITERATION STEPe 
DO 16 I=19M 
16 WORK( IT) = WORK(I)/R 


THE COMPUTATION OF THE RESIDUALS AND COMPARISON OF THE 
RESIDUALS OF THE TWO SUCCESSIVE STEPS OF THE INVERSE 
ITERATIONS IF THE INFINITE NORM OF THE RESIOUAL VECTOR IS 
GREATER THAN THE INFINITE NORM OF THE PREVIOUS RESIDUAL 
VECTOR THE COMPUTED EIGENVECTOR OF THE PREVIOUS STEP IS 
TAKEN AS THE FINAL EIGENVECTOR. 
R1 = 020 
DO 18 I=19M 
T = 020 
DO 17 JeloM 
17 T = THA(T oJ) *WORK( J) 
T = ABS(T) 
IF(R1e¢GEeTIGO TO 18 
Ris T 
18 CONTINUE 
IFCITER*EQ21)GO TO 19 
IF(PREVISeLEeR1)GO TO 24 
19 00 20 I=1M 
20 VECR(IsIVEC) = WORK(I) 
PREVIS = R1 
IF(NSeEQe1)GO TO 24 
IF(ITER¢GT6)60 TO 25 
ITER = ITER+1 
IF(ReLTeBOUNDIGO TO 21 
NS = 1 


GAUSSIAN ELIMINATION OF THE RIGHT-HAND SIDE VECTOR. 
21K = M1 
DO 23 ImlsK 
R = WORK(I+1) 
IFC IWORK(1)eEQ20}GO TO 22 
WORK (142) WORK (I }4+WORK (141) #ACT +191) 
WORK(I) = R 
GO TO 23 
22 WORK (I+1) #WORK (141) +WORK(I)#A(I4191) 
23 CONTINUE 
GO TO 12 


24 INDICCIVEC) = 2 
25 IF(MeEQeN)GO TO 27 
J = M+) 
DO 26 I=JeN 
26 VECR(IsIVEC} © 020 
27 RETURN 
END 


SUBROUTINE COMPVE (N»NMoMo I VEC sAsVECR oH sEVROEVI sINDICs 
1I WORK »SUBDIA» WORK] »WORK2 »WORK sEPSsEX) 

DOUBLE PRECISION Do01 

INTEGER Isllsi2sITERsIVECsJoK ol MeN sNMoNS 

REAL BsBOUNDsEPS sETAsEX oFKSI sPREVISsReSsUeV 
DIMENSION A(NMo1) sVECR(NMs1) »H(NMs1) sEVRINM) pEVI UNM) » 
LINDIC(NM) » LWORK (NM) »SUBDIA(NM) sWORK1 (NM) »WORK2 (NM) > 
2WORK (NM) 


THIS SUBROUTINE FINDS THE COMPLEX EIGENVECTOR OF THE REAL 
UPPER-HESSENBERG MATRIX OF ORDER N CORRESPONDING TO THE 
COMPLEX EIGENVALUE WITH THE REAL PART IN EVR(IVEC) AND THE 
CORRESPONDING IMAGINARY PART IN EVICIVEC)« THE INVERSE 
ITERATION METHOD IS USED MOOIFIED TO AVOID THE USE OF 
COMPLEX ARITHMETIC. 

THE MATRIX ON WHICH THE INVERSE ITERATION IS PERFORMED IS 
BUILT UP IN THE ARRAY A BY USING THE UPPER-HESSENBERG 
MATRIX PRESERVED IN THE UPPER HALF OF THE ARRAY H AND IN 
THE ARRAY SUBDIAe 

NM DEFINES THE FIRST DIMENSION OF THE TWO DIMENSIONAL 
ARRAYS AsVECR AND He. NM MUST BE EQUAL TO OR GREATER 

THAN Neo 

M IS THE ORDER OF THE SUBMATRIX OBTAINED BY A SUITABLE 
DECOMPOSITION OF THE UPPER-HESSENBERG MATRIX IF SOME 
SUBDIAGONAL ELEMENTS ARE EQUAL TO ZERO. THE VALUE OF M IS 
CHOSEN SO THAT THE LAST N-M COMPONENTS OF THE COMPLEX 
EIGENVECTOR ARE ZEROe 


THE REAL PARTS OF THE FIRST M COMPONENTS OF THE COMPUTED 
COMPLEX EIGENVECTOR WILL BE FOUND IN THE FIRST M PLACES OF 
THE COLUMN WHOSE TOP ELEMENT IS VECR(1sIVEC) AND THE 
CORRESPONDING IMAGINARY PARTS OF THE FIRST M COMPONENTS OF 
THE COMPLEX EIGENVECTOR WILL BE FOUND IN THE FIRST M 
PLACES OF THE COLUMN WHOSE TOP ELEMENT IS VECR(1sIVEC~1)e 


COLLECTED ALGORITHMS (cont.) 


C THE ARRAY INDIC INDICATES THE SUCCESS OF THE ROUTINE AS 
C FOLLOWS 
Cc VALUE OF INDIC(I) EIGENVECTOR 1 
Cc 1 NOT FOUND 
c 2 FOUND 
C THE ARRAYS IWORK»WORK1sWORK2 AND WORK ARE THE WORKING 
C STORES USED DURING THE INVERSE ITERATION PROCESS. 
C EPS IS A SMALL POSITIVE NUMBER THAT NUMERICALLY REPRESENTS 
C ZERO IN THE PROGRAM. EPS = (EUCLIDIAN NORM OF H)*EX» WHERE 
C EX = 2##(—-T)e T IS THE NUMBER OF BINARY DIGITS IN THE 
C MANTISSA OF A FLOATING POINT NUMBER. 
Cc i 
FKSI = EVR(IVEC) 
ETA = EVICIVEC) 
THE MODIFICATION OF THE EIGENVALUE (FKSI + I#ETA) IF MORE 
EIGENVALUES ARE EQUAL. ° ; 
IF (IVECeEQeM)IGO TO 2 
K = IVEC+1 
R = 020 
DO 1 I=KeM 
IFC FKSIeNEsEVR(I))GO TO 1 
IF (ABS(ETA) eNEoABS(EVI(I)))GO TO 1 
R= R + 3.0 
1 CONTINUE 
R = R¥EX 
FKSI = FKSI4R 
ETA = ETA +R 


aN 


€ 
C THE MATRIX (CH<FKSI®L)*(H-FKSI*¥I) + (ETA*ETAI¥I) 15 
C STORED INTO THE ARRAY Ao 


2 R = FKSI*FKSI + ETA*ETA 
S = 2e0*FKSI 
L = M=] 
DO 5 I=l»eM 
DO 4 JelseM 
D = 0.0 
A(JoI) © 020 
DO 3 K & IeJ 
D = DtH tl oK)*H(KeJ) i 
A(IToJ) #& D-S#H(IT»J) 
ACIoI) = AlIs1)+R 
DO 9 I#1eL 
R = SUBDIA(T} 
ACI+1o9I) = ~S#R 
Il # [+41 
DO 6 JelsTl 
6 ACJeI) © AlCIsTIF+R¥H(SoT4+1) 
IF(TeEQe1)GO TO 7 
A(T ¢1lseI-1) = R¥SUBDIA(I=-1) 
DO 8 J=IeM 
A(T41l9J) = ACT+1lsJ)+R#H(19J} 
CONTINUE 


wm ew 


oon 


THE GAUSSIAN ELIMINATION OF THE MATRIX : 
((HFKSE®I)*(H-FKSI®1) + (ETA*ETA)#I) IN THE! ARRAY As THE 
ROW INTERCHANGES THAT OCCUR ARE. INDICATED IN: THE ARRAY 
IWORKe ALL THE MULTIPLIERS ARE STORED IN THE. FIRST AND IN 
THE SECOND SUBDIAGONAL OF THE ARRAY Ac 
K = MH] 
OO 18 I#1lsKk 
Il = I+l 
12 = 142 
IWORK(I) = 0 
IF (1TeEQeK)GO TO 10 
IFCAC T4291) ¢NEe060)GO TO 11 
10 IF(ACI +191) eNE0000)GO TO 11 : 
IF(ACI 9 I) eNEe00e0)GO TO 18 
A(Iel) = EPS 
GO TO 18 
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11 IF(TeEQeK)GO TO 12 
IF(ABSCA(CYE+191)) oGEeABS(A( I +291) ))G0 To 12 
IF(CABS(ACI 91) ) eGEeABS(A(I4+291)))GO TO 16 
L = 142 
IWORK(I) = 2 
GO TO 13 
12 IF(ABS(A(T 91) ) eGEeABS(ACT+191)) GO TO 15 
Ll = I+1 
IWORK(I) = 1 


13 00 14 JaIoM 
R # A(IeJ) 
A(IeJ) # Al(lLoJ) 
14 AlLeJ) = R 
15 IFC ITeNE*K)GO TO 16 
12 2 11 
16 DO 17 Lelisi2 
R = -A(LolT)/ACTI 91) 
AlLslT) = R 
DO 17 JelloM 
17 A(LeJ) = Al(LeJ)+R#A(T oJ) 
18 CONTINUE 
IF CACMsM) eNE*0e0)GO TO 19 
A(MeM) & EPS 


: 
C THE VECTOR (leleeeesl) IS STORED INTO THE RIGHT-HAND SIDE 
C VECTORS VECR( »IVEC) AND VECR( sIVEC=1) REPRESENTING THE 
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C COMPLEX RIGHT-HAND SIDE VECTOR. 

19 DO 21 I=]9N 
IF(1Te¢GTeM)GO TO 20 
VECR(IsIVEC) © 140 
VECR(IsIVEC~1) = 120 
GO TO 21 

20 VECR(IsIVEC) = 000 
VECR(IsIVEC-1) = 020 

21 CONTINUE 


Cc 
C THE INVERSE ITERATION IS PERFORMED ON THE MATRIX UNTIL THE 
C INFINITE NORM OF THE RIGHT-HAND SIDE VECTOR IS GREATER 
C THAN THE BOUND DEFINED AS OcO1/(N#EX) 
BOUND = 0.01/(EX#FLOAT(N)) 
NS = 0 
ITER = 1 
DO 22 ImloM 
22 WORK(I) = HO1sI)-FKSI 
c 
C THE SEQUENCE OF THE COMPLEX VECTORS Z(S) = P(S)4+I#Q(S) AND 
C W(St1)= U(St1)+1*V(S+41) IS GIVEN BY THE RELATIONS 
C (A — (CFKSI-“I#ETA)*#1)#W(S41) = Z(S) AND 
C 2¢S+i) = W(S+1)/MAX(W(S41))~ 
3 THE FINAL W(S) IS TAKEN AS THE COMPUTED EIGENVECTOR« 
C THE COMPUTATION OF THE RIGHT=HAND SIDE VECTOR 
C (A-FKSI*I)#P(S)-ETA#Q(S)e¢ A 1S AN UPPER-HESSENBERG MATRIXe 
23 D0 27 I=loM 
D = WORK(I)#*VECR(IsIVEC) 
IF(1TeEQ01)GO TO 24 
D = D+SUBDIA(I~1)*VECR(I-1sI1VEC} 
24 Les I+l 
IF(LeGTeM)GO TO 26 
DO 25 K=loM 
25 D = D+H(IeK)#VECR(KsIVEC) 
26 VECROTsIVEC#1) © D=ETA*VECRITsIVEC-1) 
27 CONTINUE 
G 
C GAUSSIAN ELIMINATION OF THE RIGHT=HAND SIDE VECTOR. 
K = M-l 
DO 28 I=loK 
L = I+IWORK(I) 
R = VECR(LoIVEC-1) 
VECR(LsIVEC=1) = VECR(IsIVEC-1) 
VECR(IsIVEC-1) = R 
VECR(I+1sIVEC=1) & VECR(I+1 sIVEC~1)+AC T4191) *R 
IF(TeEQeK)GO TO 28 
VECRCIT+2sIVECH1) = VECRIIT+2eIVEC=1)+A( 14291) #R 
28 CONTINUE 
c 


C THE COMPUTATION OF THE REAL PART U(S+1) OF THE COMPLEX 
C VECTOR wW(St+1). THE VECTOR U(S#1) IS OBTAINED AFTER THE 
C BACKSUBSTITUTION. 
DO 31 I=1l5M 
J = M-I4] 
O = VECR(JsIVEC-1) 
IF(JeEQeM)GO TO 30 
L = Jl 
DO 29 KztoM 
Dl = AlJeK) 
29 D = D-DI#VECRI(KsIVEC-1) 
30 VECR(JsIVEC=1) = D/A JJ) 
31 CONTINUE 


c 
C THE COMPUTATION OF THE IMAGINARY PART V(S+1) OF THE VECTOR 
C W(S+1)s WHERE V(S+1) © (P(S)<(A~FKSI®I)*#U(S4#1)) /ETAS 
DO 35 Im=lyM 
D = WORK(I)*VECR(IsIVEC=1) 
IF(1eEQe1)GO TO 32 
D = D+SUBDIA(I-1)*#VECR(I-1»IVEC-1) 
32 Le I+) 
IF(LeGTeM)GO TO 34 
DO 33 K=LoM 
33. siDsS® «D+H(TeK) #VECRIK» IVEC-21) 
34 VECR(IsIVEC) © (VECR( Ts IVEC)-D)/ETA 
35 CONTINUE 


C 

C THE COMPUTATION OF CINFINe NORM OF W(S41))##2 © 
tLzl 
S = 0.0 


DO 36 [ele 
R = VECR(IsIVEC)*#*2 + VECR(TsIVEC-1)*#2 
IF(ReLE«S)GO TO 36 
S = R 
Ls! 
- 36 CONTINUE 
C. THE COMPUTATION OF THE VECTOR Z(S+1) WHERE 2(St1)* W(S41)/ 
C (COMPONENT OF W(S4+1) WITH THE LARGEST ABSOLUTE wAbve? ° 
U = VECR(L»sIVEC-1) 
V = VECRI(L»sIVEC) 
0O 37 I=leM 
B = VECR(IsIVEC) 
R = VECR(IsIVEC=1) 
VECR(ITsIVEC) = (R#*U + BEVIS 
37 VECR(ITsIVEC$1) = (BRU-R#V)/S 
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THE COMPUTATION OF THE RESIDUALS AND COMPARISON OF THE 
RESIDUALS OF THE TWO SUCCESSIVE STEPS OF THE INVERSE 
ITERATIONe IF THE INFINITE NORM OF THE RESIDUAL VECTOR IS 
GREATER THAN THE INFINITE NORM OF THE PREVIOUS RESIDUAL 
VECTOR THE COMPUTED VECTOR OF THE PREVIOUS STEP IS TAKEN 
AS THE COMPUTED APPROXIMATION TO THE EIGENVECTOR. 
B = 0.0 
DO 41 I=#loM 
R # WORK(T)*VECRCIsIVEC-1) - ETARVECR(IsIVEC} 
Us WORK(TI¥VECRUIsIVEC) + ETA*VECRITsIVEC-1} 
IF(TeEQe1)GO TO 38 
R = R+SUBDIACI-1)*VECR(I=1sIVEC-1) 
U = UtSUBDIA(T<1)#VECR(I-1sI1VEC) 
38 L.= I+] 
IF(LeGTeM)GO TO 40 
DO 39 JalLeM 
R = RH (Ie J)*#VECR( Js TVEC-1) 
39 U = UtH( I sJ*VECR( Js IVEC) 
40 U = R¥R + UU 
IF (BeGE.U)IGO TO 41 
Be U 
41 CONTINUE 
IFCITEReEQe1}GO TO 42 
IF(PREVISsLEeB)GO TO 44 
42 DO 43 IzlsN 
WORK1(1) = VECR(Is1IVEC) 
43 WORK2(1) = VECR(IsIVEC~1) 
PREVIS = B 
IF (NSeEQe1)GO TO 46 
IFC ITERsGTe6)GO TO 47 
ITER = ITER+1] 
IF (BOUND eGTeSQRT(S))GO TO 23 
NS = 1 
GO TO 23 
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44 DO 45 IeloN 
VECR(TsIVEC) = WORKI(I1) 
45 VECR( Is EVEC-1) =wORK2(1) 
46 INDIC(IVEC-1) = 2 
INDICCIVEC) = 2 
47 RETURN 
END 


ADDED IN Proor. A small alteration to the program is desirable. 
The four statements in the subroutine SCALE, page 822, lines 
3-6, should be replaced by the four statements below. The 
alteration is necessary so that the program will also give correct 
eigenvectors for the case when no convergence of the process of 
sealing occurs. 


PRFACT (I) = 10 
po 12 J=1,N 
12 A(U,J) = H(,J) 
ENORM = 1.0 


CERTIFICATION OF ALGORITHM 348 [F1] 

EIGENVALUES AND EIGENVECTORS OF A 

REAL GENERAL MATRIX [J. Grad and M. A. 
Brebner, Comm. ACM 11 (Dec. 1968), 820-826] 

H. D. KnosBie (Reed, 2 July 1969 and 18 Sept. 1969) 

The Pennsylvania State University, Computation 
Center, University Park, PA 16802 


KEY WORDS AND PHRASES: norm, characteristic equation, 
degenerate eigensystem, diagonalizable matrix, defective matrix 
CR CATEGORIES: 5.14 


The program used for this certification was copied directly from 
the printed Fortran algorithm [1]. In addition to incorporating 
the suggested modification, the algorithm as used here was modi- 
fied to operate completely in double precision arithmetic. The 
tests were run on an IBM System/360 model 67 using Fortran IV, 
double precision arithmetic (15 significant decimal digits; ¢ = 53). 
One criterion for measuring numerical precision of the results was 
a norm of a residual matrix. That is, given a coefficient matrix, 
A of order n, pose the characteristic equation as AX, = yzXz and 
define the norm of M as || M ||. = max;(:|m.; |), where 
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M = (My) = (AXz — yzXx), Xx is the kth right-hand eigenvector 
of A, and y, is the kth eigenvalue for k = 1, 2, ++. ,n. 

The norm || M ||1 essentially measures the worst eigenvalue- 
eigenvector pair associated with the characteristic equation. In 
order to gain information concerning the other extreme, as well 
as an average measure of precision, the notations | M {min == 
mins (Dos | ms. |) and || M lleve = DosCXe | mi.z })/n will be used 
respectively, the former simply indicating the quantity is not a 
matrix norm. 

The algorithm’s performance was also analyzed by generating 
test matrices with certain known properties thereby permitting 
comparisons to be made between computational and theoretical 
results. 

The objective was to study the algorithm’s sensitivity to ill- 
conditioning and degeneracy by observing its behavior relative to 
the speed and precision, and accuracy where possible, with which a 
variety of eigensystems could be solved. Testing was carried out 
by entertaining four sets of matrices as follows: 

CasE 1. Small Matrices with Known Solutions. Several 
matrices from each of [1, 2, 3] varying in order from 3 to 8 yielded 
eigenvalues, and eigenvectors where documented, accurate to at 
least 7 decimal places. The largest 1-norm was || M ||, < 107%; 
|| 1 llave averaged 1074; and the largest value of | !| min, was less 
than 10-4. Maximum computation time for any of these matrices 
was less than a second. 

Included in this test was a matrix, A, belonging to a large class 
of test matrices discovered by Gear [3]. This matrix, A = (a;,;) 
of order 8 is defined as: 

Giin = Gis = 1, 
A= dis = 3,3 = 1, 
Qi; = 0, otherwise. 
This nonsymmetric matrix has a zero trace and eigenvalue pairs: 
+2, +1, +1, +1. The algorithm yielded four of the eigenvalues 
accurate to 15 decimal places and four values accurate to 7 places. 

Deserving special note here is example (ii?) presented with 
the original algorithm. As the authors [1] stated, although this 
matrix when transformed by scaling becomes invariant under the 
QR process, the original, single precision algorithm yielded correct 
results. However, the double precision version failed completely 
regardless of the value of the hardware parameter t. In addition, 
the algorithm may erroneously indicate success for this case; how- 
ever, with the machine configuration noted earlier, failure was 
correctly indicated. 

CasE 2. Degenerate and Defective Matrices. Using an al- 
gorithm suggested by the work of Hall and Porsching [4], a de- 
generate, nonsymmetric matrix of order 30 with known positive 
eigenvalues was generated with eigenvalues: y1 = 30; ys = 25 
for? = 2,3,--- ,10; y; = 31—7 for? = 11, 12,--- , 20; and 
ys = 1 for 7 = 21, 22,--- , 30. All eigenvalues were returned 
accurate to at least 14 decimal places; || M ||1 < 107", || M |lave < 
10%, and | M |min < 107. Computation time was about 4 sec. 

Gear [3] defines a class of matrices including a matrix B of order 
25 such that 


for 1 =1,2,-:-,7, 


AI 01 0) 01010 
I AIOoO 10100 
B=1!10 I ATI O where A=/|0 101 Of, 
001AIzI 00101 
0101 A 01010 


and I is the identity matrix. Using the theory developed by Gear 
{3], it is easy to show the matrix B has 11 zeros, six pairs of eigen- 
values equal to +2, and one pair of eigenvalues equal to 4. 
The algorithm yielded 14 eigenvalues accurate to 7 decimal places 
and 11 eigenvalues (not all the zero values) with at least 14 place 
accuracy; || M ||; < 107%, || M |lave < 10°, and | M \min < 
10-4. Computation time was less than 3 seconds. 

To gain a measure of the algorithm’s ability to separate eigen- 
vectors corresponding to the same eigenvalue, a degenerate sym- 
metric matrix was generated using an algorithm of Ortega [5]. 
Briefly, a similarity transformation was used to generate a matrix 
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A of order 6. That is, using Ortega’s notation, A = CDC where 
D = diag(1,2,3,1,2,3), C = (I—2vv’), and v is a column vector with 
each element in this case equal to 1/./6. For this case || M ||: < 
10, || M |lave < 107, and | M |min < 10-%. The eigenvectors 
corresponding to each eigenvalue pair are listed below to three 
decimal places. 


Eigenvalue Trans posed eigenvectors 
1 (+.480, —.438, —.4388, —.042, —.438, —.438) 
(+.449, —.447, —.447, —.002, —.447, —.447) 
2 (—.254, +.723, —.254, —.254, —.469, —.254) 
(—.151, +.745, —.151, —.151, —.595, —.151) 
3 (— .3828, —.328, +.672, —.328, —.328, —.344) 


(— .328, —.328, +.671, —.329, —.329, —.342) 


Even though the matrix A is obviously not defective, by inspec- 
tion it can be seen that the algorithm did not yield well-separated 
eigenvectors. This fact is also evident by noting that if the al- 
gorithm extracts independent eigenvectors they will be returned 
orthogonal (in fact orthonormal), yet the determinant of the 
eigenvector matrix for this case is less then 104 in absolute value. 

Case 3, Ill-Conditioning. Two ill-conditioned matrices sug- 
gested by Wilkinson [6] were solved. One of these is a matrix A 
of the form: 

ait = 21 —7 
_ )Gitn = 20 
ee ain3 = 0 
G1 = € 
whose eigenvalues are very sensitive to perturbations of «. With 
e = 0, the matrix is triangular and the eigenvalues were returned 
accurate to 15 places with || M ||: < 10-4, || M |lave < 10-4, and 
| M |min < 10-8, As Wilkinson [6] points out, with e = 10-" the 
eigenvalues change drastically, having been computed in this 
case in complete agreement with this reference. For the perturbed 
case || M ||, > 107, || M lave > 10- and | M [min remained less 
than 10-4, 

The algorithm was tested under a combination of ill-condi- 
tioning and degeneracy by generating nonsymmetric matrices as 
in Case 2, but of order 20, conditioned such that maz | eigen- 
value | = 10° X min | eigenvalue | for j = 2, 3,---, 20; de- 
generacy was introduced by generating the matrices with only 10 
distinct eigenvalues. The values of || M ||: for the matrices tested 
in this class were such that || M ||1 ~ 107-4 for 7 = 2, 3,---, 18. 
|| M |lave followed a similar curve; | M lmin < 10-4 for j < 14 
and never exceeded 10-§. Although the algorithm indicated suc- 
cess, severe computational breakdown was evident during this 
test for values of j greater than 18. However, the largest eigen- 
value in every case was returned accurate to 15 decimal places. 
Computation time for matrices of order 20 was consistently less 
than 2 seconds. 

Case 4. Large Matrices. Several nonsymmetric matrices of 
order 50 with elements uniform on the interval (0, 50) were solved 
yielding the following average figures: || M ||1 < 10-, || M |lave < 
10-", and | M Kia < 10-". Computation time averaged 31 
seconds. 

A diagonal matrix A of order 50 with elements: 

A = (it = 1, for « = 1, 10, 20, 30, 40, 50, 
ai3 = 0, otherwise, 
was solved yielding || M ||: < 107%, || M lave < 1071’, and 
M |min < 107%, 
Computation time was about 5 seconds and all eigenvalues were 
returned correct to 15 decimal places. 

Concuusions. The algorithm is capable of successfully comput- 
ing eigenvalues and eigenvectors of real general matrices even 
under conditions considered unstable. It has the advantage of 
being documented in ANSI (USASI) Fortran, being computa- 
tionally fast, and has the capability of yielding results with as 
much precision as the hardware will permit. The algorithm does 


» for i447 = 1,2,---,20;7 #74,7+1 
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not break down when presented with a matrix which is not diag- 

onalizable; that is, a set of eigenvectors satisfying the eigen- 

equation is computed regardless of the existence of linearly inde- 
pendent eigenvectors. However, when a matrix is diagonalizable 
and degenerate, the algorithm does not yield well separated eigen- 
vectors corresponding to non-distinct eigenvalues. Another 
apparent disadvantage is the possible indication of completely 
successful computation (INDIC), even in clearly ill-conditioned 
situations where computational difficulties are inevitable. This 
latter property, however, is a common fault of other algorithms 
as well. 
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This remark reports certain failures of Algorithm 343 when 
applied to pathological matrices. The smallest example is a 4 x 4 
matrix for which 16 guard bits (5+ digits) proved insufficient; all 
computed eigenvalues were incorrect in the most significant digit. 

The algorithm was implemented on an IBM System/360 model 
50 using Fortran IV-G. The program was not modified to operate 
completely in double precision as was done for Knoble’s certifica- 
tion [2]. Satisfactory agreement was obtained for the three sample 
matrices given with the algorithm. 


Example A 
— 50 53 52 51 
— 52 1 53 52 
—53 0 1 53 
—51 53 52 52 
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The exact eigenvalues are all 1. The computed eigenvalues follow. 
(Computed eigenvalues are reported rounded to 2 places after 
the decimal point, any further figures being, rather obviously, 
pointless.) 


2.35 
1.03 + 1.38 7 
--0.41 


The maximum error in a computed eigenvalue exceeds 2 percent 
of the largest element of the matrix. 


Example B 
~41 55 4 3 2 51 
— 2 10 55 4 3 2 
— 3 0 10 55 4 3 
— 4 0 0 10 55 4 
— 55 0 0 0 10 55 
—5l 55 4 3 2 61 


The exact eigenvalues are all 10. The computed eigenvalues: 


14.76 + 2.92 72 
9.70 + 5.33 7 
5.54 + 2.39 2 


The maximum error in a computed eigenvalue exceeds 9% of the 
largest element in the matrix. 


Example C 
—91 —94 0 0 0 0 0 0 
95 98 0 0 0 0 0 0 
90 99 5 0 0 0 0 0 
90 0 99 6 0 0 0 0 
90 ) 0 99 7 0 0 0 
90 0 0 0 99 8 0 0 
90 0 0 0 0 99 9 0 
99 99 0 0 0 0 99 10 


The exact eigenvalues are 3, 4, 5, 6, 7, 8, 9, 10. The computed 
eigenvalues are: 


12.68 
10.96 + 3.73 7 
6.47 + 5.38 12 
2.09 + 3.73 1 
0.27 


Although all eigenvalues are real, the imaginary part of one pair 
of computed eigenvalues exceeds 5 percent of the largest element 
of the matrix. This matrix, like the other two, was maliciously 
devised to take advantage of the program; it is indicative of this 
that the transpose, being already in lower Hessenberg form, fares 
much better, all computed eigenvalues being correct to within 
+0.05. 

Although, in view of the known sensitivity of multiple eigen- 
values to small changes of certain elements of certain matrices, 
such counter examples are to be expected, it is probably worth 
putting a few examples on record as the casual and unsophisticated 
user is more apt to take warning of the dangers of eigenvalue 
computations in single precision from a concrete case. 
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We had at our disposal a double precision version (all real 

variables are declared to be of type double precision) for the IBM 
360/50 of the algorithm 343 [1] with logical JF statements converted 
to arithmetical ones. In the following three modifications which we 
found to be of practical value are to be discussed. 
a. Modification of the test of smallness of R in HESQR: 10 and 
11 lines after statement 21, a test is rnade on R whether it is zero or 
not. Because R is not of type integer such a test is almost inefficient. 
Let us call @ some value representing the order of the elements of 
the matrix A (for example the Euclidean norm of A), «4 a small 
positive number numerically representing zero elements of A and 
em the relative machine accuracy. In HESQR e. is chosen to be 
€4 ~ ae, . By inspection of the formulas it is seen that R is of the 
order of a?; therefore R should be considered to be small if R < 
em = €42/en. This is equivalent to R/e, < e4/en , Which does not 
have the risk of underflow. 

Following these ideas we changed the statements 


IF(R.EQ.0.0) SHIFT= A(M,M — 1) 
_IF(R.EQ.0.0) GO TO 21 
Z=A(K+2,K+1)*A(K+1,K) 


10, 11, and 12 lines after statement 21 to 


IF(R/EPS — EPS/EX) 215,215,217 
215. IF(SHIFT— A(M,M—1)) 216,217,216 
216 SHIFT=A(M.M—1) 

GO TO 21 
217) Z=A(K+4+2,K+1)*A(K+1,K) 


(keeping in mind that e, = EPS and e,, = EX), and we were able 
to solve example (i) and (ii) as well as example (iii) of [1]. 

b. Modifications in EIGENP: In order to suppress unnecessary and 
possibly impermissible computations in case of failure, the sub- 
routine EIGENP was modified as follows. We changed the state- 
ment L = O, two lines after statement 15, to 


ISW=INDIC() —1 
IFUISW) 24,152,152 


152. L=0 
statement 

16 DO18 J=1,N 
to 


16 IFUSW) 24,161,162 
161 IF(L) 232,202,232 
162 DO18 J=1,N 


and statement 
EVR(D = EVR(D*ENORM 
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one line after statement 20, to 
202 EVR(I)=EVR(I)*ENORM 
Statements 

21 KON=1 


EVI(I—1) = —EVI(D 
have been removed and. reinserted as 
232 KON=1 


EVI(I—1) = —EVI(D 


between statement 23 and 24. Finally statement 


R=0.0, 
five lines after statement 21, has been changed to 
21 R=0.0 


c. Modifications in SCALE: It seems to be reasonable to change 
statement , 


Q=Ail,J) 
preceding statement 9 to 


IF(I— J) 88,89,88 
88 AU, J) = HJ) *PRFACT(D /PRFACT(J) 
89 OQ=ACLJ) 


so that even in case of many iterations being necessary to calculate 
PRFACT, the relation of similarity of the result matrix to the input 
matrix will almost not be changed by rounding errors. 


References 

1. Grad, J., and Brebner, M.A. Algorithm 343, Eigenvalues and 
eigenvectors of a real general matrix. Comm. ACM 11 (Dec. 1968), 
820-826. 
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ALGORITHM 344 

STUDENT’S ¢-DISTRIBUTION [S14] 

Davip A. Levine (Recd. 26 Mar. 1968 and 2 Aug. 1968) 

State University of New York at Stony Brook, Stony 
Brook, NY 11790 

KEY WORDS AND PHRASES: Student’s ¢-Distribution, t- 


test, small-sample statistics, distribution function 
CR CATEGORIES: 5.12, 5.5 


Comment [t-Test evaluates in single-precision the value of 
Student’s [2] ¢-distribution for argument 7 and degrees of freedom 
DF. The two-tailed Student’s t-distribution, A, is obtained as the 
indefinite integral: 


) a _DFti 
A(T, DF) = / =.) 2 
( F)=C (+3 dz 


where C is chosen so that A (0, DF) = 1. 

The integration of A can be accomplished exactly by integrat- 
ing by parts successively, obtaining: 
for DF an odd integer, 


A(T, DF) =1-—- = {arctan a + ab E +b () + pb (a) 
7 3 3 5 


ate ce oe 240 DF —3 
3°85 DF — 2 : 


and for DF an even integer, 


a eg ee 
24  DF-2)\’ 


b= (1+ a)-|, 


T 
where a = —==, 
/DF 

A Fortran program evaluating these series is given below, 
giving at least six correct significant figures after the decimal— 
more than enough accuracy for most statistical applications. The 
t-Test is usually applied in small-sample statistics [1] where DF < 
30. The algorithm presented here is faster and simpler, with ac- 
curacy equal to previous algorithms for DF < 30. In the range 
30 < DF < 100, this algorithm is competitive in speed and ac- 
curacy with previous algorithms. For the range DF > 100, small- 
sample assumptions may be altered by replacing the integrand of 
the distribution by a Gaussian (normal) curve; hence much greater 
speed is obtained in this range by employing, for example, Al- 
gorithm 209 [3]. Instructive comments and bibliography are ob- 
tainable from Algorithm 321 [4], where an algorithm competitive 
for the range 30 < DF < 100 is presented and the use of Algorithm 
209 is discussed. 

Thanks to the referee for many helpful suggestions, which have 
been incorporated, and to Joan Warner, who has aided in the pro- 
gramming and testing of this algorithm. 
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SUBROUTINE TTEST 
Cc HH KKH KKK EHH HEHEHE 


* (TeOF s ANS +KERR) 


Cc 
REAL ANS +eD1sDeeFleFeeTeTleTe 
Cc 
INTEGER DF sl sKERReN 
Cc 
DATA D17e63661977T7 
CG 
Cc 0263661977236 7T58134e0e0e= 2/ PI 
C 
KERR = 90 
c 
IF (DF eGTeC) GO TO 1 
Cc 
C ERROR RETURN IF DF NOT POSITIVE 
Cc 
KERR = 1 
ANS = Oe 
RETURN 
Cc 
c BEGIN COMPUTATION OF SERIES 
Cc 
1 T = ABS(T) 
T1 = T/SGRT(FLOATCDF)) 
Te = LefCletT1*T1) 
G 
IFC (DF/S2)*¥2eEQeDF) GO TO 5 
Cc 
C DF IS AN ODO INTEGER 
C 
ANS = 1l1e-D1I*ATAN(T1) 
Cc 
IF (DF eEGel1) GO TO 4 
Cc 
D2 = D1i*T1L*T2!? 
ANS = ANS-D2 
cS 
IF (DF eEQe3) GO TO 4 
Cc 
Fil = O5o 
2 N = (DF~2)/72 
DO 3 I=1+4N 
F2 = 2@e*FLOAT(I)-F1 
De = D2*TA*F2/ (Fet+1e) 
3 ANS = ANS-De2 
C 
Cc COMMON RETURN AFTER COMPUTATION 
pe 


4 IF CANSeL TeOe) ANS = Oo 
RETURN 
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Cc DF IS AN EVEN INTEGER 
Cc 
5 De = T1*SQRTA4T2) 
ANS = 1e-De2 
Ls 
IF (DF eEQe2) GO TO 4 
C 
Fi = le 
GO TO @ 
END 
REMARKS ON 


ALGORITHM 321 [S14] TEST PROBABILITIES 
[John Morris, Comm. ACM 11 (Feb. 1968), 115-6] 

ALGORITHM 344, STUDENT'S t-DISTRIBUTION 
[David Levine, Comm. ACM 12 (Jan. 1969), 37-8] 

G. W. Hitz, anp Mary Loucuunap* (Recd. 16 Apr. 
1969 and 29 Sept. 1969) 

Commonwealth Scientific and Industrial Research Or- 
ganization, Division of Mathematical Statistics, Glen 


Osmond, South Australia 
* Present address: Monash University, Clayton, Victoria, 
Australia 


KEY WORDS AND PHRASES: t-test, Student’s t-statistic, 
distribution function, approximation 
CR CATEGORIES: 5.12, 5.5 


Algorithm 321, as published, was coded in CSIRO 3200 Atego 
and run on a CDC 3200 with programmed floating point opera- 
tions. A FortTRAN equivalent of Algorithm 321 was run for com- 
parison with the Fortran Algorithm 344, which uses the same 
recurrence relation based on Student’s cosine formula as that 
used in Algorithm 321 for df degrees of freedom less than mazn. 
Numerical results agreed with 6-digit tabulated values [1] and 
double precision calculations indicate that accuracy is limited by 
truncation of intermediate results to the precision of the proces- 
sor, with error in the final result increasing as the square root of 
df. Timing tests rated Algorithm 344 at approximately (3 df+14) 
msec; slightly faster than Algorithm 321, which required ap- 
proximately (2 df+-2})msec' for df < mazn. 

For df > maxn Algorithm 321 uses Fisher’s [2] fifth order ap- 
proximation, whose accuracy is summarized in the diagram for 
df = 10(10)50 (see Figure 1). The shaded regions indicate values 


-3 
-4 
-5 
~~ df=10 
Ss 
* algorithm 321 
ro) » 
— -6 
fe) 
B  [-axt0%-—-—apy- 
ay 
~8 im 
“85 1 2 3 5 


4 
Student's t 
Fria. 1 
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of ¢ for which the claimed accuracy of 3 X 1077 for maun = 30 
is not attained. For ¢t > 6.0 this algorithm returns zero values, 
giving errors up to 1.39 X 10-°. The following alterations avoid 
this error and, by ‘“‘nesting’’ Fisher’s polynomial approximation, 
reduced the time from about 25msec to 20msec and reduced the 
store requirement by 27%. 


Replace the 19 lines beginning “g: ¢ := 1.0 — ?? by 


g: 2:=10-¢ 
end else 
begin z := 2.0 X gauss (—#); 
if df < 16 then 
begin real f, (2; 
f :=0.25/df; i2:=tX ¢; 
zi= (((((C(((¥((((8.0X t2— 183.0) X £2 
+1764.0) X i2— 7516.0) X #2+-5994.0) X ¢2-+- 2490.0) X 12 
+1140.0) X 12+180.0) X 12+-5355.0) X12+17955.0) * f 
+ ((((((15.0X t2—375.0) X 12+2225.0) X 12— 2141.0) Xt2 
— 939.0) X 12—213.0) x 12—915.0) K12+945.0)  f/60.0 
+ ((((2—11.0) x i2+14.0) x #2+6.0) Xi2—3.0) Xi2—15.0) K f 
+ ((8.0% t2—7.0) X12—5.0) X 12—3.0) Xf/6.0 
+ (#2+1.0)) * f X & & 0.7978845608 XK exp (—0.5Xi2) + 2 
end; 
ttest := if  < 0.0 then 0.0 else x 


The last statement, recommended by the referee, avoids negative 
results due to rounding errors when the answer is small. 

In Algorithm 344 the three statements beginning ‘1 T = 
ABS(T)” were replaced by: 

1 T2 = T*T/FLOAT (DF) 
Tl = SQRT(T2) 
T2 = 1./(1.4+-T2) 
to avoid changing the calling parameter T. 

Although Algorithm 321 occupies about twice the store space 
needed for Algorithm 344, and is slightly slower for df < mazxn = 
30, it is about three times faster for df = 100. 

REFERENCES: 

1. Smtrnov, N. V. Tables for the Distribution and Density Func- 
tions of t-distribution. Pergamon Press, New York, 1961. 

2. Fisuer, R. A. Expansion of ‘‘Student’s” integral in powers of 
n-1, Metron. 6, 3 (1926), 109-112. 


REMARKS ON: 

ALGORITHM 332 [S22] 

JACOBI POLYNOMIALS [Bruno F. W. Witte, Comm. 
ACM 11 (June 1968), 436] 

ALGORITHM 344 [814] 

STUDENT'S t-DISTRIBUTION [David A. Levine, 
Comm. ACM 12 (Jan. 1969), 37] 

ALGORITHM 351 [D1] 

MODIFIED ROMBERG QUADRATURE [Graeme 
Fairweather, Comm. 12 (June 1969), 324] 

ALGORITHM 359 [G1] 

FACTORIAL ANALYSIS OF VARIANCE [John R. 
Howell, Comm. ACM 12 (Nov. 1969), 631] 

Artuur H. J. Save (Recd. 16 Feb. 1970) 

Basser Computing Department, University of Sydney, 
Sydney, Australia 


KEY WORDS AND PHRASES: Fortran standards 
CR CATEGORIES: 4.0, 4.22 


An unfortunate precedent has been set in several recent al- 
gorithms of using an illegal FORTRAN construction. This oon- 
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sists of separating an initial line from its continuation line by a 
comment line, and is forbidden by the standard (see sections 3.2.1, 
3.2.3 and 3.2.4 of [1, 2]). The offending algorithms are to date: 
332, 344, 351 and 359. 

While this is perhaps a debatable decision by the compilers of 
the standard, and trivial to correct, it seems a pity to break the 
rules just for a pretty layout as has been done. 

REFERENCES: 

1. ANSI Standard FORTRAN (ANSI X3.9-1966), American 
National Standards Institute, New York, 1966. 

2. FORTRAN vs. Basic FORTRAN, Comm. ACM 7 (Oct. 1964), 
591-625. 
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ALGORITHM 345 

AN ALGOL CONVOLUTION PROCEDURE BASED 

ON THE FAST FOURIER TRANSFORM [C6] 

RIcHARD C. SINGLETON* (Recd. 30 Dec. 1966, 26 July 
1967, 19 July 1968, and 8 Nov. 1968) 

Stanford Research Institute, Menlo Park, CA 94025 


* This work was supported by Stanford Research Institute out of Research 
and Development funds. 

KEY WORDS AND PHRASES: fast Fourier transform, complex 
Fourier transform, multivariate Fourier transform, Fourier 
series, harmonic analysis, spectral analysis, orthogonal poly- 
nomials, orthogonal transformation, convolution, autocovari- 
ance, autocorrelation, cross-correlation, digital filtering, 
permutation 

CR CATEGORIES: 3.15, 3.88, 5.12, 5.14 


Stockham [6] and Gentleman and Sande [3] have shown the prac- 
tical advantages of computing the circular convolution 


nw1 
C= Sy AB e aindin's k= 0, l,--- 


im0 


snd, 


of two real vectors A and B of period n by the fast Fourier trans- 
form (2, 3, 4]. The Fourier transforms 


nal 


aj = 2) A, exp(i2rpj/n) 
p=0 


and 


n-1l 


8; = » B, exp (i2rqj/n) 
= 


are first computed, then the convolution 
1 ml 
Cy =— dD) aj8;* exp (i2rjk/n) 
n 4 


i=0 


where §;* is the complex conjugate of 8; . By this method the num- 
ber of arithmetic operations increases by a factor slightly more 
than 2 when 7 is doubled, as compared with a factor of 4 for the 
direct method. Tests show a 16 to 1 time advantage for the trans- 
form method at n = 256. 

The operation of convolution is used in computing autocorrela- 
tion and cross-correlation functions, in digital filtering of time 
series, and many other applications. 

Procedure CONVOLUTION computes the convolution of two 
real vectors of dimension n = 2”. The special features of this pro- 
cedure are: (1) the usual reordering of the fast Fourier transform 
results is avoided, and (2) the return from frequency to time is 
made with a transform of dimension n/2 instead of n. The two 
vectors A and 8 are first transformed with a single complex 
Fourier transform of dimension ». The complex product a6* is 
then formed, leaving the result in reverse binary order. Since the 

convolution is real-valued, the real part x of the complex product 
is an even function and the imaginary part y is an odd function; 
thus the Fourier transform of x is real and that of y is imaginary. 
These properties lead to the identity 
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T(e -+ iy) = Re(Tz) — Im(Ty) 
Re(T(x% — y)) + Im(TG@ — y)) 


where 7 represents the Fourier transform and Jz + ty) is the 
desired convolution. We subtract y from z, yielding a real vector 
-of dimension n, then transform using a complex transform of di- 
mension .n/2 and add the resulting cosine and sine coefficients to 
give the convolution. Thus with procedure CONVOLUTION we 
make maximum use of the complex Fourier transform in each di- 
rection and avoid any reverse binary to binary permutation. The 
Fourier transform 


Il 


T(A +7B) = a+ 7B 


of the two original vectors is available in reverse binary order on 
exit from the procedure. We can permute this transform to normal 
order with procedure REVERSEBINARY and readily compute 
the power spectra and cross spectrum of the two data vectors. 

Procedure CONVOLUTION uses procedure REALTRAN, given 
in Algorithm 338 [5], but repeated here with revisions to improve 
accuracy on computers using truncated floating-point arithmetic. 
Procedures FF74 and REVFFT4 are also used and perform the 
same computation as procedures FFT2 and REVFFT2 given in 
Algorithm 338 for use on asystem with virtual memory. The trans- 
form procedures given here are organized without regard to the 
problem of memory overlay. This change yields a 10 percent reduc- 
tion in computing time on the Burroughs B5500 for transforms of 
dimension n = 512 or smaller. Procedure FFT4 is based on an 
organization of the fast Fourier transform due to Sande [3], and 
procedure REVFFT4 is similar to the method proposed by Cooley 
and Tukey [2], except that the data is in reverse binary order. In 
both cases, trigonometric functions are used in normal sequence, 
rather than reverse binary sequence, thus eliminating the need 
for a reverse binary counter. Another gain in efficiency comes from 
reducing the time for computing trigonometric function values. 
The following difference-equation method is used: 


cos(k0) ~— (C X cos(ké) + 8 X sin(ké)) 


cos((k + 1)a) 
and 
sin((k + 1)@) 


where the constant multipliers are C = 2 sin?(@/2) and S = sin(6), 
and the initial values are cos(0) = 1 and sin(0) = 0. 

These initial values should be computed to full machine preci- 
sion; if necessary, a stored table of sin(@) for @ = 2/2, 1/4, x/8, 
+++ , r/n can be added to procedures FFT4 and REVFFT4. Using 
the standard sine function to compute initial values, the ratio of 
rms error to rms data is about 2 X 10" for the transform-inverse 
pair at m = 512 on the Burroughs B5500 computer; this error is 
about the same as that obtained when the sine and cosine functions 
are used for all trigonometric function values. On a computer 
using truncated, rather than rounded, arithmetic operations, the 
sequence of values for cos(ké) + 7 sin(ké) tends to spiral inward 
from the unit circle. Since the error is primarily one of magnitude, 
rather than angle, rescaling to the unit circle at each step gives a 
satisfactory correction. This correction is included in procedures 
FFT4 and REVFFT4 but may be removed to improve running 
speed if rounded arithmetic is used. 

Procedures FF78 and REVFFTS are included as possible sub- 
stitutes for FFT4 and REVFFT4. These procedures use radix 8 


sin(ké) -+- (S X cos(ké) — C X sin(ke)), 
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arithmetic [1], rather than radix 4, and run about 20 percent faster 
on the Burroughs B5500 computer; however, the compiled code is 
twice as long. The code could be shortened by use of subscripted 
variables and FOR statements, but this change would probably 
eliminate most of the time-saving. 

The permutation procedure REVERSEBIN ARY is based on 4 
modified dual counter, one in normal sequence and the other in 
reverse binary sequence. In permuting a vector of dimension n, 
the normal sequence counter goes from 1 to n/2 — 1, and the ele- 
ments indexed 1, 3, --- ,n/2 — 1 are exchanged with their reverse- 
binary counterparts (indexed greater than or equal to n/2) with- 
out need of a test. The reverse binary counter is incremented 
only n/4 times, and exchanges of pairs of elements below n/2 
are done jointly with pair exchanges in the upper half of the 
array; i.e. if 2; and a are exchanged, where 7, k < n/2, then 
Yn1-j and Yy.1-. are also exchanged. This procedure is twice as 
fast on the Burroughs B5500 as REORDER given in Algorithm 
338 [5] and is the better choice when the additional features of 
REORDER are not needed. For a single-variate, complex Fourier 
transform of dimension n = 2”, 


REVERSEBINARY(A, B,m); 
REVFFT8(A, B,n, m, 1) 


was found to be the best combination for n S 512 on the B5500 
computer, giving a time of 0.79 sec. for n = 512. 
REFERENCES: 
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procedure CONVOLUTION (A, B, C, D, m, scale); 
value m, scale; integer m; real scale; array A, B, C, D; 
comment This procedure computes the circular convolution 


Ch = scale om A;B jan mod ny k= 0, 1, eo 
7=0 


where n = 2" and p mod n represents the remainder after divi- 
sion of p by n. (It is assumed that m=1.) Arrays A, B[0 : n—1] 
originally contain the two data vectors to be convoluted, and 
on exit, contain the Fourier transform of A + 7B arranged in 
reverse binary order. A and B must not be the same array. 
On exit, array C[0 : n—1] contains the convolution multiplied 
by the factor scale. Array D is ascratch storage array with lower 
bound zero and upper bound at least n + 2. If the Fourier 
transform of the data is not needed, the procedure can be called 
with arrays A and B used for C and D in either order, for ex- 
ample, CONVOLUTION (A, B, A, B, m, scale). If the Fourier 
transform is used, it should first be permuted to normal order 
by the call REVERSEBINARY (A, B, m). After doing this, the 
Fourier cosine coefficients of the A vector are 


{Azt+An-z)/n, k= 1, 2, omens n/2, 
(2A )/n, k= 0, 


»n—1, 
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and the sine coefficients are 
(Br—Bn-x)/n, k = 1, 2,+-- 


The Fourier cosine coefficients of the B vector are 


,n/2 — 1. 


(BitBa-x)/n, k = 1,2,++- , n/2, 
(2Bo)/n, k=0, 

and the sine coefficients are 
(An-r—-Ax)/n, k=1,2,+++,n/2—1. 


The procedures FFT4, REVFFT4, and REALTRAN are used 
by this procedure and must also be declared. If convolutions of 
large dimension are to be computed on a system with virtual 
memory, procedures FFT2 and REVFFT2 (Algorithm 338) 
(5] should be substituted for procedures FFT4 and REVFFT4; 
begin integer j, kk, ks,n; real aa, ab, ba, bb, im; 
n:= 2m; j:=1; 
FFT4(A, B,n, m,n); 
C[0] := 4 X (A[0]XB[0)); 
L: kk:=j; kss=j:=jt+4; 
2: ks := ks — 1; 
aa := A[kk] + Alks]; ab := A[kk] — Al[ks]; 
ba := Blkk] + Blks]; 6b := B[kk] — Blks); 
im := ba X bb + aa X ab; aa := aa X ba — ab X bb; 
Cikk] := aa — im; Clks] := aa + im; 
kk := kk +1; if kk < ks then go to L2; 
if 7 <n then go to L; 
kk := n+ 2; ks := kk —1; scale := scale/(8Xn); 
for j := 0 step 1 until ks do D{j] := Clj+kk]; 
REVFFTA(C, D, kk, m—1, 1); 
REALTRAN (C, D, kk, false); 
C[0] := scale X C(O]; C[kk] := scale X C[kk]; 
for j := 1 step 1 until ks do 
begin C[n—j] := scale X (ClQ]—D[j]); 
end 
Cj] := scale X (Clj]+DL)) 
end CONVOLUTION; 
procedure FFT4(A, B, n, m, ks}; 
integer n,m,ks; array A, B; 
comment This procedure computes the fast Fourier transform 
for one variable of dimension 2” in a multivariate transform. 
n is the number of data points, i.e. nm = m1 X ma X +++ XK Np 
for a p-variate transform, and ks = ne X ney X -+* X My, 
where n, = 2” is the dimension of the current variable. Arrays 
A[0:n—1] and B[0: n—1] originally contain the real and 
imaginary components of the data in normal order. Multivari- 
ate data is stored according to the usual convention, e.g. aju 
is in A[jXneXnstkxXnstl] for j = 0,1,°++,m—1, k= 0, 
1,-++,nm2— 1, andl = 0,1,-++ , ns — 1. On exit, the Fourier 
coefficients for the current variable are in reverse binary order. 
Continuing the above example, if the ‘‘column’”’ variable m 
is the current one, column 


k= Kn 12") + Ky_o2-? 4 os + 12 + ko 


value n, m, ks; 


is permuted to position 


Keodm—) + ky2m-2 ee Kime? + Km 


A separate procedure may be used to permute the results to 
normal order between transform steps or all at once at the end. 
If n = ks = 2”, the single-variate transform 


nal 


(xz+ty;) = 2) (@etibe) exp (2njh/n) 
k=0 


for j = 0,1,--- , — 1 is computed, where (a+b) represent 
the initial values and (x+y) represent the transformed values; 
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begin integer k0, k1, k2, k3, k, span; end 
real AO, Al, A2, A8, BO, B1, B2, B3, re, im; end FFT4; 
real rad, dc, ds, cl, c2, c3, s1, s2, 83; procedure REVFFT4(A, B,n, m,ks); value n, m, ks; 
span := ks; ks := 2tm; rad := 4.0 X arctan(1.0)/ks; integer n, m,ks; array A, B; 


comment This procedure computes the fast Fourier transform 
for one variable of dimension 2” in a multivariate transform. 
nm is the number of data points, i.e. n = m1 X n2 K++: XK Np 


ks := span+ ks; n:=n-—-1; k:=m; 
for m := m — 2 while m 2 0 do 


egin 
ashe := 1.0; sl :=0; k0:=0; k:= ks; for a p-variate transform, and ks = ney1 X mig2 X ++ X yp, 
de := 2.0 X sin(rad) 2; rad := rad + rad: where n, = 2” is the dimension of the current variable. Arrays 
ds := sin(rad); rad := rad + rad; A[0:n—1] and B[O:n—1] originally contain the real and imagi- 
span := span + 4; nary components of the data with the indices of each variable 
La: kl := kO + span; k2:= kl + span; k8 := k2 + span; in reverse binary order, e.g. ajx is in Alj’/XneXnstk’Xn3+l] 
AO := A[k0); BO := B[k0]; for j=0, Les), m—1, k = 0, 1, eae | —1, and /=0, 
Al := A[kl]; Bl := Blk); 1, +++ ms — 1, where j’, k’, and 1’ are the bit-reversed values of 
A2:= A[k2]; B2 := Bik2]; Jj, k, and 1. On completion of the multivariate transform, the 
A3 := A[k3]; B3 := Blk3); real and imaginary components of the resulting Fourier coeffi- 
A[k0] := AO + A2+ A1+ AB; cients are in A and B in normal order. If n = 2 and ks = 1, 
Bik] := BO + B2 + Bl + B3; a single-variate transform is computed; 
if 1 = 0 then begin integer k0, kl, k2, k3, k, span; 
begin real AO, Al, A2, A3, BO, B1, B2, B3; 
Afkl] :=: AO + A2— Al — A3; real rad, dc, ds, cl, c2, c3, sl, s2, 83; 
Bik] := BO + B2 — Bl — B3; rad := 4.0 X arctan(1.0); n:=n—1; 
A[k2] := AQ — A2— Bl + B3; KO := 0; span := ks; 
Bik2] := BO — B2 + Al — A3; comment If m is odd then compute for one factor of 2; 
A{k3] :=: AO — A2 + Bl — B3; if (m+2) X 2 # m then 
Bik3] := BO — B2— Al + A8 begin 


La: k2 := kO+ span; AO := A[k2]; BO := B[k2]; 


siee A[k2] := A[k0] — AO; A[kO] := A[k0}] + AO; 
begin Bik2] := B[k0] — BO; B[k0] := Blk] + BO; 
re := AQ + A2— Al — A3; im := BO + B2— Bi — B3; SS be open; Ane On ther go tot, 
A[k1] := re X c2 — im X 82; k0 := k0 — n; if kO ¥ span then go to La; 
Bik] := re X 82 + im X c2; span := span + span; rad := 0.5 X rad 
re := AQ — A2— B1+ B38; im := BO— B2+ Al — A3; end; 
A[k2] := re X cl — im X sl; for m := m — 2 while m 2 0 do 
Bik2] := re X sl + im X 1; begin 
re := AO— A2+ Bl— B3; im:= BO — B2— Al+ A3; cl := 1.0; sl :=0; k0:=0; rad := 0.25 X rad; 


A[k3] := re X c8 — im X 83; 

Blk3) := re X 88 + im XK 63 
end; 
kO := k38 + span; if k0 < n then go to La; 
kO := k0 -- n; if KO + k then go to La; 


de := 2.0 X sin(rad) T 2; 
:= sin(rad-+rad); k := ks; 

Lb: kl := kO + span; k2 := kl + span; k8 := k2 + span; 
AO := A[k0]; BO := B[k0]; 
if sl = 0 then 


comment If computing for the current factor of 4 is-not begin 
finished then increment the sine and cosine values; A2:= Alki); B2:= Bik); 
if kO ~ span then Al := A[k2]; Bl := B{k2]; 
begin A8 := A[k3]; B3 := B[k3] 
c2 := cl — (deXcl+dsXs1); . end 
sl := (dsXcl—dceXs1) + s1; else 
comment The following three statements compensate begin 
for truncation error. If rounded arithmetic is used, sub- A2 := Afkl] X c2 — Blkl] X s2; 
stitute cl := c2; B2 := A[k1] X s2 + B[kl] X c2; 
cl := 1.5-0.5 X (c2T2+s1 T 2); Al := A[k2] X cl — B[k2] X s1; 
sl := cl X sl; cl := cl X c2; B1 := A[k2] X sl + Blk2] x cl; 
c2 i= cl T2 = sl f 2; s2 := 2.0 cl X sl; A3 := Al[k3] xX 68 — Blk3] x 83; 
c3 := c2 X cl — s2X sl; 88 := c2 X sl + 82 X cl; B3 := A[k3] X 83 + Blk3] X 3 
k:=k-+-ks; goto La end: 
end; A[k0] := AO + A2+ Al + A3; 
ki=m Bik0] := BO + B2+ Bl + B3; 
and, Alkl] := AO — A2— Bl + B3; 
comment If m is odd then compute for one factor of 2; Bikl] := BO — B2+ Al — A3; 
if k ~ 0 then A[k2] := AO + A2 — Al — A3; 
begin Bik2] := BO + B2 — Bl — B3; 
span := span + 2; kO := 0; A(k3] := AO — A2-+ Bl — B3; 
Lb: k2 := k0 + span; AO := A[k2]; BO := B[k2]; B(k3] := BO — B2— Al + A3; 
A[k2] := A[kO] — AO; A[kO] := A[kO] + AO; kO := k38 + span; if kO <n then go to Lb; 
Bik2] := Bl[k0] — BO; BlkO}] := Blk0] + Bo; k0 := k0O—n; if kO ~ k then go to Lb; 
kO := k2-+ span; if KO <n then go to Lb; comment If computing for the current factor of 4 is not 


k0 := k0 --n; if KO + span then go to Lb finished then increment the sine and cosine values; 
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if kKO + span then Two successive calls of this procedure give an identity trans- 
begin formation; 
c2 := cl — (deXcl+dsxXs1); begin integer j, jj, k, lim, jk, n2, n4, n8, nn; 
sl := (dsXcl—deXsl1) + sl; real /; 
comment The following three statements compensate integer array C[0:m]; 
for truncation error. If rounded arithmetic is used, sub- ClO] := nn :=1; 77 := 0; 
stitute cl := c2; for j := 1 step 1 until m do C[j] := nn := nn + nn; 
cl := 1.5-0.5 X (c2 fT 2+s1 f 2); ifm > 1 then n4 := C[m—2]; ifm > 2 then 78 := C[m—3]; 
sl := cl XK sl; cl := cl X c2; n2:= C[m—1]; lim:=n2—1; nn:=nn—1; m:=m—4; 
c2 :=clT 2— slT2; s2:= 2.0 X cl X 1; for j := 1 step 1 until lim do 
c3 := ¢2 X cl — s2X sl; 338 := c2 X sl + 82 X cl; begin 
k:=k-+ks; goto Lb jk := jj +2; 
end; := Alj]; Aly] := ALjk]; ALjk] := 6; 


t 
t 


span := 4 X span := Bij]; Bi] := Bigk]; Blk] := ¢t; 
-end jgs=g+]; 
end REVFFT4; if jj = n4 then 
procedure REALTRAN (A, B, n, evaluate); begin 
value n, evaluate; integer n; jj := jj — n4; 
Boolean evaluaie; array A, B; if jj 2 n8 then 
comment If evaluate is false, this procedure unscrambles the begin 
single-variate complex transform of the n even-numbered and j= jj — n8; k:=m; 
n odd-numbered elements of a real sequence of length 2n, where L: if C[k] S jj then 
the even-numbered elements were originally in A and the odd- begin jj := jj — Clk]; k:=k-—1; gotoL end; 
numbered elements in B. Then it combines the two real trans- jj s= Clk) + 97 
forms to give the Fourier cosine coefficients A[0], A[1],--:, end 
A[n] and sine coefficients B[0], B[l],--- , Bln] for the full else jj := jj + 78 
sequence of 2n elements. If evaluate is true, the process is end 
reversed, and a set of Fourier cosine and sine coefficients is else jj := jj + 74; 
made ready for evaluation of the corresponding Fourier series if jj > j then 
by means of the inverse complex transform. Going in either begin 
direction, REALTRAN scales by a factor of two, which should k:=nn—j; jk:= nn — jj; 
be taken into account in determining the appropriate overall t:= Afj]; AU] := Aly]; Aj] := t; 
scaling; t:= Bij]; Bij] = Bij]; Bij] := ¢; 
begin integer k, nk, nh; t:= Alk]; A[k] := Aljk]; Aljk] := t; 
real aa, ab, ba, bb, re, im, ck, sk, de, ds; t:= Blk]; Bik] := Blyk]; Bljk] := ¢t 
nh := n+ 2; ds := 2.0 * arctan(1.0)/n; end 
dc := 2.0 X sin(ds) 72; ds := sin(ds+ds); end 
sk := 0; end REVERSEBINARY; 
if evaluate then procedure FFT8(A, B, n,m, ks); value n, m, ks; 
begin ck := —1.0; ds := —ds end integer n, m,ks; array A, B; 
else begin ck := 1.0; A[n] := A[0]; Bln] := B[O] end; comment This procedure computes the fast Fourier transform 
for k := 0 step 1 until nh do for one variable of dimension 2” in a multivariate transform. 
begin n is the number of data points, ie. n = mi X n2X +++ X ty 
nk :=n — k; for a p-variate transform, ks = x X ney X °++ XK np, where 
aa := Alk] + Alnk]; ab := A[k] — A[nk]; nm, = 2” is the dimension of the current variable. Arrays A[0:n— 
ba := Blk] + Bink]; bb := Blk] — Blnk]; 1] and B{0:n—1] originally contain the real and imaginary com- 
re := ck X ba + sk X ab; im := sk X ba — ck X ab; ponents of the data in normal order. Multivariate data is stored 
B[nk} := im — bb; Blk] := tm + bb; according to the usual convention, e.g. aja is in A[jXneXns+ 
A{nk] := aa — re; A[k] := aa + re; “ kXn3+l] for j = 0,1,°++,m—-1, k =0,1,+++,n2— 1, and 
aa := ck — (deXck+dsXsk); 1 =0,1,---, m3; — 1. On exit, the Fourier coefficients for the 
sk := (dsXck—dcXsk) + sk; current variable are in reverse binary order. Continuing the 
comment The following three statements compensate for above example, if the ‘“‘column” variable n: is the current one, 
truncation error. If rounded arithmetic is used, substitute column 
ck := aa; 
ck := 1.5-0.5 X (aa t 2+sk ft 2); k= ee aa + Km—22"-2 + ers + ky2 + ko 
4 i= ck X sk; ck := ck X aa is permuted to position 
en 
end REALTRAN; Koa? p kizm? ves Bema + Kem « 


pace ee Ee ae ne ene SANE A separate procedure may be used to permute the results to 


integer i; ma ek: é : normal order between transform steps or all at once at the end. 
comment This procedure permutes the elements A[j] and Bij] If n = ks = 2", the single variate transform 
of arrays A and B, for j = 0,1,--- , 21m — 1, according to i 
the reverse binary transformation. Element (aj-tiys) = 3 (aatibs) exp G2njk/n) 
a) 


kK = K-12") he? e+ + 2 + ko 

for j = 0, 1,-+-- , m — 1 is computed, where (a+7b) represent 
the initial values and (x-++-7y) represent the transformed values; 
keo2™) ky 2-2 + ees + kimi? + kn. begin integer k0, k1, k2, k8, k4, k5, k6, k7, k, span; 


is moved to location 
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real AO, Al, A2, A3, A4, A5, A6, A7, BO, Bi, B2, B3, B4, Bd, 
B6, B7, x0, x1, x2, x3, x4, x5, x6, x7, yO, y1, y2, y3, y4, yd, y6, 7, 
cl, c2, c3, c4, cb, c6, c7, 81, s2, 83, 84, 85, 86, 87, c45, de, ds, rad; 


span := ks; 
ks := span + ks; n:=n—1; c45 
comment Radix 8 transform; 

for m := m — 3 while m 2 0 do 


begin 
cl: 


ds : 
Se = 


a 
ki: 


8 
or 
paviiiatiivitivi tai aaa TAG 


A7: 
A[k0] := 


= 1.0; 
dc := 2.0 * sin(rad) fT 2; 


ks := 21m; rad := 


4.0 X arctan(1.0)/ks; 
:= sqrt (0.5); k:= m; 


sl:= 0; kO:=0; k:= ks; 


span + 8; 
La: := kO + span; k2:= ki + span; k8 := k2 + span; 


Akl]; Bl: 
A[k2]; B2: 
A[k3]; B38: 
A[(k4]; Ba: 
A[k5]; Bd: 
A[k6]; B6: 
A[k7]; B7: 
AO + A4; 
A0 — A4; 4; 
Al+ AB; yl: 
(Al — Ad — Bl + BS) X ¢45; 
(Al — Ad + Bl — Bd) X 45; 
A2+ A6; y2:= 
B6 — B2; y6 := A2 — Ab; 
A3 + A7; y3 := B3 + B7; 
(A7—A3—B3+B7) X c45; 
(A38— A7—B3+B7) X c45; 
x0 + x2 — x1 — 238; Bl 
x0 — 22-—yl+y3; B2 
x0 — 22+ yl — y8; B83 
wv4+ 26+ 45+ 27; B44: 
v4 + 26 — 25 — 27; BS 
w4—26—-y5+y7; BE 
w4 — 26+ y5 — y7; B7 
20 + 22 + al +23; BlkO] := 


Crit i oo 
& 
Po 
= 


iS 


if sl = 0 then 


begin 


A{kl] : 
A[k2] : 
 A[k8] : 
A{k4] : 
A[k5] : 
A{k6] : 
A[k7] : 


end 
else 
begin 


A[k1] : 
Bik1] : 
A[k2] : 
B{k2] : 
A[k3] : 
Blk3] : 


A[kA] 
Blk4): 


Alkd] : 


Bik5] : 


A[k6] : 


Bik6] : 

A{k7] 

BUk7] =: 
end; 


Hiei ier iii iis 


i i it i i tt ii 


Al; B[k1] : 
A2; Bik2]: 
A3; Blk3]: 
A4; BikA]: 
A5; B[k5] : 
A6; B[k6] : 
A7; B[k7]: 


ied td a a 
& 
cs 


c4 X Al — 84 BI; 
s4 X Al + c4 xX BI; 
c2 X A2— s2X B2; 
s2 X A2 + c2 X B2; 
c6 X A3 — 86 X BB; 
s6 X A3 + c6 X BB; 
cl X A4— 81 X BA; 
sl X A44 cl X B4; 
ch X A5 — 85 X BS; 
s5 X A5 + c5 X Bb; 
c3 X A6 — 83 X BG; 
s3 X A6 + c3 X B6; 


c7 X AT — si X B7; 
s7 X AT + c7 X BT 


rad := rad + rad; 
sin(rad); rad := 4X rad; 


k3 +- span; kd := k4 + span; k6 := k5 +. span; 
k6 + span; AQ := A[kO]; 


BO := B{k0); 


Bi + Bd; 


B2 + B6; 


yO + y2 — yl — 98; 
yO — y2+ 21 — 23; 
yO — y2 — x1 + 23; 
y4 + y6 + y5 + 97; 
y4 + y6 — y5 — 97; 
y4 — y6 + 25 — 27; 
y4 — y6 — £5 + 27; 


yO + y2 + yl + 98; 
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kO := k7 + span; if k0 < n then go to La; 
k0 := kO—n; if 0 + k then go to La; 
comment Increment sine and cosine values; 
if kO + span then 
begin 
a := cl — (deXcl+dsXsl); 
:= (dsXcl—dceXsl) + sl; 
comment The following three statements compensate 
for truncation error. If rounded arithmetic is used, 
substitute cl. := c2; 


cl := 1.5-0.5 X (c2T2+s1 7 2); 
sl := cl X sl; cl := cl X c2; 
c2 := cl f2— slt2; s2:= 2.0 X cl X sl; 
c3 := c2 X cl—s2 X sl; 838 := c2 X sl + s2 X cl; 
ch := c2T2 — s2T2; 84 := 2.0 K c2 X 82; 
cé5 := cl X c4 — 81 K 84; 85 := sl X c4 + cl X 84; 
c6 := c8T2 — s8T2; 36 := 2.0 X c3 X 383; 
c7 := cl X c6 — sl X 86; s7 := sl X c6 +l X 86; 
=k-+ks; goto La 
end; 
kB:=m 
end; 


comment If mis not a multiple of 3, then complete the trans- 
form with radix 2 steps; 
for k3 := k3 — 1 while k3 2 0 do 
begin 
kO := 0; span := span + 2; 
Lb: k2 := kO0 + span; 
A2 := Afk2]; B2 := B[k2); 
A[k2] := A[k0] — A2; B[k2] := B[k0] — 
A[k0] := A[k0] + A2; B[kO] := B[kO] + ae 
kO := k2 + span; if k0O < n then go to Lb; 
kO := k0—n; if k0 < ks then go to Lb; 
if ks = span then go to Ld; 
Le: k2 := kO + span; 
A2 := A[k0] — A[k2]; B2 := B{kO] — B[k2]; 
A[k0] := A[k0] + A[k2]; B[kO] := B[kO] + B[k2]; 
A[k2] := —B2; Bik2] := A2; 
kO := k2 + span; if k0 < 1 then go to Lc; 
kO := k0 — n; if kO < span then go to Lc; 
Ld: end 
end FFT8; 
procedure REVFFT8(A, B, n,m, ks); value n, m, ks; 
integer n,m,ks; array A, B; 
comment This procedure computes the fast Fourier transform 
for one variable of dimension 2” in a multivariate transform. 
n is the number of data points, i.e., m = mi X me X +++ K Np 
for a p-variate transform, and ks = ney X eye KX °° K Mp, 
where nz = 2” is the dimension of the current variable. Arrays 
A[0:n—1] and B[0:n—1] originally contain the real and imagi- 
nary components of the data with the indices of each variable 
in reverse binary order, e.g. ajx is in A[j’XneXnstk’Xngtl’] 
for j= 0, 1,---, m —1, k=0, 1,-:+, m—1, and 1 = 
0,1, +++ ,n3 — 1, where j’, k’, and l’ are the bit-reversed values 
of j, k, and 1. On completion of the multivariate transform, the 
real and imaginary components of the resulting Fourier coeffi- 
cients are in A and B in normal order. If n = 2" and ks = 1, 
a single-variate transform is computed; 
begin integer k0, k1, k2, k3, k4, k5, k6, k7, k, span; 
real AO, Al, A2, A3, A4, A5, A6, A7, BO, B1, B2, B3, B4, B5, 
B6, B7, x0, x1, 22, 23, x4, 25, x6, x7, yO, yl, y2, y3, y4, y5, y6, y7, 
cl, c2, c3, c4, c5, c6, c7, sl, s2, 83, s4, 85, 86, 87, c45, dc, ds, rad; 
rad := 4.0 X arctan(1.0); n:=n—1; 
c45 := sgri(0.5); span := ks; 
comment Compute radix 2 steps if m is not a multiple of 3; 
k3 := (m+8) X 3; 
for k3 := k3 + 1 while k38 s mdo 
begin 
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kO := 0; 
La: k2 := kO0 + span; 


A2 := A[k2]; B2 := B[k2); 
A[k2] := A[k0] — A2; B[k2] := B[k0] — 
A[kO] := A[kO] + A2; B[KO] := B[kO] + B2; 


kO := k2 + span; if k0 <n then go to La; 
k0 := k0— 1; if KO < ks then go to La; 
if ks = span then go to Lc; 
Lb: k2 := kO0 + span; 
A2 := A[k2]; B2 := Blk2]; 
A[k2] := A[kKO0] + B2; Blk2] := B[k0] — A2; 
A[k0] := A[k0] ~ B2; B[k0] := B[kO] + A2; 
kO := k2 + span; if kO < n then go to Lb; 
k0 := k0O— n; if kO0 < span then go to Lb; 
Le: span := span + span; rad := 0.5 X rad 
end; 
comment Radix 8 transform; 
for m := m — 3 while m = 0 do 
begin 
cl := 1.0; sl :=0; k0:=0; 
rad := 0.125 & rad; de := 2.0 
ds := sin (rad--rad) : 
Ld: kl := kO0 + span; k2:= ki + span; kB := k2 + span; 
k4 := k3 + span; ki := k4 + span; k6 v= kb + span; 


k := ks; 
0 X sin(rad) f 2; 


K7 := k6 + span; AO := A[k0]; BO := B{k0]; 
if sl = 0 then 
begin 

Al := Afki]; Bl := Bik1]; 

A2 := A[k2]; B2 := Blk2]; 

A3 := A[k3]; B3 := B[k3]; 

A4 := A[k4]; B4 := Bk4]; 

Ad := A[k5]; BS := Bik5); 

A6 := A[k6]; B6 := Bik6); 

A7 := A[k7]; | B7 := B[k7] 
end 
else 
begin 

Al := A[kl] X c4 — B[k1] X 84; 

Bl := A[kl] X s4 + B[k1] X 4; 

A2 := A[k2] X c2 — Blk2] x 52; 

B2 := A[k2] X s2 + Blk2] X c2; 

A3 := A[k3] X c6 — B[k3] x s6; 

B3 := A[k3] X s6 + B[k38] X c6; 

A4 := A[k4] X cl — Blk4] x s1; 

B4 := Al[k4] X sl + Blk4] X cl; 

A5 := A[k5] X c5 — Blk5] x 85; 

BS := Alk5] X 85 + Blk5] X cd; 

A6 := A[k6] X c3 ~— B[k6] X 53; 

B6 := A[k6] X s3 + Blk6] X c3 

AZT := A[k7] X c7 — Bk7] X 87; 

B7 := A[k7] & s7 + B[k7] XK c7 
end; 
20 := AO + Al + A2+ A3; yO := BO+ Bl + B2+ B3; 
zl := AQ — Al — B2+ B38; yl := BO — Bl + A2 — A3; 
22 := AQ + Al — A2— AB; y2 := BO + Bl — B2 — B3; 
3 := AQ — Al + B2 — B38; y3 := BO — Bl — A2+ A3; 
zm4:= A4+ A5+ A6+ A7; y4 i= B4 + B5 + B6 + B7; 
z5 := (A4—A5—B6+B7) X c45; 
yS := (B4—B5+A6—A7) X c45; 
x6 := A4+ A5 — AG — A7; ¥6 := B4+ B5 — B6 — B7; 
x7 := (A4—A5+B6—B7) x o45; 
y7 := (B4—B5—A6+A7) X c45; 


A[kO] := 20 + 24; B[kO] := yO + y4; 
Af[k1] := 21 +245 — y5; Blkl] := yl + 25 + ¥5; 
A[k2] := 22 -- y6; Blk2] := y2 + 26; 
A[k3] := 23 - 27 — y7; Blk3] := y38 + 27 — 97; 
A[k4] := 20 —- 24; B[kA] := yO — y4; 
A[k5] := al — 25 + y5; Blk5] := yl — 25 — 95; 
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A[k6] := 22 + y6; B[k6] := y2 — x6; 
A[k7] := 238 +27 + y7; B[kK7] := y38 ~ 27 + 97; 
kO := k7 + span; if k0 < n then go to Ld; 
k0 :=k0O—n; if 0 < k then go to Ld; 
comment Increment the sine and cosine values; 
if k0 ~ span then 
begin 
eo. := cl — (deXcl+dsXs1); 
= (dsXcl—dceXs1) + s1; 
comment The following ‘three statements compensate 
for truncation error. If rounded arithmetic is used, 
substitute cl := c2; 


cl := 1.5-0.5 X (c2 fT 2+s1 f 2); 
sl := cl X 81; cl := cl X c2; 
c2 := clT2— slf2; 82 := 2.0 X cl X s1; 
c3 := cl X c2 — sl X 82; 88 := sl X c2+ cl X 82; 
c4 := c2T2 — s2T2; 84 := 2.0 X c2 X 82; 
c5 := cl X c4 — sl XK 84; 85 := sl KX c4 + cl X 84; 
c6 := c8T2— 8872; s6 := 2.0 X c3 X 83; 
c7 := cl X c6 — sl X 86; 87 := sl X c6 + cl X 86; 
k:=k+ks; goto Ld 
end; 
span := 8 X span 
end 
end REVFFTS8 


REMARK ON ALGORITHM 345 [C6] 

AN ALGOL CONVOLUTION PROCEDURE BASED 

ON THE FAST FOURIER TRANSFORM [Richard C. 
Singleton, Comm. ACM 12 (Mar. 1969), 179] 

Ricuarp C. SINGLETON (Recd. 15 May 1969) 

Stanford Research Institute, Menlo Park, CA 94025 


KEY WORDS AND PHRASES: fast Fourier transform, com- 
plex Fourier transform, multivariate Fourier transform, Fourier 
series, harmonic analysis, spectral analysis, orthogonal poly- 
nomials, orthogonal transformation, convolution, autocovari- 
ance, autocorrelation, cross-correlation, digital filtering, per- 
mutation 

CR CATEGORIES: 3.15, 3.83, 5.12, 5.14 


On page 180, column 2, the 8rd and 2nd lines from the end of 
procedure CONVOLUTION must be interchanged, i.e. the final 
four lines should read: 

begin C[n—j] := scale X (C[j] — D[j]); 
Clj] := scale X (Clj] + Dly)) 
end 

end CONVOLUTION; 

The procedures included in Algorithm 345 were punched from 
the printed page and tested on the CDC 6400 ALGoL compiler. 
After making the one correction the test results agreed with those 
obtained earlier with this compiler. 


COLLECTED ALGORITHMS FROM CACM 
346-P 1- 0 


ALGORITHM 346 ee oe as 
F-TEST PROBABILITIES [S14] glacier: 
JoHN Morris (Recd. 10 Apr. 1968, 12 Sept. 1968, and vp := vp — 2.0; 

6 Nov. 1968) ft := va X op/fl X (L.0+ft) 
Computer Institute for Social Science Research, Michigan end; 

State University, East Lansing, MI 48823 “ v= oT (0.5Xf2) x (L.0+f1) 

en 

KEY WORDS AND PHRASES: F-test, Snedecor F-statistic, else if 2 X (df2 + 2) = df2 A df2 < maxn then 

Fisher test, distribution function begin 
CR CATEGORIES: 5.5 for f2 := f2 — 2.0 step — 2.0 until 1.0 do 
procedure Fiest (f, dfl, df2, maxn, prob, gauss, error); begin 

value f, df1, df2, maxn; real f, prob; integer df1, df2, mazn: vp := up — 2.0; 

real procedure gauss; label error; ft := 2X vp/f2 * (1.0+ft) 
comment This procedure gives the probability that F will be end; 

greater than the value of f where ft := 1.0 — (1.0—z) T (0.5Xf1) * (1.0+ft) 

end 
f = a1*/o2?, else if dfl + df2 < maxn then 

«1? is the variance of the sample with size Ni, o2*is the variance begin ; 

of the sample with size Nz, dfl = Ni — 1, df2 = N2 — 1, real theta, sth, cth, sts, cts, a, b, xt, gamma; 

and F is the Snedecor-Fisher statistic as defined and tabled by theta := arctan (sqrt(flXf/f2)); 

Snedecor [4]. sth := sin(theta); cth := cos(theta); 

The present algorithm computes a value which is directly sis := sth} 2; cis: = cthT 2; 
related to that of Algorithm 322, such that prob = 1 — Fisher. irs b := 0.0; 

A number of test runs on various computers suggest that Flest if df2 > 1 then 
may be considerably faster than Fisher. begin ; 

An approximation is included to limit execution time when for f2 := f2 — 2.0 step — 2.0 until 2.0 do 
sample size is large. It should be used when register overflow a := cls X (f2—1.0)/f2 X (.0+a); 
would otherwise result, and the appropriate value for mazn a := sth X cth X (1.0-+a) 
will therefore depend upon the specific implementation. When end; 
marn = 500 the approximation appears to give three-digit igh theta + a; 
accuracy. The real procedure gauss computes the area under if af 1 > 1 then 
the left-hand portion of the normal curve. Algorithm 209 [3] begin : 
may be used for this purpose. If f < 0 or if dfl < 1 or if df2 <1 for fl := fl — 2.0 step — 2.0 until 2.0 do 
then exit to the label error occurs. begin 

National Bureau of Standards formulas 26.6.4, 26.6.5, and vp := vp — 2.0; 

26.6.8 are used for computation of the statistic, and 26.6.15 is b := sts X vp/fl X (1.0+b) 
used for the approximation [2]. end; 

Thanks to Mary E. Rafter for extensive testing of this proce- gamma := 1.0; f2:= 0.5 X df2; 
se iapheries referee for a number of suggestions. for ci := 1.0 step 1.0 until f2 do 

seit = at 1—0.5); 
1. Dorrer, Econ. Algorithm 322, F-Distribution. Comm. sseleballae ak gonna (240-8) b 
ACM 11 (Feb. 1968), 116-117 b := gamma X sth X cth T df2 X (1.0-+6) 
: : ¢ end; 
2. Handbook of Mathematical Functions. National Bureau of ft ia 1.0 + 0.636619772368 X (b—a); 
sage aay arn Ser. .Vol., 65, Washington, comment 0.6366197723675813430755351 +++ = 2,0/r; 
° °) ? . . d 
3. Isperson, D. Algorithm 209, Gauss. Comm. ACM 6 eae 
(Oct. 1963), 616. begit 
4. SNnepEcoR, GeorGE W. Siatistical Methods. Iowa State U. real cbrf; 
ae Press, Ames, Iowa, 1956, pp. 244-250; fl := 2.0/9.0 X fl); f2 := 2.0/(9.0Xf2); 
4 cbrf := f T 0.333333333333; 
if a 3 x df2 Sy se ; 0.0 then go to error; ft _ gauss(— ((1.0—f2) X cbrf-+f1—1.0)/ 
ae 0.0 then prod := 1. sqrt({2Xcbrf t 2+f1)) 
: end; 
sss eee prob := if ft < 0.0 then 0.0 else fi 
rea »J2, Ly Jl, vp; end 
fl := dfl; f2:= df2; ft := 0.0; énd Fisst 


a = f2/(f2+flxf); vp := fl + f2 — 2.0; 
if 2X (dfl+2) = dfl A dfl S mazn then 
begin 
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AN EFFICIENT ALGORITHM FOR SORTING WITH 
MINIMAL STORAGE [M1] 

Ricuarp C. SinaLeton* (Recd. 17 Sept. 1968) 

Mathematical Statistics and Operations Research De- 
partment, Stanford Research Institute, Menlo Park, 
CA 94025 


* This work was supported by Stanford Research Institute with 
Research and Development funds. 


KEY WORDS AND PHRASES: 
ing, digital computer sorting 
CR CATEGORIES: 5.31 


sorting, minimal storage sort- 


procedure SORT(A, i, j); 
value i,j; integer 7, 3; 
array A; 

comment This procedure sorts the elements of array A into 
ascending order, so that 


Afk] SAlkt+1], kmtiti,---,j—1. 


The method used is similar to QUICKERSORT by R. S. Scowen 
[5], which in turn is similar to an algorithm given by Hibbard 
(2, 3] and to Hoare’s QUICKSORT [4]. QUICKERSORT is used 
as a standard, as it was shown in a recent comparison to be the 
fastest among four ACM algorithms tested [1]. On the Bur- 
roughs B5500 computer, the present algorithm is about 25 
percent faster than QUICKERSORT when tested on ran- 
dom uniform numbers (see Table I) and about 40 percent 
faster on numbers in natural order (1, 2, --- , m), in reverse 
order (n, mn—1,-:-, 1), and_ sorted by halves 
(2,4, «+: ,n,1,3,--+ ,n—1). QUICKERSORT is slow in sorting 
data with numerous ‘‘tied” observations, a problem that can be 
corrected by changing the code to exchange elements a[k] > t 
in the lower segment with elements a[q] < ¢ in the upper seg- 
ment. This change gives a better split of the original segment, 
which more than compensates for the additional interchanges. 
In the earlier algorithms, an element with value ¢ was selected 
from the array. Then the array was split into a lower segment 
with all values less than or equal to ¢ and an upper segment with 
all values greater than or equal to ¢, separated by a third seg- 
ment of length one and value ¢. The method was then applied 
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TABLE I. Sorting Times 1n Seconps ror SORT anp 


QUICKERSORT, on THE Burroveus B5500 
CoMPUTER—AVERAGE OF Five TRIALS 


Algorithm 
Original order and number of items SORT QUICKERSORT 
Random uniform: 
500 0.48 0.63 
1000 1.02 1.40 
Natural order: 
500 0.29 0.48 
1000 0.62 1.00 
Reverse order: 
500 0.30 0.51 
1000 0.63 1.08 
Sorted by halves: 
500 0.73 1.15 
1000 1.72 2.89 
Constant value: 
500 0.43 10.60 
1000 0.97 41.65 


recursively to the lower and upper segments, continuing until 
all segments were of length one and the data were sorted. The 
present method differs slightly—the middle segment is usually 
missing—since the comparison element with value ¢ is not re- 
moved from the array while splitting. A more important differ- 
ence is that the median of the values of A[z], A[(¢+j)+2], and 
A[j] is used for ¢t, yielding a better estimate of the median value 
for the segment than the single element used in the earlier 
algorithms. Then while searching for a pair of elements to 
exchange, the previously sorted data (initially, A[¢]<t<A[j]) 
are used to bound the search, and the index values are compared 
only when an exchange is about to be made. This leads to a small 
amount of overshoot in the search, adding to the fixed cost of 
splitting a segment but lowering the variable cost. The longest 
segment remaining after splitting a segment o n has length 
less than or equal to n — 2, rather than n — 1 as in 
QUICKERSORT. 

For efficiency, the upper and lower segments after splitting 
should be of nearly equal length. Thus ¢ should be close to the 
median of the data in the segment to be split. For good statis- 
tical properties, the median estimate should be based on an odd 
number of observations. Three gives an improvement over one 
and the extra effort involved in using five or more observations 
may be worthwhile on long segments, particularly in the early 
stages of a sort. 

Hibbard [3] suggests using an alternative method, such as 
Shell’s [6], to complete the sort on short sequences. An experi- 
mental investigation of this idea using the splitting algorithm 
adopted here showed no improvement in going beyond the final 
stage of Shell’s algorithm, i.e. the familiar “‘sinking’’ method of 
sorting by interchange of adjacent pairs. The minimum time 
was obtained by sorting sequences of 11 or fewer items by this 
method. Again the number of comparisons is reduced by using 
the data themselves to bound the downward search. This 
requires 


Ali-1] < Al], tk <j. 
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foleleke! 


Thus the initial segment cannot be sorted in this way. The 
initial segment is treated as a special case and sorted by the 
splitting algorithm. Because of this feature, the present al- 
gorithm lacks the pure recursive structure of the earlier al- 
gorithms. 

For n elements to be sorted, where 2* < n < 241, a maximum 
of k elements each are needed in arrays IL and IU. On the B5500 
computer, single-dimensional arrays have a maximum length 
of 1023. Thus the array bounds [0:8] suffice. 


This algorithm was developed as a ForTRAN subroutine, then 
translated to Aucou. The original ForTRAN version follows: 


SUBROUTINE SORT(AsTT JJ) 

SORTS ARRAY A INTO TNCREASING ORDER, FROM AC IT) TN AC JJ) 

ORDERING 1S BY INTEGER SUBTRACTION, THUS FLOATING POINT 

NUMBERS MUST BE IN NORMALIZED FORM. 

ARRAYS TUCK) AND TL(K) PERMIT SORTING UP TO 2**(K#1L)-1 ELEMENTS 
DIMENSTON A(1),IUC(16),IL (16) 
INTEGER AyTyTT 
M=1 
I=1T 
J=JJ 

5 IF(I .GEe- J) GO TO 70 

10 K=T 
TJ=(Jt+T)/2 
THAT J) 
IFCACT) .LE. T) 
ACI J)=ACT) 
ACT)=T 
T=A(TJ) 

20 L=J 
IF(A(J) «GE. T) 
AVIJI=A0S) 
A(J)=T 
T=A(TJ) 
IFCACT) .LE. T) 
ACT JV=A(T) 
ACTI=T 
T=ACT JI) 
GO TO 40 

320 A{(L)=A(K) 
A(K)=TT 

40 L=L-1 
IF(ACL) «GT. T) GO TN 40 
TT=A(L) 

50 K=K+} 
IFCA(K) «LT. TY GO TO 50 
IF(K «LE. £) GO TO 30 
IF(L-I .LE. J-K) GO TON 60 


GO TO 20 


GO TO 40 


GO TO 40 


IL(M)=1 
TUCM)=L 
t=k 
M=M+1 

GO TO 80 

60 IL(M)=K 
TUCM)=J 
Jet 
M=M+] 

GO TN 80 

70 M=M-1 
TF(M .EQ. 0) RETURN 
T=TLOM) 

J=TUCM) 

80 IF(J-I .GE. 11) GO TO 190 
IF(I eEQ, II) GO Ta 5 
1=I-1 

90 [=T+l 
IF(I «EQ. J) GO TO 70 
T=ACT#L) 

TF(A(I) .LE. T) GO TO 90 
K=1 
100 A(K41}=AiK) 
K=K~-1 
IFUT -LT. AC(K)) GO TO 100 
A(K41)=T 
GO To 90 
END 


This ForTrRaAN subroutine was tested on a CDC 6400 computer. 
For random uniform numbers, sorting times divided by n loge n 
were nearly constant at 20.2 X 10-6 for 100 < n < 10,000, with 
a time of 0.202 seconds for 1000 items. This subroutine was also 
hand-compiled for the same computer to produce a more efficient 
machine code. In this version the constant of proportionality 
was 5.2 X 10°*, with a time of 0.052 seconds for 1000 items. In 
both cases, integer comparisons were used to order normalized 
floating-point numbe-s. 

REFERENCES: 


1. Buarr, CHarues R. Certification of algorithm 271. Comm. 


ACM 9 (May 1966), 354. 


Ld: Alk+1] := Alk); 
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3. Hipparp, Tuomas N. An empirical study of minimal storage 


sorting. Comm. ACM 6 (May 1963), 206-213. 


4, Hoars, C. A. R. Algorithms 63, Partition, and 64, Quicksort. 


Comm. ACM 4 (July 1961), 321. 
5. Scowren, R. 8. Algorithm 271, Quickersort. Comm. ACM 8 
(Nov. 1965), 669. 


6. SHELL, D. L. A high speed sorting procedure. Comm. ACM 2 


(July 1959), 30-32; 


begin 
real {, tt; 
integer 7, 77, k, L, m; 
integer array IL, IU(0:8]; 
m:=0; w%:=%; goto LA; 
Tl: a7 := (49) + 2; t:= Alij]; Rise it; Li= j; 
if A(t] > ¢ then 
begin A[ij] := Al[z]; 
if Alj] < ¢ then 
begin 
Alij] := Alj]; Alj] <= t; 
if A{z] > ¢ then 
begin A[zj] := A[z]; 
end; 
£2: L:= L—1; 
if A[L] > t then go to L2; 
tt := A[L]; 
D3: k:=k+1; 
if A[k] < ¢ then go to L3; 
ifk < L then 


Alt] := ¢t; ¢:= Alij7] end; 


t:= Altg]; 


Aft] := t; t:= Al[tj] end 


begin A[L] := A[k]; A[k] := it; go to L2 end; 
ifL—i1>j—kthen 
begin [L[m] := 17; IU[m] := L; 1 := k end 


else 
begin [L[m] := k; 
m:i=m+i1; 
IA: if j — 7 > 10 then go to Ll; 
if 7 = 7 then 
begin if 7 < 7 then go to L1 end; 
for 7 :=7+1 step 1 untilj do _ 
begin 
= Afi]; k:=i—-1; 
if A[k].> ¢ then 
begin 


IU([m] := 9; gj := Lend; 


k:=k-—1; 
if Alk] > ¢ then go to L5; 
A[k+1] := t 
end 

end; 

m:=m—1; ifm > O0then 
begin 7 := IL{m]; 7 := 1U[m]; 

end SORT 


go to [A end 
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The algorithm was tested on the CDC 6400 AtcGo.t compiler 
(version 1.1, running under the SCOPE operating system, version 
3.1.4). One trial was made using an array of 5000 pseudorandom 
numbers; the results were correct. 

The central processor time was about 6.9 seconds corresponding 
to a value for K (defined below) of about 110 microseconds. 

It would be more in the spirit of ALaot to follow QUICKER- 
SORT [1] and give arrays JL and JU dynamic bounds. This in- 
volves changing line 4 on page 187 from 

integer array JL, I[U(0:8]; 
to 

integer array /L, IU(0:In(j—7+1)/In(2)—0.9]; 

The Fortran subroutine given in the comments to the.algo- 
rithm was tested on a CDC Fortran compiler (the RUN compiler 
version 2.3, running under the SCOPE operating system, version 
3.1.4). Tests were made with each of the five initial orderings 
described with the algorithm for a variety of array lengths from 
500 to 40,000. For integer arrays, the results were correct; but 
when the actual argument corresponding to the dummy argument 
A was areal array containing large positive and negative numbers, 
errors occurred. This does not invalidate the subroutine, but the 
comments should be changed to 


C SORTS INTEGER ARRAY A INTO INCREASING OR- 
DER, FROM A(II) TO A(IJ) 

ARRAYS IU(K) AND IL(K) PERMIT SORTING UP TO 
2**(K+1) — 1 ELEMENTS 

THE USER SHOULD CONSIDER THE POSSIBILITY OF 
INTEGER OVERFLOW 

THE ONLY ARITHMETIC OPERATION ON THE ARRAY 
ELEMENTS IS SUBTRACTION 


This gives enough information (and a hint) but leaves the re- 
sponsibility for any abuse of American National Standards In- 
stitute (formerly USASI) Fortran where it belongs—with the 
user. 

The subroutine was also tested on the IBM 7040 FortRAN com- 
piler (the IBFTC compiler running under the IBSYS operating 
system, version 9 level 10). The results were correct. The statement 


INTEGER A, T, TT 


OQ Q 4 


was removed and the amended subroutine tested using similar, 
but real, arrays. The results were again correct; running times 
increased by up to 5 percent on the CDC 6400 and were unchanged 
on the IBM 7040. 

Tables I and II summarize the information on running times 
in terms of K, where 


time = Kn log. n 


(runs of other lengths are omitted for brevity). 
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TABLE I. Sortine Times 
K in microseconds where time = Kn log, n 


Test Method 
ss Burroughs CDC 6400 CDC 6400 FOR- 
Original one an number | re 3 FORTRAN TRAN, INTE. SOR rRaN 
Random uniform 
500 107 21.2 20.5 
1000 102 21.7 20.5 
5000 21.1 20.2 269 
10000 21.1 20.1 263 
40000 21.2 20.1 
Natural order 
500 65 12.9 12.5 
1000 62 13.1 12.4 146 
5000 12.6 11.9 148 
10000 12.7 12.0 
40000 12.9 12.1 
Reverse order 
500 67 14.3 13.4 
1000 63 13.9 13.4 
5000 13.4 12.7 158 
10000 13.4 12.7 158 
40000 13.5 12.8 
Sorted by halves 
500 163 34.8 32.6 
1000 173 37.1 35.1 
5000 39.5 37.2 465 
10000 41.8 39.3 491 
40000 46.6 44.1 
Constant value 
500 96 19.2 18.5 
1000 97 19.4 18.7 
5000 19.4 18.7 237 
10000 19.9 19.0 241 
40000 20.2 19.5 
* Calculated from Singleton’s results 
TABLE II. Vauvuess or n log, n 
n 500 1000 5000 10000 40000 
10 n log. n 0.00448 0.00996 0.0614 0.1329 0.6115 


For use as a library routine one slight change is recommended: 
JJ—II should be tested on entry and a suitable error message pro- 
duced if negative. It would be possible to transfer ‘‘work”’ arrays 
to replace IU and IL thus allowing the user more control of storage 
allocation, but the additional instructions needed to handle the 
extra arguments reduce the saving and this is hardly worthwhile. 

The authors would like to thank the referee for his helpful 
comments. 

REFERENCE: 
1. Scowen, R. S. Algorithm 271, Quickersort. Comm. ACM 8 
(Nov. 1965), 669-670. 
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If the values of ij, instead of always being (¢+7) + 2, are at 
varying positions between 7 and 7, then there is less likelihood of 
peculiar initial structure causing failure of the algorithm to per- 
form rapidly. The position of 77 can be made to vary by replacing 
the statements 


m:=0; i:=%1; goto Z4; Ll: ij := G47) + 2; 


by 
real r; r:= 0.875; m:=0; w:=4%; goto LA; 
Di: r := if r > 0.58984375 then r — 0.21875 else r + 0.0390625; 
j:= t+ (Gj-t) X74; 


comment These four decimal constants, which are respectively 
48/128, 75.5/128, 28/128, and 5/128, are rather arbitrary. On 
most compilers their binary representations will be exact, and 
the use of them in the statement L1 causes r to vary cyclically 
over the 33 values 48/128 --- 80/128. Therefore 77 takes a varia- 
ble position somewhere within the middle quarter of the segment 
to be sorted. Wider variation of 77 would be undesirable in the 
special case of a partially presorted array; 


In sorting an array of N elements which are initially in random 
order this will waste (on ICL Atlas) less than N/10° seconds, but 
if the array is, for example, composed initially of two equal pre- 
sorted halves, then the use of the original rather than the modi- 
fied version would more than double the sorting time required if 
N > 104. 

As the author points out, the published version could fail if 
used to sort arrays of 1024 or more elements because the upper 
bounds of IU and IL might be inadequate. For a standard pro- 
cedure the declaration JL, IU [0:8] should be replaced by the 
declaration IL, IU [0:20]. This permits the sorting of arrays of up 
to 4 million elements, which is, with present core store sizes, suffi- 
cient. 

The statement tt := a[L] which precedes L3: will be executed less 
frequently if it is transferred into the next conditional statement, 
which then reads 

if k < L then begin tt := afZ]; a[Z] := afk]; alk] := ut; 

go to L2 end 
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procedure scale (a, m,n, g, u, v); 
value m,n,g; integer m,n; real g; 
real array a; integer array 4, v; 

comment The use of scaling to precondition matrices so as to 
improve subsequent computational characteristics is of con- 
siderable importance. To measure the scaling condition of a 
matrix,ai; ((=1, --- , mand j=1, --- ,n), Fulkerson and Wolfe 
{1] suggested the ratio of the matrix entrv of largest absolute 
value to that of the smallest nonzero absolute value. This 
procedure implements the method of [1], i.e. finding multipli- 
cative row factors, r;, and column factors, s;, which, when ap- 
plied, minimize the above condition number. The minimization 
problem can be expressed as an equivalent additive discrete 
problem by taking logarithms and defining: 


ie 
rmr=g", 


sy = 9°5, bes = log, (abs(ai;)) 
and taking c;; to be the least integer greater than or equal to 
b;;. Thus the formulation becomes: minimize an integer w 
subject to the constraints 0 < «: + vj; + c:3 < w where u; and 
v; are unrestricted and integral in value. The effect of decreasing 
the value of the base g would be to more accurately approximate 
the continuous scaling problem by the discrete form. 
REFERENCE: 
1. Futxerson, D. R., anp Wotre, P. An algorithm for scaling 
matrices. SIAM Rev. 4 (1962), 142-146; 
begin 
integer array c[l:m, 1:n], ri[l:m], si[l:n]; 
real val: 
integer maz, store, markr, markc, num, nopt, 1, j; 
nopt := 0; 
comment Create initial integer matrix c. Due to machine 
round-off errors, it may be desirable for some problems to 
insert a tolerance when checking for zero values of the input 
matrix and for matrix entries which are exact integral powers 
of the base g; 


for 7 := 1 step 1 until m do 
for j := 1 step 1 until 7 do 
begin 
if (alt, j]=0) then 
begin 
elt, J] «= 0; 
go to tnif 
end; 
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val := In(abs(alt, j]))/In(g); 
c[z, j] := entter(val) + 1; 
if ((c[¢, j]-1) =val) then c[z, j] := elt, 7] — 1; 
intf: 
end; 
comment Select initial values of u; and v; that satisfy con- 
straints of discrete formulation; 
for 7 := 1 step 1 until m do 
begin | 
ult] := cz, 1]; 
for j := 2 step 1 until n do 
if (c[t, j]<u[t]) then ul[z] := clz, 7]; 


ult] := —ul[t] 
end; 
for j := 1 step 1 until n do 
begin 


oj] := ef1, 9] + ull]; 
for 7 := 2 step 1 until m do 
begin 
store := cli, j] + ult]; 
if (store<v[j]) then v[j] := store; 
end; 
vfj] := —of3]; 
end; 
comment Step one. Initialize row and column markers with 
unmarked rows and columns denoted by a 1 in ri[z] and s7[j], 
respectively. Locate and mark maximum entry of current 
working array; 


remax: max := 0; 
for 7 := 1 step 1 until m do 
begin 
rift] := 1; 
for j := 1 step 1 until 7 do 
begin 


if (¢ = 1) then si[j] := 
if (nopt=0) then c[z, J] : 
if (c{t, j]>maz) then 
begin 

markr := 1; 

marke := 7; 

maz := ct, 7] 


1; 
= ult] + olj] + eft, J]; 


end 
end 
end; 
nopt := 1; 
ri{markr] := —1; 
comment Repeat steps two and three in succession until 


there are either no freshly marked rows or no freshly marked 
columns. Any row or column marked in the immediately pre- 
ceding application of step one, two, or three is called freshly 
marked and denoted by —1 in the appropriate indicator 
vector. Previously marked rows and columns that are not 
freshly marked are denoted by zero values; 

comment Step two; 

rmarks: num := 0; 

for 7 := 1 step 1 until m do 

begin 
if (ri[i]>—1) then go to rmarkf; 
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ri[t] := 0; 
num := num+ 1; 
for j := 1 step 1 until n do 
if (s¢[j]=1) A (cli, j]=0) then si[j] := —1; 
rmarkf : 
end; 


if (num=0) then go to change; 
comment Step three; 


num := 0; 
for j := 1 step 1 until n do 
begin 
if (si[j]>—1) then go to cmarkf; 
si[j] := 0; 


num := num -+ 1; 
for 7 := 1 step 1 until m do 
if (ri[ftq]J=1) A 
((clz, jJ=max) V (elt, j]= (maxz—1))) then 
rift] := —1; 
cmarkf : 
end; 
if (nwm 0) then go to rmarks; 
comment Step four. Modify integer scaling factors u and v 
and adjust current working matrix (¢;;+ui+v;); 
change: if (si[markc]<1) then go to finis; 


for 71 := 1 step 1 until m do 
if (ri[7]<1) then 
begin 


ult] := ule] — 1; 
for j := 1 step 1 until 1 do 
elt, j] := elt, j] — 1 
end; 
for j := 1 step 1 until 2 do 
if (st{j]<1) then 


begin 
v[j] -= ofj] + 1; 
for 7 := 1 step 1 until m do 
elt, J] = elt, J] +1 
end; 
go to remax; 
finis: 


end 
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KEY WORDS AND PHRASES: polygamma function, psi 
function, digamma function, trigamma function, tetragamma 
function, pentagamma function, special functions 

CR CATEGORIES: 5.12 


procedure polygamma (n, 2, nd, polygam, error); 
value n, z,nd; real z, polygam; integer n, nd; label error; 
comment This procedure assigns to polygam the value of the 
polygamma function of order n for any real argument z. For 
n = 0, we have the psi or digamma function, for n = 1 the tri- 
gamma function, for n = 2 the tetragamma function, and so on. 
For arguments that are poles of the function (nonpositive 
integer values), an exit is made through the label error. The 
parameter nd gives the requested relative precision expressed 
in number of decimal digits. 
It computes the polygamma function through the asymptotic 
series 


=D! al SS , @k+n— pI 
)(z) w (—1)71 | ——— 4+ — (ony) akin 
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except for n = 0, when the first term is —In (z). 
If the simple empirical relationship 


22>n-+nd 


is true, as well as z > n, one enters directly into the asymptotic 
series with the original argument. Otherwise, the computation 
of small arguments is reduced to that of sufficiently large argu- 
ments, applying repeatedly the recurrence relation: 


YO(e+1) = Y(2) + (—1)"nker 


To save computation time, the argument, once larger than n, 
is increased just to the point when the minimum term of the 
asymptotic expansion is sufficiently small so as not to alter the 
value of the result within the chosen precision. 

The order of the minimum term is estimated by the first order 
approximation 


we — n/2, 


and the corresponding absolute value by the approximation 
formula 


(2r)" exp (—2rzz). 
Negative arguments are related to positive ones through the 


reflection formula: 


(—1)27 yo — z) = y(z) + 7 cot ne 
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The nth-order derivative of the cotangent is computed by 
term by term differentiation of the tangent or cotangent series 
after the convenient trigonometric reductions of the argument’s 
value. 

This procedure is not recursive and uses no own variable; 

begin 
real pi, pf, soma, zq, tl, fac, prec, w, sab, pv; 
integer pr, nl, k1, m1; 
real procedure fat (n); 


value n; integer 7; 
begin 
real f; integer 7; 
f:= 1; 
for 7 := n step —1 until 2 dof := f X 1; 
fat :=f 
end of fat; 
procedure inc (s, x1, L); 
real s,z1; label L; 
begin 
real sant; 
sant := 8s; s:=s+al; 


if abs (s—sant) < abs (prec X s) then go to L 

end of inc; 

comment The procedure polygamma uses a table of coeffi- 
cients sb for its series with the value 


| Bu| om (-- 1)*-1/h2 9 
sb) = opi = Gamat) Gn’ 


the last being an asymptotic value for large 7. The compu- 
tation of these coefficients need not to be repeated at each 
procedure call; so it is convenient to transfer the declaration 
and block below to the main program and execute it just once. 
One should replace flund by the smallest positive real 

number within the machine representation, and ms by the 
number of decimal digits of the mantissa; 

array sb [1 : entier (.272 & In(2/flund))]; 

begin 
real pig, sm, pipo, ptwo, dpi, sa; 
integer sg, in, k2, 1maz; 
array tr, g[2 : entier (10 T (ms/22))+1]; 
imax := entier (.272 X ln(2/flund)); 
pig := 9.86960440108935861883449099987615113531369940724079 ; 
pipo := pig fT 11; ptwo := 2097152; dpi := 4 X pig; 
sb [1] := 1/12; 
sb [2] := 1/720; 
sb [3] := 1/30240; 
sb [4] := 1/1209600; 


sb [5] := 1/47900160; 
sb [6] := 691/1307674368103 ; 

sb [7] := 1/74724249600; 

sb [8] := 3617/1067062284288104 ; 


sb [9] := 43867/5109094217170944103 ; 
sb [10] := 174611/80285766269829 12105 ; 


sm := 1; sg:= —1; 
for in := 2,in + 1 while sm 34 sa do 
begin 
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glen] := 1/(in X in); E: v := if C then p X (1—p) else p; 
tr{tin] := sg X glen] T 11; sa := sm; inc(s, -sb[j] Xf X tX v, D); 
sm := sm + trlin]; sg := —sg for 7 := 1 step 1 until 4 do 
end; fit] := file] + 2; 
sb[11] := sm/(pipo X (ptwo—1)); f= FX fll) X fil2] * y2/(Fe13} X Fela); 
i k2 := 12 step 1 until tmaz do p:=4X ps j:= 741 
egin go to E; 
P sm i= sue ager Raeoaas D: soma := 1 X pi fT nl X (if C then s X pr else s) 
: in := in ; trlin] := trlin qlin]; sa := sm; end 
sm := sm + trlin]; if sa ~ sm then go to B; end; 
pipo := pipo x peg; plwo := ptwo X 4; 2:s3l—2; w:=2z fn; 
sb[k2] := sm/(pipo X (ptwo—1)); pv := if n = 0 then In(z) else fac/(n X w); 
if in = 2 then go to L sab := abs(soma) ; 
end; if pv < sab then nd := nd — .484 X In(sab/pv) 
go to A; end 
L: for k2 := k2 + 1 step 1 until «maz do else 
sb[k2] := sb[k2—1]/dpi; begin soma := 0; kl := 1; w:=2z fT nend; 
A: end of sb coefficients computation; if nd < 0 then go to L; 
pi := 3.14159265358979323846264338327950288419716939937510; if2xXz2<n+ndVz<nthen 
prec := 10 7 (—nd); fac :=. fat (n); begin 
pr:i= ifn + 2X 2 = nthen 1 else — 1; Beal teria) :cand; 
pf := pr X fac; nl :=n+1; pies ee): 
ite < O then ine(soma, term, L); 
heeia, cond := (n X 1.8378—In(abs(term)) + 2.3025 X nd) X .1591; 
: _ oo ron <a if cond < n then cond := n; 
aC = enlier(z) then go to erro) if cond < zthenz:=z+1 
eise . 
I 
begin Beain 
ee a d, l; eNOL Cs integer ip, k; 
rls pr; (= 2) Lis bd — 2; ip := cond—2z+1; 
ifz > Othen/ := 1 if ip < 1 then goto L; 
else for k := 1 step 1 until 7p do 
begin x := —x, | := —prend; inc(soma, —pf/(z+k) T n1, L); 
C:=2 > .25; y:= pi X (if C then (.5—z) else z); 232 ap 
ifn = Othen end 
soma := | X pi X (if C then sin(y)/cos(y) else cos(y)/ wi=ztn 
sin (y)) end, 
“ise inc(soma, if n=0 then In(z) else —pf/(n X w), L); 
begin ine(soma, —pf * .5/(2 K w), L); 


zqqi= 2&2; tl := pf X nil/w X 2q); 


integer m, np, 7,7; integer array ft [1:4]; ae Bese GheG ee ed 
or := 2 step 2 until 6. n do 


real y2, p, f, t, 8, 0; 


begin 

mM:=n+ 2; np := mM X 2; 3 

: : 4, —1 X sb[m1+2], L); 
Taher Le fi}. sm-npe. fill 2a ore inc(soma, —¢ ,L); 
ae i= i aa Yi ee 1: me tl := —tl1 X (nl+m1) X (n+m1)/zq 

a ? i ’ ae , d: 
:= fat(np+1); r= 4 m+1); end; 

/ = i ae Z has 1 de ig de olygaie: > 20a xe) 


s:= if C then 0 else pf/y {7 nl; end of polygamma 
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A casual user should not be misled by the title of this algorithm. Algorithm 349 
does not offer arbitrarily precise values of the polygamma functions. It does offer 
results with precision adjustable downward from something somewhat less than the 
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Table I. Consistency Checks 


Order of Magnitude of Relative Error, machine precision = 107° 


Negative arguments 


Requested Positive 


precision arguments Digamma  Trigamma Tetragamma Pentagamma Hexagamma 
(decimal (digamma- ———_ Ss 
digits) | pentagamma) —10(+.005)0 —10(+.01)0 —10(.1)0 
6 (—6) (-7) (-5) (—5) (—5) (--5) 
9 (-9) (-10) (-—10) (-8) (—8) (8) 
10 (—11) (—11) (-11) (—11) (—9) (--9) 
1] (—12) (-—12) (—12) (—12) (—10) (11) 
12 (—13) (—18) (-—13) (—12) (-11) (C=11) 
15 (—15) (—13) (-—13) (—13) (—12) (-11) 
17 (-15) (—13) (—13) (—13) (—-12) (—11) 


—— 


floating-point precision of the computer on which it is run. Further, unlike the 
highly tuned functions to which we have become accustomed, this routine is not 
accurate to the last bit. In general, the last several decimal digits of the results of 
this procedure are in doubt. This procedure does not use rational function approxi- 
mations. Instead, it computes the polygamma functions as limits of asymptotic 
series. Hence it is relatively slow. It is on numerically shaky grounds since some 
values are the result of three separate summation processes where no efforts are 
made to rearrange the terms to preserve accuracy. Despite this, if used carefully 
within its limitations, the procedure performs as advertised. 

Algorithm 349 was translated into Fortran and tested in long precision on Stan- 
Ford University’s 18m 360/67 computer using both the Waterloo waTFIv compiler 
and 1Bm’s Fortran compilers. Since no other software to compute these functions 
is available at Stanford, the routine was checked by comparison with published 
tables of values and by several crude, but revealing consistency checks. For the 
digamma, trigamma, tetragamma, and pentagamma functions we checked directly 
against the tables in Abramowitz and Stegun [1], which give at least 10 and at 
most 11 significant digits in the range 1(.005)2.1 These were checked, requesting 
in turn 6, 9, 10, 11, 12, 15, and 17 decimal digits of precision. In this range the pro- 
cedure either provided the number of digits requested or agreed completely with 
the published tables, except that for the trigamma function, even with full machine 
precision requested, the numerical results (correctly rounded or truncated) for 
most arguments of the form 1.xx5 disagreed with the last digit of the published 
value, an error on the order of 1 X 10~°. The trigamma, tetragamma, pentagamma, 
and hexagamma functions were also compared with tables provided by the authors 
[2] for negative arguments —9.9(.1) (—.1). The results of these tests are recorded 
in Table I. 

The following internal checks were made. For positive arguments in the range 
(0,1) and (2,11), we checked the translation properties of the procedure by com- 
puting the shifts in reverse order (to full machine precision) and compared results. 
For negative arguments the procedure computes derivatives of the cotangent 
function as limits of a series. We computed the necded low order derivatives ana- 
lytically and evaluated them using standard trigonometric functions instead. For 
the functions in the first test we compared results in the range (—10(.005)0), 
skipping the poles at the negative integers. All of the values in the second test, were 
checked similarly. In the latter case, where published tables for negative argu- 
ments were available, this internal check proved sharp—whenever the internal 
check indicated an error larger than the precision of the tables, the error was found 
to be of the expected order. 

Note on translation. In the Fortran program, the first block of the Algol pro- 
cedure was made a separate initializing subroutine. The unnecessary procedures 


11(.01)2 for tetragamma and pentagamma. 
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FAC and INC were replaced by in-line code. To enhance portability, all constants 
are computed at run time. (The dimension of the arrays SB, TR, and Q are machine 
dependent.) The routine is available from the Numerical Analysis Program Li- 
brarian, Stanford Center for Information Processing, Stanford, CA 94305. It 
should not be implemented in single precision on short word-length machines. 
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procedure linprog (m, n, kappa, G, b, d, x, z, ind, infeasible, un- 
bounded, singular) ; 
value m,n; integer m, n, kappa; 
array G, b, d, x; 
bounded, singular ; 
comment linprog attacks the linear programming problem: 


real z; 
integer array ind; label infeasible, un- 


maximize d'x 
subject to Gz = bandz > 0 


Details about the methods used are given in a paper by Bartels 
and Golub [Comm. ACM 12 (May 1969), 266-268]. 

The array G[0:m—1, 0:n—1] contains the constraint coeffi- 
cients. Array 6[0:m—1] contains the constraint vector, and 
d(0:n—1] contains the objective function coefficients (cost 
vector). The computed solution will be stored in z[0:n—1], and 
z will have the maximum value of the objective function if 
linprog terminates successfully. Error exit singular will be taken 
if a singular basis matrix is encountered. Error exit infeasible 
will be taken if the given problem has no basic feasible solution, 
and exit unbounded will be taken if the objective function is 
unbounded. If kappa = 0, problem (2) of the referenced paper 
will be set up and phase 1 entered. If 1 < kappa < m — 1, prob- 
lem (4) of the paper will be set up and phase 1 entered. The last 
lappa columns of G will be preceded by the first m — kappa 
columns of the identity matrix to form the initial basis matrix. 
If kappa = m, phase 2 computation will begin on problem (1) 
with variables numbered ind[0], --- , ind{m—1] as the initial 
basic variables and variables numbered ind[m], --- , ind[n—1] as 
the initial nonbasic variables. Hence each component of ind must 
hold an integer between 0 and n — 1 specified by the user. Fi- 
nally, if kappa > m, problem (8) will be set up, and phase 2 
computation will begin with variables numbered ind[0], --- , 
ind{m] as the initial basic variables and variables numbered 
ind{m+1}, --- , ind[n+kappa—m—1] as the initial nonbasic 
variables. This option is of interest only because linprog, upon 
successful termination, leaves all variable numbers recorded in 
ind in their final order and provides kappa with an appropriate 
value. This permits linprog to be reentered at the phase 2 point 
after modifications have been made to G, b, or d. An understand- 
ing of the simplex method and the accompanying paper by Bar- 
tels and Golub will make clear what modifications can be per- 
mitted. If phase 1 is to be executed, ind must have array bounds 
[0:-+n—kappa] to allow for artificial variables. Otherwise, ind 
must have bounds [0:n+kappa—m—1]. The values in array b 
must be nonnegative if phase 1 is to be executed. The contents 
of m, n, G, b, and d are left unchanged by linprog; 
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begin 

real procedure ip2(7i, ll, uu, aa, bb, cc); 
value uu; integer 2, ll, uu; real aa, bb, cc; 

begin 

comment ip2 must produce a double-precision, accumulated 
inner product. Jensen’s device is used. The main statement in 
tp2 is 
for 11 := ll step 1 until uu do sum := sum + aa X bb 
where the local variable sum has been initialized by cc. How- 
ever, the multiplication aa X bb must produce a double-pre- 
cision result, so swm represents a double-precision accumu- 
lated sum. After all products have been summed together, sum 
is to be rounded to single-precision and used as the value of 
tp2; 

end 1p2; 

procedure trisolv(fis, fid, fie, sis, sie, fi, si, sol, rhs, mat, piv); 


value fid, fie; integer fis, fid, jie, sis, ste, fi, st; real sol, rhs, 
mat, piv; 
comment itrisolv solves a triangular system of linear equa- 


tions. The off-diagonal part of the system’s coefficient matrix 
is given by mat, the diagonal part by piv, and the right-hand 
side of the system by rhs. The solution is developed in sol. 
By appropriately setting the first five parameters, either an 
upper or a lower triangular system can be treated. Column by 
column LU decomposition of a matrix can be compactly ex- 
pressed using trisolv; 
begin real tt, pv; 
for fi := fis step fid until fie do 


begin ti := —ip2(st, sis, ste, sol, mat, —rhs); 
st := fi; pv := piv; 
sol := if pv = 1.0 then tt else tt/pv 

end 


end trisolv; 

array q,h, w,y, v[0:m], P[O:m, O:m}); 

integer array 7x[0:m+n], ro[0:m]; 

integer mu, nu, alpha, beta, gamma, gm1, tml, 2, 7, k, 1; 

real tl, (2, infinity, prevz, eta; 

real procedure Gmat(rt, ct); 
value ri, ci; integer 11, cv; 

Gmat := if ri = m then (if ct < n then 0 else 1.0) 
else if ci < n then G[r7, cz] 
else if ci — n = ri then 1.0 else 0; 

real procedure dvec(it);_ value iz; integer 17; 
dvec := if 11 < n then d{iz] else 0; 

procedure decompose (mat, bottom, top); 
value botiom, top; integer bottom, top; real mat; 

comment This procedure performs a column-by-column re- 
duction of the matrix given by mat, forming an upper and a 
lower triangular matrix into the array P. (Each diagonal ele- 
ment of the lower triangular matrix is 1.) Interchanges of rows 
take place so that the largest pivot in each column is em- 
ployed. If P already contains the LU decomposition of a 
matrix differing from mat in only the (befa)-th column, ad- 
vantage is taken of this. The parameters bottom and top enable 
decompose to concentrate on « lower right-hand submatrix of 
mat. This feature saves computation during phase 1. If mat 
is singular, exit sengular is taken; 

begin 
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for 7 := beia step 1 until mu do 
begin 
wml :=t—1; L:= izfz]; 


irtsolu(if 1=bela then bottom else top, 1, im1, bottom, j — 1, 
j,k, Plro{k], 1], mat, Plro[j], k], 1.0); 
irisolv(i, 1, mu, bottom, im1, j,k, Plro[k], 7], mat, 
P{ro[j], k], 1.0); 
tl := 0; 
for j := 7 step 1 until mu do 
begin 
(2 := Plro[j], 1]; 
if abs(t1) < abs(t2) then begin tl := (2; k := j end 
end; 
if {1 = 0 then go to singular; 
if 7 = mu then go to decompover; 
j := rolt]; ro[t] := ro[k]; rok] := 7; 
for j := 71+ 1 step 1 until mu do Proj], 7] := 
Plrofj], ¢]/t1 
end; 
decompover : 
end decompose; 
procedure findbeia; 
comment This procedure determines which of the basic 
variables is to become nonbasic; 
begin 
tl := infinity; 
for 17 := 0 step 1 until mu do 


begin 
if y(t] > 0 then 
begin 
(2 := Alel/yle]; 
if 12 < tl then begin {1 := (2; beta := i end 
end 
end 
end findbeta; 


procedure findalpha(mat, vec); real mat, vec; 
comment This procedure determines which of the nonbasic 
variables is to be made basic; 
begin 
tl := infinity; 
for 2 := mu +1 step 1 until nu do 
begin 
k := inf]; 
12 := 1p2(7, 0, mu, mat, w[7], vec); 
if (2 < tl then begin alpha :=7; tl := t2end 
end 
end findalpha; 
procedure refine(mat, rhs, od, lp, up, vec, fi, st, ord, ill); value 
ord; integer ord, fi, st; | real mat, rhs, od, lp, up, vec; label 
all; 
comment This procedure makes an iterative refinement of 
vec, which is the solution of the matrix equation mat X vec = 
rhs. The matrix mat has order ord. The LU decomposition of 
mat is specified by od, lp, and up. Exit ill is taken if mat is too 
ill-conditioned for the refinement process to be successful. 
Note the global identifier eta, whose value and purpose are 
given in the next comment; 
begin 
array cor[O:ord]; real cnorm, snorm, eps, tt; integer cni; 
cnt :=0; eps := 5 X eta; 


loop: 


cnorm := snorm := 0; cnt := cnt +1; 
for fi := 0 step 1 until ord do 
begin 
cor[fi] := —-ip2(st, 0, ord, mat, vec, —rhs); 
st:=fi; tt :=abs(vec); 
if tt > snorm then snorm := tt 


350--P 2- 0 


end; 


trisolv(0, 1, ord, 0, fi—1, fi, st, cor[sz], cor[fi], od, lp); 


trisolv(ord, —1, 0, fi+1, ord, fi, st, cor{st], cor[fi], od, up); 
for si := 0 step 1 until ord do 
begin 
tt := cor[sz]; 
vec := vec + tt; 
if abs(it) > cnorm then cnorm := abs(tt) 
end; 
if cnt > 15 then go to ill; 
if snorm ¥ 0 then 
begin if cnorm/snorm > eps then go to loop end 
end refine; 
comment At this point, infinity and eta are set to special 
values. Set infinity to the largest positive single-precision 
floating-point number. Set eta to the largest positive floating- 
point number such that 1.0 + eta = 1.0 — eta = 1.0 in single- 
precision arithmetic. The convergence of the iterative re- 
finement process which is applied in refine is determined using 
eta; 
prevz := —infinity; 
for 7 := O step 1 until m do ro[t] := 7; 
comment Determine from kappa whether phase 1 is to be 
skipped; 
if kappa > m then 
begin 
nu:=n-+ kappa —m-1; 1:=0; 
for 7 := 0 step 1 until nu do 
begin 
j:= indi]; iff >nthenl:=1; ixft]:=7 
end; 
mu := if | = 0 then m — 1 else m; 
go to phase 2 
end; 
mu:=m—1; gamma := m— kappa; gmi := gamma — 1; 
nu:=n+gml; l:=n-—m; 
comment Set up the appropriate phase 1 problem; 
for 7 := 0 step 1 until gm1 do 


begin 
taft] := n+4; 
Pi, 7] := 1.0; 


for 7 := 7+ 1 step 1 until gml do Piz, 7] := Ply, 1] := 0; 
for j := gamma + 1 step 1 until mu do P(t, 7] := Gz, I+] 
end; 
for 7 := gamma step 1 until mu do 


begin 

igi] := +4; 

for j := O step 1 until gmi do P{i,j] := 0 
end; . 


for 1 := mstep 1 until nu do iz[i] :-= 74 — m; 
beta ‘= gamma; 
go to no removal; 


new phase 1 cycle:; 


comment Begin a new simplex step on the phase 1 problem. 
Check the phase 1 problem objective function; 

if ip2(z, 0, mu, w{t], b[z], 0) = 0 then go to phase 2; 

comment Determine which nonbasic variable is to become 
basic; 

findalpha(G{j,k),0) ; 

if (1 > O then go to infeasible; 

j := t2[alpha); 

comment Solve a linear system for a vector y; 

trisolu(gamma, 1, mu, gamma, | — 1, l, k, v{k], Gfro[t],7], 
Plro[t},k], 1.0); 

trisolu(mu, —1, gamma, 1 + 1, mu, l, k, y{k], o[1], 
Plro(l},k}, Pfroft),l}); 

for 7 := 0 step 1 until gm1 do 
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begin 
L:= roft]; 
y{i] := —ip2(k, gamma, mu, y{k], PUl,k], —GIl,7]) 
end; 
comment Use the vector y to determine which basic variable 
becomes nonbasic. If the variable which has become non- 
basic is an artificial variable, remove it entirely from the 
problem and make an appropriate rew and column inter- 
change upon the basis matrix P; 
findbeta, 
if beta > gamma then 
begin 
k := tx[alpha); ix{alpha] := ix|beta]; ix[beta] := k; 
go to no removal 
end; 
k := rolgmi]; 7 := ro[gm1] := ro[beta]; ro[beta] := k; 
Pik, beta] := 1.0; P[t, beta] := 0; 
ix[beta] := tx[gm1]; ix[gm1] := iz[alpha]; beta := gml; 
for 7 := alpha + 1 step 1 until nu do iz[i—1] := iz{i]; 
gamma := gml;gml := gml — 1; nu := nu —1; 
no removal:; 
comment Produce the LU decomposition of the new basis 
matrix; 
k := t2[beta]; 
for 7 := 0 step 1 until gml1 do Plro[t],beta] := G[ro{z],k]; 
decompose (G[ro[j],l], gamma, gamma); 
comment Find the basic solution h; 
trisolu(gamma, 1, mu, gamma, j — 1, 7, k, vk], 
blro[j\], Plroljl,k], 1.0); 
trisolu(mu, —1, gamma, j + 1, mu, j, k, h{k], vfj], 
Pfro[j],k], P{ro[jl,j]); 
for 7 := 0 step 1 until gml1 do 
begin 
k := ro(t]; 
hit] := —ip2(j, gamma, mu, h[j], Plk,j], —b[k]); 
wk] := —1.0 
end; 
comment Solve a linear system for the vector, w, of simplex 
multipliers; 
for 7 := gamma step 1 until mu do 
begin 
il := 0; 
for j := 0 step 1 until gml1 do #1 := t1 + Plro{j],7]; 
vz] := tl 
end; 
trisolv(gamma, 1, mu, gamma, i — 1, ¢, j, vf], v[Z], 
Plro[j], t) Plrolé],i]); 
irisolv (mu, ~— 1, gamma, i + 1, mu, i, j, wlrolJll, vl2], Plroly), a, 
1.0); 
go to new phase 1 cycle ; 
phase 2: ; 
comment Set up the appropriate phase 2 problem and make 
an initial LU decomposition if necessary ; 


beta: =0; 
if kappa < m then 
begin 
if gamma > Othen 
begin 
kappa := m; nu := nu+l;mu:=m; 
tz[nu| := iz[mu]; tz[mu] := n +m 
end 
end; 


if kappa > m then go to decomp 
else irisolv(0, 1, mu, 0,7 — 1,39, k, q{k], if ro[j] = m then 0 else 
biroljll, Plrofj],4], 1.0); . 7 
new phase 2 cycle: ; 
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comment Begin a new simplex step on the phase | problem. 
Solve a linear system for the vector, w, of simplex multipliers; 

trisolv(0, 1, mu, 0,7 — 1, 1,7, v{j], dvec(ix[t]), Pfrof{y],z], P{ro[c],¢]); 

trisolu(mu, —1, 0,7 + 1, mu, 2, 7, wlro[y]], vf2], Plro{j],c], 1.0); 


comment Determine which nonbasic variable is to become 
basic; 
findalpha(Gmat(j,k), —dvec(k)); 
comment Check whether the solution has been found; 
if t1 > 0 then go to finished; 
not done yet: 
4 := ix{alpha); 
comment Solve a linear system for a vector y; 
trisolv(0, 1, mu, 0, j — 1, 7, k, vlk], Gmat(ro[j],7), P{ro[j],k], 1.0); 
trisolu(mu, —1, 0,7 +1, mu, j,k, ylk], of7], Plrolg],k], Plrotjl,j)); 
comment Use y to determine which basic variable is to be- 
come nonbasic; 
jindbeta; 
if {1 = infinity then go to unbounded; 
k := iz{[beta]; 1x[beta] := iz{alpha]; ixlalpha] := k; 
decomp: ; 
comment Produce the LU decomposition of the new basis 
matrix; 
decompose(Gmat(ro[j],l), 0, beta); 
comment Compute the basic solution h; 
trisolv(beta, 1, mu, 0, 7 — 1, 7, k, glk], if ro[j] = m then 0 else 
bfro[j]], Plrof{j],k], 1.0); 
trisolu(mu, —1, 0,7 +1, mu, j,k, Aik], gly], Plroly],*], Plrofyl,j); 
go to new phase 2 cycle; 
jinished: ; 
comment Refine w and the basic solution h. Compute the 
objective function. Check the refined results to determine 
whether the optimum has been reached. If the check indicates 
nonoptimality but the objective function is less than any 
value previously computed for it, return the best basic solu- 
tion obtained so far and print a warning that the solution 
has doubtful validity; 
refine (Gmat(ro[j],iz[t]), dvec(tx[t]), P[ro[j],¢], Plro[z],z}, 1.0, 
wlroljll, 2, 7, mu, singular); 
Z2°= tp2(t, 0,m—1, wilt], b{z], 0); 
if z < prevz then 
begin comment Print out ‘‘doubtful solution’; end 
else 
begin 
prevz := Zz; 
refine(Gmat(ro[j], ix[k]), if ro[j] = m then 0 else b[ro[j]], 
P{ro{j],k], 1.0, Plro[y],j], Ak], 7, k, mu, singular) ; 
L:=n—1; kappa := nu +1; 
for 7 := 0 step 1 until / do 2[2] := 0; 


for 7 := 0 step 1 until nu do indi] := iz{z]; 
for 7 := 0 step 1 until mu do 
begin 
j= tz(t]; 
if j < n then z[j] := h{z] 
end; 


findalpha(Gmat (j,k), —dvec(k)); 
if (1 < Othen go to not done yet 
end 
end linprog 
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Comments. ROMINT calculates the approximate value, 
VAL, of the definite integral 


r= fo ropax 


A 


and an error bound ERR for VAL, i.e. | VAL — I| < ERR. The 
integrand F(X) must be given as a function subprogram with the 
heading FUNCTION F(X). VAL is obtained from a modified 
form of Romberg quadrature which is less sensitive to the ac- 
cumulation of rounding errors than the customary one. In this 
procedure, which was devised by Krasun and Prager [1], the 
following ‘“‘skeleton’’ Romberg table is constructed: 


TS 
TY 
Ro? T2 
R,° . 

Ro RY T° 
R} ‘ _ 
Ro: : ee, 
, lie 
Rr" 

Ry™ 


where m < MAXE, MAXE being on entry the maximum number 
of extrapolations wanted. In this subroutine MAXE < 15. The 
quantities Ret (k = 0,1, --- , m) are the rectangle values, 


Bp=A®* _ 1\B-A 
R& = ok b r(a4(s-} 9% ), 


jal 


which are calculated using a procedure proposed by Rutishauser 
[2] to reduce the effect of rounding errors. The quantities R,* 
(j > 0) are computed using the usual extrapolation formula: 


+1 0 pk 
Re = Ret y REL Bis k=0,1,::+,m—j. 


47-1 
The formula (see [1]) 
2-4-1 —]1 : 
T;? = Rj-1 + aaa (Ti — Rj), fg = lees ym, 


enables one to determine the extrapolated trapezoid values 
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T,°, T2°, --- , Tm® in the skeleton table from the trapezoid value 


B-A 


T? = [F(A) + F(B)] 


and the rectangle values Ro®, Ri°, «+: , Rm—1- In this subroutine 
only one linear array for storing the quantities Rf? ’, j =0,°-:, 
m (< MAXE) is required. 

The subroutine is left when (see [3]) 


0 0 
- [Tot — Bell & aps, 


where EPS specifies the desired accuracy, or when MAXE ex- 
trapolations have been performed. On exit, VAL = (7,9 + R,°)/2 
(m < MAXE) and N = 2+) + 1 is the number of function 
evaluations. The exit value of MAXE is m unless the maximum 
number of! extrapolations wanted has been performed without 
the desired accuracy being obtained, in which case the exit value 
of MAXE is zero. 

This subroutine can be used to estimate the definite integral J 
provided F(X) is at least three or four times differentiable and is 
not periodic with period B — A. 


Test cases. Two test cases were carried out on the IBM 1620 
of the Computing Laboratory, University of St. Andrews, to 
compare ROMINT with a Fortran II-D version of havtetntegrator 
[4]. The calculations were carried through in single-precision, i.e. 
working to 8 significant decimal digits. The results are summarized 
in the following table. 


Number of 


Integrand A B EPS Bie he havieintegrator ROMINT _ extrapo- 
lations 
cosz 0.0 7/2 10¢ 1.0 0.99999985 0.99999995 3 
ee?” 0.0 4.3 10-¢ 0.88622692 0. 88622665 0. 88622675 5 
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SUBROUTINE ROMINT 
Cc BERK A a AG a a OK AK OK NC OK a 
* (VALsERR EPS 9A9B aNyMAXE) 
DIMENSION RM(16) 
c INITIAL TRAPEZOID VALUE oe 
T = (B-A)*(F(A)+F (B))*055 


Cc INITIAL RECTANGLE VALUE ee 
RM(1L) = (BrA)*F ( (A+B) *O.5) 


COLLECTED ALGORITHMS (cont.) 


oO OOO O 


aa 


Aa an 


N = 2 
R= 4 
DO 11 K = 1,MAXE 
BB = (R¥0.5-1.)/(R-Le) 
IMPROVED TRAPEZOID VALUE e 
T = RM(1)+BB*( T-RM(L)) 


DOUBLE NUMBER OF SUBDIVISIONS 
OF (A,B) eo 


N = 2*N 
$s =0 
H = (B-A)/FLOAT(N) 


CALCULATE RECTANGLE VALUE ec 
IF (N32) Lyly2 
NO =N 
GO TO 3 
NO = 32 
IR(N-512) 494,95 
NL =N 
GO TO 6 
Nl = 
00 9 K2 = 1l9yN7512 
S1 =0 
KK = K2+N1-1 
DO 8 Kil = K2,KK,32 
SO =0 
KKK &® K1+NO-1 


-_ 


On PWN 


DO 7 KO = KlyKKKs2 
SO = SO+F(A+FLOAT(KO) *H) 


7 CONTINUE 
Sl = SO+S1 
8 CONT INUE 
S = S+$1l 
9 CONTINUE 
RM(K+1L) = 2.¥*H*S 


END CALCULATION OF RECTANGLE VALUE. 


R =4 
FORM ROMBERG TABLE FROM RECTANGLE 
VALUES oe 
00 10 J = 1,K 
L = K+l-J 
RM(L) = RM(L41)4+¢(RM(L4+1)-RM(L)) 
* /(R=1e) 
R = 4e*R 


10 CONTINUE 


ERR = ABS(T-RM(1))*0.5 


CONVERGENCE TEST ee 
IF(ERR-EPS) 12512511 


11 CONTINUE 
12 VAL = (T+RM(1))*0.5 
N = N+1 
TF (K~MAXE) 14913513 
13 MAXE = 0 


GO TO 15 
14 MAXE = K 
15 RETURN 
END 
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There is an error in calculating the output value MAXE in the 
algorithm in the case where the desired accuracy is obtained by 
the last requested extrapolation. Statement 11 (the end of the DO 
loop on K) should be followed by: 

K=0 

12 VAL = (T-+-RM(1))#0.5 

N = N+1 

MAXE = K 

RETURN 

END 
When the two test cases were repeated in single precision on the 
CDC-6600, the 14-digit arithmetic yielded results accurate to 10 
digits with the same number of extrapolations as used to get 6- 
digit results on the 8-digit IBM-1620, The time spent in ROMINT 
was 0.7 and 2.0 msec for the cosine and e-*’ integrals respectively, 
with a total time of 1.1 and 3.8 msec when the time spent evaluating 
the functions is included. 
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Algorithm 351 was compiled and run successfully in Fortran 
IV on a CDC 6400 computer. Computation times for equivalent 
orders were essentially the same as for a FortRan version of Al- 
gorithm 60 Romberg Integration [1]; storage requirements were 
approximately 20 percent greater. 

Algorithm 351 incorporates two modifications to the standard 
Romberg algorithm, each designed to reduce roundoff: (1) the 
Krasun and Prager [3] replacement of the table of trapezoidal 
values 7;* with a table of rectangular values R;*; (2) the method 
proposed by Rutishauser [6] for the evaluation of the rectangular 
sums #o*. Since neither of these modifications has been properly 
evaluated we have chosen to compare integral values returned 
by five variants of the Romberg algorithm: 

1. Conventional Romberg integration as described by Algo- 
rithm 60 

2. A Krasun and Prager modification of Algorithm 60 (7/4 
table replaced by R# table) 

3. A Rutishauser modification of Algorithm 60 (7,* table 
extrapolation with improved evaluation of the Ro) 

4. Modified Romberg integration as described by Algorithm 
351 (R# table; improved Ro* evaluation) 

5. Algorithm 351 with the Rutishauser procedure replaced 
by the standard evaluation of the Ro’ (R;* table extrapolation) 


COLLECTED ALGORITHMS (cont.) 


The following test integrals were investigated. 


11 
A. i x dz, 
01 


a = 3.0, 4.0, 5.0 


1 
B. / (1 + 2)! dz, a = 1.0, 4.0 
0 


10 
C. i In x dx 
1 


5 
D. i en" dx 
0 


Integral A was suggested by Thacher [7], Integral B by Rabi-. 


nowitz [5], Integral C by Hillstrom [2], and Integral D by Hill- 
strom and by Kubik [4]. All computation was carried out in CDC 
6400 single-precision floating-point arithmetic. Results were re- 
corded to 14 decimal digits. (CDC 6400 word length corresponds 
to 14+ decimal digits.) The data obtained in this manner are 
summarized in Tables I-IV. 

For a specified order of extrapolation m, Algorithm 60 variants 
require 2” -+- 1 function evaluations and return T',,°. Algorithm 
351 requires 2+) + 1 function evaluations and returns 7'q}. 
Thus one cannot meaningfully compare integral values returned 
by the two algorithms for the same specified order. We have there- 
fore chosen to compare integral values resulting from the same 
number of function evaluations and have tabulated these data 


in terms of the Algorithm 60 order m. The corresponding specified 


order for Algorithm 351 variants is m — 1. 

In each example considered, Algorithm 351 returns integral 
values for the optimum extrapolation order that are more accurate 
than the Algorithm 60 solutions by from one to two significant 
figures. There is, of course, no increase in the rate of convergence 


and little difference in solution accuracy for approximation. 
orders less than that corresponding to the maximum attainable. 


accuracy. If one were interested in, e.g. six or eight significant 
figure accuracy, either algorithm would be satisfactory. If ac- 
curacy requirements are not severe and one is satisfied with 
integral values correct to a number of significant figures less 
than half the computer word length, either algorithm may be used. 
If one seeks the maximum achievable accuracy, Algorithm 351 
is clearly the proper choice. 

Tables I-[V include data recorded when the order was over- 
specified, i.e. when m was greater than that required for optimum 
accuracy. For both algorithms the accuracy at first increases with 
increasing order. This continues until an optimum accuracy ob- 
tains. With Algorithm 60 a further increase in m results in a 


decline, at times rather rapid, in evaluation accuracy. With 


Algorithm 351 there is little loss in accuracy with increasing 
order. The accuracy decline rate is strongly retarded and in many 
cases practically eliminated. This is a very significant result. 

In routine use of the algorithms, the unwary may overestimate 
the order required for optimum convergence (Algorithm 60 termi- 
nates only when a specified order has been obtained) or may 
specify an accuracy criterion for termination that cannot be 
satisfied. With Algorithm 351 the only loss is that of computer 
time; with Algorithm 60 solution accuracy may be impaired. 

From the data presented in Tables I-IV we may determine the 
extent to which each of the procedural modifications contributes 
to the overall superiority of Algorithm 351. It is immediately 
evident that the Krasun and Prager modification has little effect 
either on the accuracy of the algorithms or on the loss of accuracy 
as the optimum order is exceeded. Results obtained using this 
modification differ from those returned by Algorithm 60 by at most 


2 in the 14th figure. When the Rutishauser procedure is sub-_ 


tracted from Algorithm 351, the algorithm becomes, for all prac- 
tical purposes, equivalent in accuracy to Algorithm 60. This con- 
clusion has been further supported by results obtained in the 


351-P 3- 0 


evaluation of eight additional test integrals selected from the 

literature. 

If, on the other hand, the Rutishauser procedure is added to 
Algorithm 60, the results obtained are essentially the same as 
those recorded for Algorithm 351. Clearly the Rutishauser modifi- 
cation is the dominant factor determining the superiority of 
Algorithm 351. 

The success of the Rutishauser modification tempts one to 
expand the procedure to include an additional summation level. 
Experiments with such expansions indicate that they may be of 
value where slow Romberg convergence requires the use of orders 
m> 18. 

The following changes are suggested as possible improvements 
in the algorithm. The integration interval (B—A) is now com- 
puted K + 2 times where K is the order of approximation on exit 
from the routine. We suggest an initial definition of a variable, 
e.g. SH = (B—A) and the replacement of (B—A) by SH in these 
statements where (B—A) appears. Initialization should also 
include a test to insure that the maximum extrapolation order 
MAXE permitted is less than or equal to 15 with a possible re- 
placement. MAXE = 15 if this condition is violated. Alternatively, 
one could replace the statement DO 11K = 1, MAXE with 
DO 11 K = 1,15 and test for K < MAXE prior to executing 
statement no. 11. The GOTO3 statement following statement 
no. 1 should read GO TO 4. If N < 32, N is also < 512. 

Upon exit, the input parameter MAXE is assigned either the 
value MAXE = K, where K is the approximation order, or 
MAXE = 0 if the accuracy criterion has not been satisfied. We 
believe that it is poor programming practice to have a subroutine 
alter the value of an input parameter. We suggest the addition of 
an output parameter, e.g. MFIN = K which returns the order 
on exit. Where we now set MAXE = 0, we could set MFIN = 16. 
One can test as easily for MFIN < 15 as for MAXE = 0. This 
would eliminate the necessity for resetting MAXE each time the 
subroutine is entered. It is also useful to return the final value 
of the accuracy ERR. In the event that MAXE = 0, one could 
test ERR to determine whether or not the returned integral 
value falls within acceptable limits. 

In practical applications we prefer to express the procedure 
as a function subprogram and to add the name of the generating 
function F to the argument list. We also consider a test for rela- 
tive error rather than absolute error to be more useful in routine 
use of the algorithm. 

The author wishes to thank the Mobil Research and Develop- 
ment Corporation for permission to publish this information. 
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TABLES. Comparisons oF RoMBERG METHOD VARIATIONS 


(KP = Krasun-Prager Modification; RUT = 


Variations Returning Tn? Variations Returning Ty) 


Algorithm 351 |A!sorithm 


Rom- Algorithm 60 60 60 351 
berg +KP | +RuT | KP + RUT) | KP onty) 
© \Order aN a | 
i“ oy a Digits ay 
Digits 1-14 | NSF |) NSF SL) NSF) GS’ | NSF 34) NSP 
q q 8 


1 
I. In tHe Evauvation or I(a) = i; (1 + 2%)-1 dx 
I(1) = 0.69814 71805 59945; I(4) = 0.86697 29873 3991 


69314 74776 4482] 6 |4482| 6 14482) 6 |79014 8123) 5 {8123) 5 
69314 71819 1673] 8 |1673} 8 {1673} 8 |71830 7192 
69314 71805 6227; 11 16228} 11 6227) 11 |71805 6360) 11 |6360) 11 
69314 71805 5991] 13 |5992} 13 5992] 13 {71805 5993] 13 5992] 13 
69314 71805 5987} 12 |5988] 12 |5991! 13 |71805 5992; 13 5988) 12 
69314 71805 5984) 12 |5984) 12 {5990} 13 /71805 5992; 13 5984) 12 
60314 71805 5971] 12 |5972| 12 |5989} 12 |71805 5990) 13 |5972) 12 
69314 71805 5951] 12 |5951/ 12 |5988) 12 |71805 5989) 12 {5951) 12 
69314 71805 5906] 11 |5906) 11 |5991) 13 /71805 5990; 13 |5906) 11 
69314 71805 5822] 11 (5822) 11 (5987) 12 |71805 5989) 12 |5822) 11 


4.0) 4 |86697 29736 8070} 7 18070) 7 {8070} .7 (30046 3711) 7 {8711| 7 
86697 29872 2539} 9 |2539) 9 )2539) 9 |29872 1216) 9 j1216) 9 
86697 29873 4006) 12 |4006) 12 |4007; 12 (29873 4005) 12 |4003] 12 
86697 29873 3983] 12 |3984) 12 |3987) 13 |29873 3988, 13 |3984) 12 
83697 29873 3977| 12 |3978] 12 |3986) 13 {29878 3987; 13 |3979| 12 
9 {86697 29873 3968] 12 |3964) 12 /3985] 12 /29873 3986; 13 |3964) 12 
10 |86697 29873 3939] 11 |3940} 11 -|3985} 12 /29873 3985] 12 |3940) 11 
11 |86697 29873 3890; 11 {3890} 11 |3984) 12 |29873 3986) 13 /3890) 11 
12 |86697 29873 3787 11 |8788} 11 {3983} 12 [29873 3985} 12 {3788} 11 


1.0 


-_ 
Se Onan, w 


_ 
-_ 


_ 
»w 


11 
II. In tHe Evauuation or I(a) = ies a7? de 
1(3). = 0.49995 86776 85950 X 104; (4) = 0.33333 30828 95066 X 108; 
1(5) = 0.24999 99982 9247 < 108 
3.0] 8 {50289 45604 1249) 2 1249) 2 11255) 2 [49952 9475) 2 |9469) 2 
9 |50007 88217 4010] 3 |4010/ 3 |4037} 3 /88324 8156) 3 |8128| 3 
10 /49996 05996 3754] 5 |3755| 5 |3813} 5 |05997 5088] 5 |5029| 5 
11 49995 86888 2917/ 7 |2917| 7 {3041} 7 |86888 3087/ 7 (2962) 7 
12 /49995 86777 0553) 10 1/0553] 10 0814] 10 |86777 0815] 10 |0553} 10 
13 |49995 86776 8069] 10 |8070) 10 |8588) 12 [86776 8590} 12 |8070] 10 
14 |49995 86776 7547| 10 |7549| 10 18585) 12 |86776 8587] 12 |7549| 10 
15 [49995 86776 6495] 10 |6496/ 10 [8581] 12 |86776 8583] 12 (6496) 10 


4.0; 8 -(33918 76383 3713} 1 {3713} 1 |3717) 1 |83821 8573) 1 (8568) 1 
9 (33362. 40891 0012} 3 0011} 3 ;0028) 38 |41103 2353) 3 |2337] 3 
10° |83333 86458 8643} 4 )8642) 4 |8682) 4 |86461 5904) 4 |5865) 4 
11 (38333 31207 4466; 7 (4466) 7 |4547| 7 |31207 4679} 7 |4598) 7 
12 38333 30829 8056; 9 '8055; 9 |8220) 9 {30829 8220) 9 |8056) 9 
13 {33333 30828 9178 11 |9178} 11 |/9508} 13 |30828 9509} 13/9178) 11 
14 /38333 30828 8842) 10 |8843] 10 |9500} 12 |30828 9501; 12 |8843] 10 
15. |33833 30828 8163) 10 /8163) 10 {9497/ 12 |30828 9499) 12 |8163; 10 


5.0] 8 {25979 73076 7608} 1 /|7608} 1 |7611) 1 |82577 2026} 1 {2023} 1 
9 |25058 17539 3846} 2 3846) 2 |3857; 2 {17890 9312; 2 |9300) 2 
10 {25001 31264 6257| 4 |6257) 4 |6282) 4 |31270 0511) 4 |0486/ 4 
11 |25000 01021 0524; 6 /0524; 6 [0576) 6 |01021 0887) 6 |0835] 6 
12. |24999 99985 6515} 9 /6515) 9 [6621} 9 |99985 6622) 9 |6516) 9 
13 {24999 99982 9053; 1% |9053) 11 (92671 12 (99982 9268! 12 {9054) 11 
14 [24999 99982 8817; 11 /8818) 11 /9242) 13 /99982 9243] 138 |8818) 11 
15 [24999 99982 8379] 10 |8380) 10 |9241) 12 |99982 9242} 13 18380 10 


Rutishauser Modification; NSF = Number of Significant Figures) 


Variations Returning T° Variations Returning T»} 
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a an an Digi os 
Digits 1-14 | NSF a NSF |'3.4| NSF ees | NSF a4 | NSF 
a q ~) 
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III. In trae Evauvation or I = f ‘Inedz = 


14.025 85092 99404 6 


4 |14025 60234 7275) 6 {7275|° 5 |7275| 5 (60498 3885) 5 |3885| 5 
5. |14025 84455 4627/ 6 /4627) 6 |4627;} 6 . |84433 5675} 6 {5675) 6 
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IV. In tHE Evauvation or I = sf e~ dz = 
0.88622 69254 51396 


5 188622 59970 9402} 5 |9043/ 5 (9042! 5 (59296 9073) 5 19073) § 
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8 -|88622 69254 5117} 12 (5117; 12 /5134) 12 /69254 5135; 13 |5117) 12 
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11 [88622 69254 4974) 11 4975) 11 /|5180; 12 69254 5133) 12 /4976) I1 
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13 |88622 69254 4463; 10 (4463) 10 |5128) 12 (69254 5129) 12 /4464) 10 
14 [88622 69254 3801; 10 |3802! 10 !5125) 12 ‘69254 5127! 12 |3803; 10 
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An unfortunate precedent has been set in several recent al- 
gorithms of using an illegal FORTRAN construction. This oon- 
sists of separating an initial line from its continuation line by a 
comment line, and is forbidden by the standard (see sections 3.2.1, 
3.2.3 and 3.2.4 of [1, 2]). The offending algorithms are to date: 
332, 344, 351 and 359. 

While this is perhaps a debatable decision by the compilers of 
the standard, and trivial to correct, it seems a pity to break the 
rules just for a pretty layout as has been done. 

REFERENCES: 

1. ANSI Standard FORTRAN (ANSI X3.9-1966), American 
National Standards Institute, New York, 1966. 

2. FORTRAN vs. Basic FORTRAN, Comm. ACM 7 (Oct. 1964), 
591-625. 


COLLECTED ALGORITHMS FROM CACM 


ALGORITHM 352 

CHARACTERISTIC VALUES AND 

ASSOCIATED SOLUTIONS OF 

MATHIEU’S DIFFERENTIAL 

KQUATION [822 

DonaLp 8S, CLteum (Reed. 2 June 
1967, 1S Apr. 1968, 6 Jan. 1969 
and 10 Mar. 1969) 

Aerospace Research Laboratories 
Wright-Patterson Air Force Base 
OH 45433 


KEY WORDS AND PHRASES: Ma- 
thieu’s differential equation, Mathieu 
function, characteristic value, periodic 


solution, radial solution 
CR CATEGORIES: 5.12 


Comments Algorithm 352 is a package of 
double-preeision FORTRAN routines which 
consists of the folowing primary routines: 


MECVAL---referred to as Algorithm 352 

(Part A) 

MATH--referred to as Algorithm 352 

(Part B) 

BESSIL -referred to as Algorithm 352 

(Part C) 

MECVAL computes characteristic values of 

Mathieu's differential equation. MATH 

computes the associated solutions of this 

equation, using BESSEL as an auxiliarv 
routine to evaluate Bessel funetions. This 
latter routine may be used independently. 

There are other, secondary routines in- 
cluded in the package, and the numbering 
system (e.g, Algorithm 352 (Part A.1)) indi- 
cates somewhat the mutual relation between 
them, as well as their relation to the primary 
routines. The functioning of the routines 
and the linkages between them are explained 
in the comments prefacing each one. All 
literature citations refer to the following 
list. 

REFERENCES: 

1. ApramowiTz, M., AND Srecun, I. A. 
(i:ds.). Handbook of Mathematical 
Functions. NBS Appl. Math. Ser. 55, 
US Govt. Print. Off., Washington, 
D.C., 1964. 

2. Buancu, G. Numerical evaluation of 
continued fractions. STAM Rev. 6, 4 
(1964), 383-421. 

. Buancu., G. Numerical aspects of Ma- 
thieu eigenvalues. Rend. Cire. Mat. 
Palermo (2) 18 (1966), 51-97. 

4. BuaNcH, G., AND CLemMM, D.S. Jables 


a“ 


Relating to the Radial Mathiexn Func- 
tions, Vol. 1, Functions of the First 
Kind. US Govt. Print. Off., Washing- 
ton, D.C., 1962. 

5. BuancuH, G., AnD Cimmm, D. 8S. Tables 
Relating to the Radial Mathieu Func- 
tions, Vol. 2, Functions of the Second 
Kind. US Govt. Print. Off., Washing- 
ton, I).C., 1965. 

6. Ince, Ef. L. Tables of the elliptic cvlin- 
der funetions. Proc. Roy. Soc. Edin- 
burgh 62 (1932), 355-428; also Zeros and 
turning points. Proc. Roy. Soc, Edin- 
burgh 52 (1932), 424-433. 

7. National Bureau of Standards. Tables Re- 
lating to Mathieu Functions. Appl. 
Math. Ser. 59, US Govt. Print. Off., 
Washington, 1).C., 1967. (second ed.) 

8. Strarron, J. A., Morse, P. M., Cut, L. 
J., AND Hurnen, R.A. Elliptic Cylin- 
der and Spheroidal Wave Functions. 
Wiley, New York, 1941. 


Algorithm 352 (Part A) 
MFCVAL (Characteristic Values) 


Comments ‘Thesubrountine MFCVAL com- 
putes the first N characteristic values, a, to- 
gether with upper and lower bounds, of 
Mathieu's differential equation for nonnega- 
tive values of the real parameter, q. The 
equation can be written in the form 


y” + (a — 2q cos 2x)y = VU, (1) 


where a = a, (a = b,) indicates a character- 
istic value associated with the even (odd) 
periodic solutions. 

The method consists of three steps: (1) 
caleulate a rough approximation based on 
coefficients obtained from curve-fitting of 
available tabulations, (2) determine crude 
upper and lower hounds, and (3) iterate, 
using a variation of Newton’s method. For 
a justification of this method, see {3]. 

Explanation of the arguments: 

N the given number of characteristic 
values desired 

R_ given as N—1 or N according as the 
characteristic values are to be asso- 
ciated with the even or odd_ solu- 
tions, respectively 

QQ the given nonnegative parameter q 

CV the computed 6 by N array of charuc- 
teristic values and bounds 

J the number of characteristic values 
successfully computed. J # N indi- 
cates that J values were computed 
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with an error occurring on the J + 1 
value. A printed message will ac- 
company such an error condition. 
The output array, CV, must be appro- 
priately dimensioned in the calling program 
and upon return will contain the following 
data: 
For the Kth characteristic value, K = 1, 


Qe J, 
CV (1, kK) the characteristic value a 
CV (2, K) the function D(a) = —T', (a) 


Tin! (Q) 
CV (3, 15) ap, a lower bound of a 
CV (4, 1K) the funetion D(az) 
CV (5, K) ae, an upper bound of a 
CV (6, Ky the function Day). 

teference is again made to {3], where the 
funetion 7',,(a) is defined and it is proved 
that 7,(a@) = Oif and only if a is a charac- 
teristic value. From this, it can be said that 
the function D is an indication of the ac- 
curacy of its argument, since a + D(a) 
would be the value of the next iteration. 

The first executable statement in 
MFCVAL sets a tolerance of 1073. This may 
be changed by the user, but the following 
comments should be heeded if it is at- 
tempted. 

If it is desired to reduce the tolerance iu 
order to achieve the greatest possible ac- 
curacy, care should be taken that the toler- 
ance is not less than 107‘*~» when executing 
the routines on a machine which uses n-digit 
arithmetic. In other words, if the user’s 
computer employs 24-digit arithmetic, this 
tolerance should be no less than 10722. A too 
small tolerance will impose an unattainable 
accuracy requirement and overflow may 
oceur.! 

On the other hand, some time-saving may 
be achieved, at the expense of accuracy, by 
making the tolerance less stringent. A toler- 
ance of 10-¢ will produce results good to at 
least d digits. This is a conservative esti- 
mate, since one additional iteration is per- 
formed after the tolerance is met. and, nor- 
mally, the convergence of successive itera- 
tions is quadratic. 

Perhaps it should be noted again that the 
accuracy of any characteristic value, a, can 
be determined from the size of it relative tu 
the funetion D(a). See the description of the 
contents of the output array CV. MFCVAL 
calls on the subroutines: 


BOUNDS—referred to as Algorithm 352 
(Part A.1) 
MFITR8—referred to as Algorithm 352 
(Part A.2) 
TMOFA—referred to as Algorithm 352 
(Part A.3) 


1 The constant in statement numbers 425 and 445 is intro- 
duced to avoid the possibility of a zero tolerance. This 
should not be altered unless the routines are being run on 
a machine which uses arithmetic of more than 16 digits, 
und then it must not he less than 107(*®), with n defined 
as above. 


¢ 


SUBROUTINE MFCVAL (NeReUUel Ved? 


HRHRHHRRHRHHK EH HERE 
INTEGER 
+ JeK oKK ol oMeoNoh yo TYPE 
DOUBLE PRECISION 
* AsCVeDL »DRsDIMsU0 sQus 
* TsTMesTOLsTOLA 
DIMENSION 
* CVC69N) 
EQUI VALENCE 
* (DL »sDR»oT) 
COMMON /MF1/ 
* QeTOL»sTYPE »sOUMMY (4) 
TOL = 120-13 
IF (N=R} 10910920 
IO L = 1 
GO TO 30 
20 L = 2 
39 Q = QQ 
NO 590 K = loN 
J = K 
IF (Q) 9609490949 
490 KK = MINO(Ks4) 
TYPE = 2*MOD(L92)4+M0D(K=-L+152) 
FIRST APPROXIMATION 
GO TO (1009200200 5400)» KK 
100 IF (Q=-1200) 11021469140 
110 GO TO (1209130)5s L 
120 A = 1.00-Q~0125D0*Q*Q 
GO TO 420 
130 A = Q*Q 
A = A®(=0e5D0+2e 05468 75D00*A) 
GO TO 429 
140 IF (Q-2.D0) 1505180918C 
150 GO TO (1609170)5 L 
160 A = 12033D0-1e0746L0#0— 
* eC688D0*O*Q 
GO TO 420 
170 A = 023D0-2495D0*Q- 
* el IIDOFQ*Y 
GO TO 429 
180 A = ~0e25D00-2-D0*G+ 
* 2eLO*DSORTIQ) 
GO TO 420° 
200 DL et 
IF (Q*OL—-60D0) 21053500350 
210 GO TO (2209230)5 L 
220 A = 42601521D0-0* 
* (6046D0+ eC667857D00#Q) 
GO TO 429 
230 A = 1-00+1-.05007D0#Q- 
* © 180143D0*Q#Q 
GO TO 42% 
300 IF (Q-B8.D0) 31093509350 
310 GO TO (32059330)5 L 
320 A = 869386700+.178156D0*Q- 
* 00252132D00*Q*G 
GO TO 420 
330 A = 3-7001700+4+.953485D0*0— 
* °00475065D0*¢ *Q 
GO TO 429 
350 DR = K-l 
A = CV(lsK-1)-DR+ 
* &4eD0*DSQRT(Q) 
GO TO 420 
400 A CV CL eK=1)-CVC1 eK=2) 


A 3eDO0*A+CV( 1 9K-3) 
420 IF (QeGEeleD9) GO TO 440 


IF (KeNEol) GO TO 430 
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425 TOLA = DMAX1(DMINI( TOL »DABSI(A)) 
* 9120-14) 
GO TO 450 
436 TCLA = TOL*DABS(A} 
GO TO 450 
440 TOLA = TOL*DMAX1(GsDABS{(A) ) 
445 TCLA = DMAX1(DMINICTOLAsDABS(A}) 
* »¢4D0*DSQRT(N)) 
* 91e¢0~-14) 


¢ CRUDE UPPER AND LOWER &bOUNDS 
459 CALL BOUNDS (KeAsTOLAsCVeNoh) 


IF (MeNEed) 
* IF (M-1) 47099109990 


Cc ITERATE 
CALL MFITRB (TOLAsCV( 19K)» 
* CV( 29K) 9M) 
IF (MeGTeO) GO TO 920 
Ia FINAL BOUNDS AND FUNCTIONS» DO 
47s T = CV(1sK)-TOLA 
CALL TMOFA (TsTMeDTM»M) 


IF (MeGTe0) GO TO 940 


CVO39K) = T 
CV(4eK) = -TM/OTM 
48D T = CV(C1sK)+TOLA 


CALL TMOFA (T»TMyDTMsM) 


IF (MeGTeO) GO TO 950 


CV(59K) = T 
CV(C69K) = -TM/DTM 
GO TO 500 
Cc @ EQUALS ZERO 
490 CVC1sK) = (KeL+1)**2 
CV(29K) = 0200 
CVC3eK) = CVC1sK) 
CVC4seK) = 0200 
CVC59K) = CVC 15K) 
CVC6sK)} = 0600 
509 CONTINUE 
550 RETURN 


C PRINT ERROR MESSAGES 
900 WRITE (69901) K 
901 FORMAT (20HOCRUDE BOUNDS CANNOT» 
* 22H BE LOCATED» NO OUTPUT» 
* 7TH FOR K=12) 
GO TO 930 
919 WRITE (69911) K 
911 FORMAT(20HOERROR IN SUBPROGRAMs 


* 22H TMOFAs VIA SUBPROGRAMs 
* 18H BOUNDS» NO OUTPUT» 
* 7H FOR K#I2) 

GO TO 930 


920 WRITE (69921) K 
921 FORMAT( 2QOHOERROR IN SUBPROGRAM» 


* 22t4 TMOFAs VIA SUBPROGRAMy 
* 18H MFITR@»s NO OUTPUT» 
* 7TH FOR K=12) 
930 J = J-1i 
GO TO 550 


940 WRITE (69941) K 
941 FORMAT(20HCERROR IN SUBPROGRAMs 


* 22H TMOFAs NO LOWER BOUND» 
* TH FOR K=I2) 
CV(30K) = 02600 
CVG4eK) = 0000 
GO TO 480 


950 WRITE (69951) K 
951 FORMAT(20HOERROR IN SUBPROGRAMy» 


* 22H TMOFA»s NO UPPER BOUND>s 
* 7H FOR KsI2) 
CV(59K) = O6D0 
CV(6eK) = 0200 
GO TO 500 


960 WRITE (69961) 
961 FORMAT(20HOQ GIVEN NEGATIVELY os 
* 20H USED ABSOLUTE VALUE) 
Q = -Q 
GO TO 40 
END 


Algorithm 352 (Part B) 
MATH (Mathieu Functions) 


Comments The subroutine MATH com- 
putes various solutions (and their deriva- 
tives), of either Mathieu's differential equa- 
tion or Mathieu’s modified equation, which 
are associated with the characteristic 
values. 

The even periodic solution of equation 
(1) is 


2 
ce, (%,q) = > Aerap cos (Qkh+p)x, (2) 
v==() 


associated with a,(q), and the odd periodic. 
solution is 


be] 
se,(2,q) = Du, Box» sin (2k+p)xr, (3) 


associated with b,(q). The order, 7, is of the 
form 2n + p. The nis a nonnegative integer 
while p = 0 or 1 indicates the solution is of 
period 7 or 27. Calculation of the periodic 
solutions allows the following three options 
of normalization: 

(a) Neutral. We define neutral coeffi- 
cients such that clorip = Aexap/classp, 
where s ischosen so that -42.;, is the numeri- 
cally largest one of the set. The Baz are 
similarly defined. This has the computa- 
tionally convenient effect of making the 
largest coefficient equal to unity, hence all 
calculations are carried out with them. If a 
normalization other than neutral is selected, 
it is effected on the output array F only, the 
coefficients themselves remaining un- 
changed. 

(b) Ince. The normalization adopted in 
[6] is defined so that if y(z,q) represents 
either function (2) or (3) then 


i yr, gdr = x. 
0 


(c) Stratton. As defined in [8], and in 
the notation of [7], this normalization is 
effected so that 


i 


(2h 

Se-(q, 0) = | — So,(q, x) 

dr Nat 

where Se is the even solution and So the odd. 
If we replace x by ¢x in (1), we get 


y” — (a—2q cosh 2x) y = 0, (4) 


known as Mathieu’s modified equation, The 
solutions of (4) have been termed radial in 
{8] and, for characteristic values, can be put 
in the following form, using the notation of 
[4] and [5]: 


Mc (2,q) = 
ve} 


D(H 1)8 Asse Pe +Gal/tecnecesny 
v=() 
associated with a-(q), and 
Ms)? (2,q) = 
2 (6) 


> (—1)"**Barep (Fx — 4]/Bas-p, 
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associated with 6b,(q). The order r equals 
2n + p, as in (2) and (8), and en = 1 if 
m ¥ 0, but e9 = 2. The choice of s is arbitrary 
here, but for numerical purposes we choose 
it in the manner described previously for 
neutral normalization. The coefficients are 
the same as defined in (2) and (3), while F; 
and G, involve the Bessel functions as fol- 
lows: 


F, = Jis (u1) Ae (uz), (7) 
Gi = Ji, +D+8 (u1) Ze (uz), (8) 
uw = qhe7* : ue = gre", 


Zn (u) = Jm(u), ZO (u) = Yn(u). 


The solutions (5)-(6) are said to be of the 
first or second kind depending on whether 
j = lor2in (5)-(8). 

Ixplanation of the arguments: 


XX the given independent variable x 
QQ the given positive parameter q 

R the given order r 

CV the given characteristic value, 


a,(q) or 6,(q) 

SOL given as 1, 2, or 3 according as the 
desired solution is (1) radial of 
the first’ kind, (2) radial of the 
second kind, or (3) periodic 

FNC given as 1, 2, 3, or 4 according as 
the desired solution is (1) asso- 
ciated with b,., (2) associated 
with a,, (3) the derivative of 
solution (1), or (4) the deriva- 
tive of solution (2) 


NORM ss given as 1, 2, or 3 according as the 
desired normalization is (1) de- 
fined as neutral, (2) defined by 
Ince, or (8) defined by Stratton. 
(This argument is decoded only 
if SOL = 3.) 

F the computed three-element array, 
containing: (1) the solution 
value, (2) the series term of 
largest magnitude, and (3) the 
last term included in the summa- 
tion 


K the computed two-element array, 
containing: (1) the index, k, of 
the term in F(2), and (2) the 
index of the term in F(3) 


M the error indicator cell: M = 0 
indicates successful execution of 
subprogram, M = 1 signifies an 
error condition explained by an 
accompanying printed message. 

The accuracy of results (within limits) 
and the speed of convergence may be altered 
by the user. See SUM (Algorithm 352 (Part 

B.2)) for details. 

MATH calls on the subroutines: 

COEF—referred to as Algorithm 352 (Part 

B.1) 
SUM—referred to as Algorithm 352 (Part 
B.2) 


BESSEL—referred to as Algorithm 352 
(Part C) 


CUBROCUTINE MATH (XXsUGek eCV¥elOLe 
+ PHC pNORMoF oF oM) 
C RHEE OKHH EEE ETH 


INTEGER 
* FNCoT oK(Z) oF LALIT oKMAXoL» 
* LL eM oF oIAL 9 MM ao MG M1 9250 
* No NORM oF ok el eslLOL es TYPE 


QOUBLE PRECISION 
AsAB eo CV sLLAST »UMAX oF (3) 9G0 


# JoGeGGeTeTOL eUl SUZ aXoxXXeY 

EXTCRNAL 

* DC eUDC sUUSeULeDPCeuPLe 
PCPS 

COMMON 


J( 250) 9V(256) ou] 9U2 oNoPo ie 
LoXeTol oll eGeOMAXeVLALCTs 
KMAX pKLASTs®OUM1(57U) oh» 
DUM2(6) »>MMeML ABR (2C0) 


xe kK * 


COMMON /MFI1/ 


* Qe TOL sTYPE M1 9 M0 oM2 oF 
M = 0 

IF (SOLe*L Tel eRe 
* SOLeGTe3 eORe 
* FNCeL Tel e0Re 
* FNC eGTe4) GO TO 40C 

A = CV 

Q = QC 

TOL = 1e0-13 

TYPC = 2*MOD(FNCs2)4+MCO(Re2) 
CALL COEF (™) 


IF (M) 4109109420 
10 R/2 
MOD(R»2) 
MM / 2 


iow ot won on 


IF (SOLe¢EQ.3) 
* GO TO (150916091709180)s FNC 


Ul = DSQRT(Q)*DEXP(-X) 
Ue = Q/U1 
LL = L+5+P 


C COMPUTE BESSEL FUNCTIONS 
CALL BESSEL (1leUlsJsLL) 
CALL BESSEL (SOL »U2sY LL) 


nn 


EVALUATE SELECTED FUNCTION 
GO TO (56260970280) 5 FNC 
50 CALL SUM (DS) 


oo TO 300 
60 CALL SUM (DC) 

GS TO 300 
72 CALL SUM (DDS) 

GO TO 300 
80 CALL SUM (DDC) 

G9 TO 390 
150 CALL SUM (PS) 

GO TO 200 
169 CALL SUM (PC) 

GO TO 200 
170 CALL SUM (DPS) 

GO TO 200 


180 CALL SUM (OPC) 
200 IF (NORM-2) 300292105250 


Cc INCE NORMALIZATION 
210 T = AB(1)**2 


IF (TYPEeEQe0) T = T+T 


DO 220 1 = Isl 
T = T+AB(1+1)*#2 
220 CONTINUE 
T DSQRT(T) 
I MO/2 


nou 
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IF (ABC T}ebLTeO0e00) T= -T 


GO TO 300 


Cc STRATTON NORMALIZATION 


250 IF (TYPCeGTel) GO TO 270 


T = ABC 

DO 260 I = 
T = 

260 CONTINUE 


} 
lel 
T+AB(1+1) 
GO TO 300 

270 T = DBLE(FLOAT(P))*AEC]) 

DO 280 I lol 

T T+AB(14+1)* 

* DBLE(FLOAT(2*I1+P)) 

280 CONTINUE 


not 


G/T 
DMAX/T 
DLAST/T 
K(1) KMAX 
K(2) KLAST 
350 RETURN 


300 Fl) 
F(2) 
F(3) 


C PRINT ERROR MESSAGES 
400 WRITE (69401) 
491 FORMAT(18HOSOL CR FNC OUT OF » 
* 17H RANGEs NO OUTPUT) 
GO TO 450 

410 WRITE (69411) 

411 FORMAT(15HOMORE THAN 200 » 
22HCCEFFICIENTS REQUIREC 9» 
* 20H QQ AND R TOO LARGE9s» 

* 10H NO OUTPUT) 


* 


GO TO 450 
420 WRITE (60421) 
421 FORMAT({20HOERROR IN SUBPROGRAMs 


* 22H TMOFAs VIA SUBPROGRAMs 
* 13H COEF» VERIFY» 
* 21H ARGUMENTS» NO OUTPUT) 
450 M = 1 

F(1) = 0600 

F(2) = 0«D0 

F(3) = 0600 

K(1l) = Cc 

K(2) = 0 

GO TO 350 
END 


Algorithm 352 (Part A.1) 
BOUNDS (Crude Bounds) 
(Called by MFCVAL) 


Commenis Thesubroutine BOUNDS deter- 
mines crude upper and lower bounds for the 
Kth characteristic value, K < N. 
Iixplanation of the other arguments: 
APPROX | the first approximation 
TOLA the tolerance determined by 
subroutine MFCVAL 
CV the 6 by N array deseribed in 
subroutine MFCVAL 
N variable dimension of the CY 
array 
MM an indicator cell used to com- 
municate unusual and error 
conditions to — subroutine 
MECVAL 
The output, ao < a < a), Is put into the 
common block labeled MF 2. 
BOUNDS ealls on the subroutine: 
TMOFA—referred to as Algorithm 352 
(Part A.3) , 


SUBROUTINE BOUNDS (K9APPROXs 
: TOLAsCVeNsMM) 
C HAHAHA KRHNHRHEAE HERE 


10 


20 
30 


INTEGER 
* KeKAsMeoMMoN 


DOUBLE PRECISION 
* AsAPPROX sAC sAlsCVeDIM» 
* DOsD1lsQeTMsTOLA 


DIMENSION 
* CV(6eN) 


COMMON /MF1/ 
* Q+sDUMMY (7) 


COMMON /MF2/ 
* AO eAsAl 


KA = 0 
IF (KeEQe1) GO TO 20 
IF (APPROX-CV(19K-1)) 10910920 


AQ 


CV(1sK-1)+12e00 
GO TO 30 
AQ = APPROX 
CALL TMOFA (A0sTMsDTMsM) 
IF (MeGTeO) GO TO 250 
D9 = -TM/DT 


IF (D0) 100300959 


C AC IS LOWER BOUND» 
C SEARCH FOR UPPER BOUND 


50 


60 


Al = A0+D0+e1D0 
CALL TMOFA (AlsTMsDTMoM) 


IF (MeGTed) GO TO 250 
Dl = -TM/DTM 


IF (D1) 2009350560 


AD = Al 
DO = Dl 
KA = KA+] 


IF (KA-4) 5054002400 


C Al IS UPPER BOUND» 
C SEARCH FOR LOWER BOUND 


110 


120 


150 


160 


200 


250 
300 
310 
350 


400 


Al = AO 
Dl = DO 
AO = DMAX1(A14+D1-01D09-26D00*Q) 


IF (KeEQel) GO TO 110 


IF (A9=CVOC19K-1)) 15091509116 


CALL TMCFA (A0sTMsDTMoM) 
IF (MeGTeD) GO TO 250 
DO = -TM/DT™ 
IF (D0) 12093002200 
KA = KA+1 
IF (KA-4&) 100949009400 
KA = KAt] 
IF (KA=4) 16024009400 
AQ = A1L+OMAX1(TOLAsDABS(D1)) 
GO TO 30 
A = ¢500*(A0+D0+A1+01) 
IF (AeLEeAO 2ORe 
* AeGEeAl) A = eSDO*#(AD+A1) 
MM = M 
RETURN 
CVtlsekK) = AO 
CVl2]e0K) = ODO 
M = -]} 
GO TO 250 
CVC1eK) = Al 
GO TO 310 
M = 2 
GO TO 250 
END 
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Algorithm 352 (Part A.2) 
MEFITR8 (Improves Characteristic Value) 
(Called: by MFCVAL) 


Comments Given ayo < a < ai , where ay is a 
lower and a, an upper bound, the subrou- 
tine MFITR8 iterates to the characteristic 
value, replacing one of the bounds with a 
better approximation at each step. The 
process terminates after 40 iterations unless 
one of the following conditions occurs first: 
(1) a— a@ S$ TOLA, (2) a, — a S$ TOLA, or 
(3) | D(a) | < TOLA. See Appendix 3, 
method 2, of [3] for a detailed description of 
this process. 
Explanation of output: 

CV the characteristic value, a 

DCV the function D(a) 

MM san indicator cell used to communi- 
cate an error condition to subrou- 
tine MFCVAL. 

MFITR8 calls on the subroutine: 

TMOFA~—referred to as Algorithm 352 (Part 

A.3) 


SUBROUTINE MFITRE (TOLAsCVsDCVeMM) 
C HERR EEE EERE EE RENE 


INTEGER 
* Me MMoN 


DOUBLE PRECISION 


* AsAO0sAlLsA2sCVeDesDCVeDIMe 
* TMsTOLA 

LOGICAL 

* LAST 


COMMON /MF2/ 


* AO sAsAl 
N = 0 
LAST = «FALSE. 
50 N = N+] 
CALL TMOFA (AsTMsDTMoM) 


IF (MeGTe0) GO TO 400 


D = -TM/OTM 
Cc 1S TOLERANCE MET 
IF (N eEQe 40 eORe 
* A~AO eLEeTOLA OR. 
+ AL-A eLEeTOLA -ORe 
* DABS(D) eLTeTOLA) LAST = eTRUEe 


IF (DBD) 11051009120 


100 CV = A 
OCV = 0200 
GO TO 320 
C REPLACE UPPER BOUND BY A 
110 Al = A 
GO TO 200 
c REPLACE LOWER BOUND BY A 
120 AO = A 
200 A2 = A+D 


IF (LAST) GO TO 300 
TF CA2eGTeADeANDeA2eL TeAl) 


a 60 TO 250 
A = e5DO*(AO+tA1) 
GO TO 50 
250 A = A2 
GO TO 50 


300 ITF (A2eLEeAQeOReA2aGEeAl) 
* GO TO 350 


CALL TMOFA (A2sTMsDTMoM) 


IF (MeGTe0) GO TO 400 


D = -TM/OTM 
cv = A2 
319 DCvV =D 
320 MM = M 
RETURN 
350 CV = A 
GO TO 310 
409 CV = 0-D0 
DCV = 06D0 
GO TO 320 
END 


Algorithm 352 (Part A.3) 
TMOFA (Accuracy Indicator) 
(Called by MFCVAL, BOUNDS, MFITR8 
and COEF) 


Comments The subroutine TMOFA evalu- 

ates the function 7'»(a) and its derivative 

dT m(a)/da. See [3] for the definitions, theo- 

rems, and numerical methods relating to the 

computation of these quantities. 
Explanation of the arguments: 

ALFA the given argument, a 


T™ T'm(a) 
DIM dT,n(a)/da 
ND internal error indicator cell 


TMOFA calls no other subprograms. 


SUBROUTINE TMOFA (ALFAsTM»DTMoeND) 


(e MRRHEHRREHRHKRAHER 

INTEGER 
* KeKK oKTolL oMF oMOoMi sM2Se 
* ND es TYPE 

DOUBLE PRECISION 
* AsAAsALFAsBsDGeDTMeDTYPEs 
* F eFLeGeH( 200) sHPsQeQINVs 
* Q1lsQ2eToTMsTOL sTT»V 

Cc OMMON 
* G620092) sDG( 200021 sAAs 

* A(3)2B(3) sSOTYPEsQINVeQ]> 
* Q2eTeTTsKsb oKKoKT 
COMMON /MF1/ 

* Qs TOL »sTYPE»M] »MO0 oM2S 9 MF 
EQUI VALENCE 

* CH(1)9GC191))9(Q1l»HP)» 

* (Q29F) 

DATA FL /1¢eD+30/ 


C STATEMENT FUNCTION 
VK) = (AA-DBLECFLOAT(K))##2)/Q 


ND = 0 

KT = 0 

AA = ALFA 

OTYPE = TYPE 

QINV = 1-D0/Q 

00 10 L = le2 
DO 5 K = 19200 


DG(Ksl) = 0200 
5 CONTINUE 
10 CONTINUE 


IF (MOD(TYPEs2}) 20930920 


20 MO = 3 
GC TO 40 
30 MO = TYPE+2 
40 K = «5D0*#DSQRT(DMAX1 ( 
* 3eDO#Q+AA502D0)) 
M2S = MINO(2#K+M0+4,5 


* 398+MOD(MO0e2)) 
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EVALUATION OF THE TAIL OF A 
CONTINUED FRACTION 


Al) = 12D0 
A(2) = V(M2S+2) 
B(1) = v(M2S) 
B(2) = A(2)#B(1)=-1.00 
Ql = A(2)/B(2) 
00 5C K = 1%200 
MF = M2S+2+2*K 
T = V(MF) 
A(3) = TRAC2)-A(1) 
B(3) = T¥Kb(2)-BC(1) 
Q2 = A(3)/B(3) 
IF (DABS(Q1-Q2)-eLTseTOL) 
* GO TO 70 
Ql = Q2 
A(1l) = AC2) 
A(2) = A(3) 
Bil) = Bl2) 
B(2) = B(3) 
50 CONTINUE 
KT = 1 
70 T = 1eD0/T 
TT = -TeTRQINV 
L = MF=M2S 
DO 80 K = 2ele2 
T = 1eDO/(VIMF-K)-T) 
TT = THTR(TT-QINV) 
80 CONTINUE 
KK = M2S/2+1 


IF (KTeEQel) Q2 = T 


G(KKe2) = e¢5D0*(Q2tT) 
OG(KKs2)= TT 


STAGE 1 
Gl2e1) = 1-D0 
DO 140 K = MOsM2S92 
KK = K/2+1 
IF (KoLlLTe5)} 

* IF (K=3) 1001103120 
G(KKel) = VEK=2)-1-¢D0/G(KK=191) 
DG(KKol)= QINV4+DG(KK-191)7 

* G(KK-ls1l)#*2 

S60 TO 130 
100 G(2e1l1) = VO) 
OGl2s1)} = QINV 
GO TO 130 
110 G(20l) = vt1)+OTYPE-2-D0 
DG(2s1) = QINV 
GO TO 139 
120 G(3el) = VO2)4+(DTYPE-22eD0)/ 

* Gl291) 

DG(3e1l) = QINV+(2eD0-DTYPE) * 

* DG(2o1)/Gl251)**2 
IF (TYPEc*EQe2) Gl251) = OeDO 

130 IF (DABS(G(KK91l)) el TelelO) 

* GO TO 200 


140 CONTINUE 


BACKTRACK 
™™M = G(KK92)-G(KKo9l) 
DIM ® DG(KK»s2)-DG(KK»s1) 
M1 = M2S 
KT = M2S=MO 
DO 180 L = 29KTo2 
K = M2S<L 
KK = K/2+1 
GiKK92) = LeDO/(IVIKI-GIKK+192)) 


OG(KK92)= -GIKK 92) ##2% 
* (QINV~DG(KK+192)) 


IF (K-2) 15091509160 


150 G(292) = 2eD0*G( 292) 
DG(292) = 2eDO*DGI 292) 

160 T = G(KKs2)-G(KKol) 
IF (DABS(T)-DABS(TM)) 

bas 1705180180 

170 ™ = T 
DTM = DG(KKe2)-DG(KKol) 
M1 = K 


180 CONTINUE 
GO TO 329 


ig STAGE 2 


20C Ml = K 
K = M2S 
KK = K/2+1 


210 IF (KeEQeM)) 


* IF (K-2) 3009300531C 
K = K-2 
KK = KK=1 
T = V(K)“GUKK+152) 


IF (DABS(T)=12eD0) 25022209220 


220 G(KKs2) = 1¢D0/T 
DG(KKo2)= (DG(KK+192)-QINV) /T#*#2 
GO TO 210 


¢ STAGE 3 
250 IF (KeEQeM1) IF (1) 22092909220 


HP = DG(KK+192)-GINV 
260 G(KKy2) = FL 

H(KK) = TO 

K = K-2 

KK = “KK=1 

F = V(K)#T=14D0 


IF (KeEQeM1) IF (F) 28092902280 


IF (DABS(F)-DABS(T)) 27092802280 


27C HP = HP/T##*2-QINV 
T = F/T 
GO TO 260 
280 G(KKs2) = T/F 
DG(KK92)= (HP-QINVET#T) /FR#2 
GO TO 210 
299 ND =) 
GO TO 320 
C CHAINING M EQUALS 2 
300 Gl292) = 2eD0*G1292) 
DG(292) = 26DO*DG(202) 
310 ™ = G(KKe2)-G(KKs1) 
D™ = DG(KKs2)-DG(KKol) 
320 RETURN 
ENC 


Algorithm 352 (Part B.1) 
COEF (Coefficients) 
(Called by MATH) 


Comments The subroutine COEK com- 
putes the neutral coefficients, as defined in 
the Comments of Algorithm 352 (Part B), 
and returns them via common array AB. 
Argument M is an internal error indicator 
cell. For details of the method used, see Ap- 
pendix 6 of [8]. COEF ealls on thesubroutine: 
TMOFA—referred to as Algorithm 352 
(Part A.3) 


SUBROUTINE COEF (M) 


Cc HH eH EE 
INTEGER 
* KeKAsKBoKK oMoMF oML oMMy 
* MO sM1l sM2Se TYPE 
DOUBLE PRECISION 
Sd AsABoFLeGsH( 200) sQoTs 
* TOL »VsV2 
Cc OMMON 
* G( 20092) sDUM1 (800) sArToKs 
# KA »KB KK »MMeML sAB( 200) 


COMMON /MF1/ 
* Qs TOL » TYPE sM1] M0 9M2S MF 


EQUIVALENCE 
* CHL) sGllel)) 


DATA FlLeV2/1eD+3091eD-15/ 
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STATEMENT FUNCTION 
VOK) = (CA~DBLE( FLOAT(K))*#2)/0 


CALL TMOFA (AoToToM) 
IF (MeNEe0) GO TO 300 
DO 60 K = 19200 


AB(K) = 0-D0 
60 CONTINUE 


KA = M1=M0+2 
DO 90 K = 29KAs2 
KK = (M1—K)/2+1 


IF (K=2) 70570s80 


70 AB(KK) # 12DC6 

GO TO 90 
80 AB(KK) 2 AB(KK4+1)/G(KK+191) 
90 CONTINUE 


KA = 0 

DO 130 K = M1leM2Se2 
KK = K/2+) 
ML = K 


IF (G(KK92)e¢EQeFL) GO TO 100 


AB(KK) = AB(KK~1)#G(KK92) 


GO TO 110 
100 T 


AB(KK=-2) 
IF (Ke EQe4eANDoMlLeEQe2) T = T+T 
AB(KK) = T/0V(K-2) #H( KK) —12eD0) 


210 IF (DABSCAB(KK) )eGEole0-17) 
* KA = 0 


IF (KAeEQe5) GO TO 260 


KA = KA+1 
130 CONTINUE 
T = DLOG(DABS(AB(KK))/V2)/ 
* DLOG(1eD0/DABS(G(KK92))) 
KA = 2¥IDINTI(T) 
ML = KA+24M25 


IF (ML eGTe399) GO TO 400 


KB = KA+2+MF 
T = 12¢00/V(KB) 
KK = MF-M2S 


DO 150 K ® 29KKos2 
T 2 1-D0/(V(KB-K)-T) 
150 CONTINUE 
KK = ML/2+1 
G(KKs2) = T 
oO 200 K = 29KAe2 


KK = (ML-K)/2+1 
G(KKs2) = 1eD0/(V(ML-K)- 
* G(KK+192)) 
200 CONTINUE 
KA = M2S+2 
DO 250 K = KAeMLe2 
KK = K/2+) 


AB(KK) 2 AB(KK=-1)#G(KKs2) 
259 CONTINUE 


NEUTRAL NORMALIZATION 


260 T = ABCL) 
MM = MOD(TYPEs2) 
KA = MM+2 
DO 280 K @ KAeMLo2 
KK = K/2+1 
IF (DABS(T)-DABS(AB(KK))) 
* 27002809280 
270 T = AB(KK) 
MM = K 


280 CONTINUE 

90 290 K &= 19KK 

AB(K) = AB(KI/T 
299% CONTINUE 
300 RETURN 
400 M = =} 
GO TO 300 
END 


Algorithm 352 (Part B.2) 
SUM (Series Evaluation) 
(Called by MATH) 


Comments The subroutine SUM performs 
the summation, truncating the series when 
the magnitude of two successive terms, rela- 
tive to the magnitude of the largest term, is 
less than or equal to 107". 

If the user is willing to accept reduced ac- 
curacy, he may save some computing time 
by making this tolerance larger. On the 
other hand, however, a smaller tolerance 
will not necessarily increase the accuracy, 
since on a machine using 16-digit arithmetic 
the sum will be, at best, goud to 16 digits. 

The particular series being evaluated is 
determined by the arguments SOL and FNC 
within subroutine MATH and communi- 
cated to this subroutine via argument DUM. 

Output is returned via common: varia- 
bles F, DMAX, DLAST, NMAN, and 
KLAST. 

SUM ealls on one of the functions of 
Algorithm 352 (Part B.2.1). 


SUBROUTINE SUM (0UM) 


Cc SHEBPRHEKHREEEE 

INTEGER 
* Ke KLAST sKMAX el oS 

DOUBLE PRECISION 
* DLAST »DMAX sDUMoF o T 

Cc OMMON 
* DUM1 (1006) »SelL sDUM2(6) oF » 
* DMAX » DLAST sKMAX sKLAST oT 
K = 0 

F =z DUM(0O) 

DMAX = F 

T = DABS(F) 

KMAX = 0 


0O 30 KLAST = let 
DLAST = DUM(KLAST) 
F = F+DLAST 


IF (T-DABS(DLAST)} 10910920 


10 DMAX = DLAST 
T = DAGS(DMAX) 
KMAX = KLAST 
2c IF (KLASTeLE*eS) GO TO 30 
IF (DABS(DLAST)/TeGTeleD-13) 
* XK = O 
K = K+] 


IF (KeEQe3) GO TO 49 


3° CONTINUE 
KLAST = L 
40 RETURN 


END 


Algorithm 352 (Part C) 
BESSEL (Bessel Functions)? 
(Called by MATH) 


Comments The subroutine BESSEL eval- 
uates Bessel functions of the first or second 
kind, according as the argument SOL = lor 
2, of orders0,1,-+-: ,n and argument wu, both 
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COLLECTED ALGORITHMS (cont.) 


of which must be nonnegative. Functions of 
order zero and one are always evaluated, re- 
gardless of the value of m. Results are re- 
turned via array JY, with element JY (K) 
containing the function of order K-1. 

It should be noted that for SOL = 2 and 
u = 0, a large negative constant (— 1037) is 
returned as the function value for all orders 
and no warning is given. 

Different methods of computation are 
used for Jo(u), Ji(u), Yo(w), and Yi(u), 
depending upon whether u < 8, or not. (See 
subroutines JOJ1, YOY1, and LUKE for 
details.) The Ja(u), n = 2, 3,°°+ , m, are 
computed by means of a continued fraction 
(see subroutine JNS), whereas the Y,(w) 


for corresponding orders are calculated di- 
rectly from the recurrence relation: 


Vuitas = i = Vesa) 


BESSEL ealls on the subroutines: 
J0J1—referred to as Algorithm 352 (Part 
oa Sere to as Algorithm 352 (Part 
a cee to as Algorithm 352 (Part 
INS refone to as Algorithm 352 (Part 

4 


SUBROUTINE BESSEL (SOLeUsJYeN) 
c eee ec ere reer er ss 


INTEGER 
* NoNNsSOL 


DOUBLE PRECISION 
* J7€250) 9U 


NN = MINOINS249) 


IF (UsEQeDeD0eANDe SOL eEQe2) 
* GO TO 80 


IF (UseGEe8eD0) GO TO 30 


GO TO (10920)+ SOL 
10 CALL JOJ1 (UsJY) 


GO TO 40 
20 CALL YOY] (UsJY) 
GO TO 40 
30 CALL LUKE (UsSOLeJY) 
40 ITF (NeLTe2) GO TO 100 
GO TO (50960)»% SOL 
50 CALL JNS (JY9UsNN) 
GO TO 100 
C RECURRENCE FORMULA 
60 DO 70 K = 25NN 
JY(K+1) = 2000 
* DBLE( FLOAT (K=-1) }* 
* JYCK)/USSY(K=1) 
79 CONTINUE 
GO TO 100 
80 NN = NA+] 
DO 90 K = 1»NN 
JY(K) = 100437 
90 CONTINUE 
100 RETURN 


END 


‘ This subroutine (together with its subsidiary routines) 
may be removed in toto, with no changes, and used inde- 
pendently as a Bessel function algorithm. The results are 
good to 14 significant digits or decimal places, whichever 
is least accurate, with an error of no more than one unit in 
the last digit or place. 


Algorithm 352 (Part C.1) 
JOJ1 (First Kind) 
(Called by BESSEL) 


Comments The subroutine JOJ1 computes 
the Bessel functions of the first kind, Jo(zx) 
and Ji (2), for x < 8. This is done by evalu- 
ating formula 9.1.10 of [1]. The results are 
returned via array J. 

JOJ1 calls no other subprograms. 


SUBROUTINE JOU1 (Xed) 
¢ FETT ET 


DOUBLE PRECISION -" 


* J(2)9T(5) ox 

Cc OMMON 

* DUM(1014) »T 

TC1l) = X/2200 

JCl) = 1200 
J(2) = TO1) 

TO2) = =-T(1L)**2 

T(3) = 1200 

T(4) = 1400 

10 T(4) = TOE4)¥*TC2)7TC3) #2 

JO1L) = JO1L)4+TO4) 

T(5) = T(4)*T01)/06T03)41200) 
J(2) = J(2)4+T(5) 


IF (DMAX1(DABS(T(4)) sDA8S(T(5))) 
. el TeleD-15) RETURN 


T(3) = TO3)+1¢D0 
GO TO 10 
END 


Algorithm 352 (Part C.2) 
YOY1 (Second Kind) 
(Called by BESSEL) 


Comments The subroutine YOY1 computes 
the Bessel functions of the second kind, 
Yo(x) and Y,(x), for x < 8. This is done by 
evaluating formulas 9.1.13 and 9.1.11 of {1]. 
The results are returned via array Y. 

YOY1 calls no other subprograms. 


SUBROUTINE YOY] (X»sY) 
C HERE RE 


DOUBLE PRECISION 


* T(10) 9X9¥(2) 
Cc OMMON 
* DUM(1014) oT 
TO1) = X42200 
T(]2]) = -TC1)##2 
Y(1) = 1-00 
YC2) = TO)) 
T(7) = 0200 
T(10)= -T(1) 
T(3) = 0200 
T(4) = 0200 
T(5) = 1400 
10 T(3) = T(3)+1-D0 
T(4) = T064)4+1e¢D0/T(3) 
TOS) = TE5)*TO2)/T03) ##2 
YOL) = Y¥C1)4TC5) 
T(6) 3 -T(5)*T(4) 
T(7) = TCT)4+T(6) 
T(8) = TO5)*T(1)70T03)41000) 
YC2) = YC2)+T(8) 
TCD) = -T(BI*(2]eDO*T(4)+ 
® 12¢00/(T(3)+1.200)) 


TC1O0)= TC1LO)+T(9) 
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IF (OMAX1T(NABS(T(6))sDABS(T(9}9) 


* eGEeleD-15) GO TO 1C 

T(2) = 657721566490153¢E0D0E0+ 

* DLIG(TCL)} 

YOL) = ©€6366197723675813400* 

* CYCLIETOC2]V+TO7)) 

YC2) = ©63661977236758134D0* 

* CYC2PETC 22-1 eDO0/X I +T( 197 

* 321415926535897932D0 
RETURN 

END 


Algorithm 352 (Part C.3) 
LUKE 
(Called by BESSEL) 


Comments The subroutine LUKE evalu- 
ates Bessel functions of order zero and one, 
of the first or second kind, according as the 
argument KIND = 1 or 2, for u 2 8. The re- 
sults are returned via the 2-element array 
JY. 

The Bessel function of the third kind 
(Hankel function), H{?(u) = J,(w) + 
7Y,(u), can be expressed in terms of the 
Chebyshev polynomials, T,* (x), as follows: 


ON? Gf o2h = 
HYD = (=) if 2 4 
ru 
(9) 
2 ax” + iph”)Ti*(R/u). 
=O 

We now define ag” = Ari, BO = Bay, 
ak = Cry, BE? = Diar, © = R/u, and 
T.*(@) = Geyi(x). The recurrence relations 

for the G;(x) are as follows: 


G(x) 
Gi (2) 


il 


1, Gi(%) = 2x2 — 1, 
(42—2) Gei(z) ~ Ge_2(zx), 
k 2 3. 


il 


If we let » = 0 and make other appropriate 
substitutions in (9), while remembering that 
e? = cos 6+ 7Sin 6, we can separate the real 
and imaginary parts and get the following 
relations: 


S(O) = (2) 
TU 


cos 6 >, Ax Gk(z) — sin @ >> Be G(x) 
k=l k=1 


3 
You) = (2) 
TU 


cos 6 >, Bi Gi(z) + sin 6 >> Ax Gila) 
k=1 


’ 
ke] 


where 6 = u— 7/4. 

Notice that if » = 1 in (9), then @ is re- 
placed by 6 — 7/2. Also, cos (@—7/2) = 
sin 6 and sin (@—27/2) = —cos 6. There- 
fore, proceeding as before, we get 


be 2\3 
Oe (2) 
TU 


sin 6 > Cu Gilz) + cos 6 >) Di G(x) 
k=1 


kak 


2\' 
ron = (2) 
TU 


sin 0 >. Dk Gi(x) — cos 6 > Cr G(x) 
k= 5 k=1 


The coefficients 4, B, C, and D nave been 
computed for R = 8 in eq. (9) and are guar- 
auteed to the number of digits given. 

LUKE ealls no other subprograms. 


SUBROUTINE LUKE (UsKINDoJY) 
C TEA JE TRAE TS a aE Ee 


INTEGER 
* Ks KIND 


DOUBLE PRECISION 

* A(19)9B(19) sCSeC(19)» 
* D(19) eGl3esJV(2)sR(2)5 
* S02) 9SNoT 9UoX 


C CMMON 
* DUM( 1014) sReSsGerXoTsSNeCsS 


WARNING = THE FOLLOWING DATA 
STATEMENTS ARE NOT IN ASA 
STANDARD FORTRAN 


NON 


DATA A / 

0999595064 768672b7T416D0 
—0538079561396069130-3» 
—013179677123361570D—-3» 
© 1514224970486440—-% 9 

2 15846861792063D-64 
©856069553946D-8 

© 29572343355D-9» 
©6573556254D-10» 
0223749703D0-11> 
°©44821140D-125 
©69548270-13>s 
0151340D-~14,5 
©924220-15> 
e15558D-15>% 

4760-17» 

2740-175 

«610-18» 

e4D-195 

e1D-19/ 


xe Ke ROR OR KK RR Kee ek Rm 
| 


DATA B / 
0 7769355694205321360-2» 
— 2 77480323096544767()D—25 
© 2536541165430796D-4» 
© 3942735983997T11D0—5>» 
0107234982991 29D—6> 
© 721389799328D0-8» 
© 73764602893D—%5 
e1506878110-11s 
0©574589537D-11>s 
°©459965740-12>5 
022703230-13,5 
¢887890D-14> 
. 74497D=-15 ’ 
e5847D-1lb»s 
«24100-16» 
© 2650-17» 
© 130-18» 
2100-18» 
020-197 


RK ROR RR OR ROR OR RRR OK A ROOK 
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COLLECTED ALGORITHMS (cont.) 


DATA C / 


x eR Ke 


1690006775 358659134623400» 
©90100725195908183D-3» 
02217243491 8599454D—-3 9 

—01965759463191040-5» 

—©20889531143270D—6% 

©1028144350894D-7>5 

©375970547890-9+5 

© 7638891358D-10» 

©238734670D-1l» 

©51825489D0—12, 

© 76939690-13, 

°©144008D0=-14, 

0103294D0-14, 
©168210-15% 

4590-17» 

©302D-17> 

2650-18» 

e4D-19> 

e1D-19/ 


1 


DATA D / 


xe eR OK ee RR KE HR KH EK HK HK 


x 
G1) 
G2) 
R(1) 
£()) 
R(2) 
£(2) 
90 10 K 
G(3) 
R(1) 
$1) 
R(2) 
$(2) 
G1) 


un # tf uw ow tt 


Gi 
19 CONTI 


SN 
cs 


GO TO 


20 JY(1) 
JY(2) 


30 JY(1) 
JY(2) 
40 
END 


©23376829986285303280-1» 
© 233468012232545575330—-1 >,» 
© 3576010590901 3820-45 
© 560863149492627D-5» 
0132738940843240D—65 
*e9169758450660-8 » 
—286838880371D—-%>5 
-23780730050-11l» 
©663145586D-11l» 
= e50584390D-12, 
~227207820-13>5 
©985381D-1L4d»s 
—079398D—15> 
~e67570-16» 
o2625D-1b6s 
—2280D-17>9 
—e1l50-18>» 
210D-18>5 
ee 2D-19/ 


8eDO/U 

1-D0 
2eDO0#*X=-1-00 
AC1)+A(2)*G6(2) 
BC 1l)+802)*G(2) 
COLI+FC(22*G602) 
D(1)+002)*G(2) 


3919 
(4eD0*X=2.D0)*G(2)-Gt)) 
RCLIFAUK)*GO3) 
S(1)+B(K)*G(3) 
R(2)4+C(K)*G(3) 
§(2)4D(K)*G(3) 

G(2) 


iow wo wo oma 


2) = G3) 
NUE 


© 7978845608028654D0/DSQRT(U) 
DSIN(U-« 78539816339 74483D0) 
DCOS (Ue 785398163397448300) 


(20930)»% KIND 
T#(R(1)*#CS-SCL}*SN) 
T#(RE2])*SNt+S(2)*CS) 

GO TO 40 
T#(SC1)*CStR01)*SN) 
T#(SC2)*SN-R(C2)*CS) 

RETURN 


Algorithm 352 (Part C.4) 


JNS 


(Called by BESSEL) 


Comments 


Jo(u) and 
tion, 
Jn +1 (u) 


The subroutine JNS evaluates 
Bessel functions of the first kind, of orders 
n = 2, 3,°°: , m, for argument u, given 
J(u). From the definition 
Gr = Jn(u)/Jn1(w) and the recurrence rela- 


= (2n/u) Jn(u) — Jailu), 


we can derive the following equation: 


G, = ————-... 
Qn (10) 
alts n+l 
Uu 


Since G, 1 is of the same form as G, , we can 
continue the process and obtain the con- 
tinued fraction, 


oe ae 1 


2n _ 2m + 1) 
u uw 
(11) 
1 


2(n + k) 


Ons A, Gnikt 


Uu 


Gm is evaluated using (11), then the other 


G, are computed from (10) for n= m — 1, 

m—2,:++ , 2. Finally, the J, are evaluated in 

a forward direction from J, = Gadn-1 and 

returned via argument array JJ. See [2] for 

a more detailed treatment of this process. 
JNS calls no other subprograms. 


SUBROUTINE JNS (JJeUeM) 
C HREM H HEHE 


INTEGER 
* KoKAsKKoM 


DOUBLE PRECISION 


* A»sBsD(2) 2DMsG( 249)» 
* JJ(250) 9 P (3) 9013) 9U 
EQUIVALENCE 
* {AsG) e(DsGl2)}5 
* (PeG(4))9(QsG(7))>» 
* (DM»9G(10))0(BsG(11)) 
Cc OMMON 
* DUM( 1014) sGoMsKsKKaKA 
DM = 2*M 
P(1) = 0.00 
@(1) = 1-60 
P(2) = 1400 
Q(2) = DM/U 
D(1) = P(2)/Q02) 
A = 2200 
10 8B = (DM+A)/U 
P(3) = B*eP(2)-P(1) 
Q(3) = B¥*Q@(2)-Q(1) 
O(2) = P(3)/Q(3) 
IF (DOABS(0(1)-D(2)) 
* eL TeleD-15) GO TO 20 
P(1l) = P(2) 
P(2) = P(32) 
@(1) = Q(2) 
Q(2) = Qt3) 
O(1) = O02) 
A = A+2.00 
GO TO 10 
20 G(M) = D2) 
KA = M=2 
DO 30 K = 19KA 
KK = M=K 
A = 2*KK 
G(KK) = U/(CA=U*G(KK+1)) 
IF (G(KK) eEQe0eD0) 
* G(KK) = 1¢0~35 
30 CONTINUE 
DO 40 K = 29M 
JJ(K+1) = GOK #FIU(K) 


40 CONTINUE 
RETURN 
END 
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COLLECTED ALGORITHMS (cont.) 


Algorithm 352 (Part B.2.1) 
DS, DC, DDS, DDC, PS, PC, DPS, DPC 
(Called by MATH via SUM) 


Comments The following collection of fune- 
tion subprograms is utilized by SUM to eval- 
uate the kth term (k = 0,1,°-: ) of one of 
the following: eq. (2), (3), (5), (6), or their 
derivatives. 

DS and DC eall on functions FJ and FY. 

DDS and DDC eall on functions FJ, FY, 
DJ and DY. 

PS, PC, DPS, and IPC eall no other sub- 
programs. 


DOUBLE PRECISION FUNCTION DS(KK) 


IRE RRR RSESES ER ESSE SESS SE SE 


ay 


INTEGER 
* KeKK eNoNloN2eP os 


QOUBLE PRECISION 


* AByFUSFY 

C OMMON 

* DUM1 61004) eNePsSeDUM2(17)s 
* KoN] oN22DUM3( 583) 2AB( 000) 


C EVALUATES ONE TERM OF THE RADIAL 
¥ SOLUTION» ASSOCIATED WITH B(Q) 


K = KK 

NI = K=S 

No = K+S4+P 

DS = AB(K+])4(FJONLI*FFY(N2)— 
* FUINZ)*FY(N1)) 


IF (MOD(K+Nec) eNEeO} OS = -DS 


RETURN 
END 


DOUBLE PRECISION FUNCTION OC(KK) 


RRFKHHRHRREKHRERAHRRRAREEE 


INTEGER 
* KoKKaNoNIoN2aP 55S 


DOUSLE PRECISION 


* ABsFUsFY 

C OMMCN 
* DUM1 (1004) sNePeSeDUM2(17)s 
* Ke Nl oN2sDUM3(583) sAB( 200) 


EVALUATES ONE TERM OF THE RADIAL 
SQLUTIONs ASSOCIATED WITH 4(Q) 


eairy 


K = KK 
Nl = <=S 

NC K+#S+4P 

DC = AB(KHLIFCFUCNT) FFYON2)+ 
* FI(N2)*©Y(N1)) 
IF (MOC(K+Ne2)eNESC) CC = -DC 


TF (S+#%eEQed) DC = «5DC*#DC 


END 


DOLBLE FRECISION FUNCTION DDS(KK) 


ORT OL TET te RO te EE RS 


INTEGER 
* KoKK sNoNlohosF oS 


COUBLE PRECISION 


* AL sDIsDY oF IS sF Yel eued 

¢ OMMON 

* DUMLELOUC) pUisUSaNsPoue 
* OUMZ (17) sKeNl Neo 

* DUMN3(383) sABCcOC) 


EVALUATLS ONE TERM OF THE OERIVATIVE 
OF THE RADIAL SOLUTIONs 
ASSOCITATE™ WITH 3200) 


Voy ey 


ryrveys 


OV 


ah 3 


fay 


an 


K = Ki 

Ni = X-S 

N2@ = K+S+P 

DOS = ABC K+1) *#(UZ*E(FICNTD#OYING 
* FIOCN2)*#DY(NI)I-“UL¥CEVOND)* 
* DJCNLIMFY(N1) #DUIN2))) 

[F ("OV(K+Ne 2) eNEoO) DOS = =ODS 

RL TURN 

END 


DOUBLE PRECISION FUNCTION DOC(KK) 


STM MAH TE RE OE et 


INTESER 
* KeKKoNeNloN2ePor 


i CUBLF PRECISION 


* ABDI eDY oFUsFY eUl nU2 
<OMMON 

% DUM1L (1000) sUlsU2sNePobs 

% DUM2(17) »KoNlsN2s 

* OUM3(583) sABI(Z0C) 


EVALUATES ONE TERM OF THE DERIVATIVE 


OF THE RADIAL SOLUTION» 


ASSOCIATED wIiTH Al(G) 


K = KK 

Nl = K-S 

NZ = K+S+P 

DDC = AB(K+lLI#(USH(CFICNIDHDY(NE) + 
* FJ(N2)*DY(N1))-UL*¥(FYCN2)* 
* DICNLTIFFYOCNT) #OU(NZ1)) 

IF (MOC(K+Ne2) eNEeC) DDC = ~DDC 


IF (S+PeEQed) DDC = 6 5DC800C 


RETURN 
ENC 


QCUBLE PRECISION FUNCTION PStK) 


HRB ERATE RH PREER RR HEE 


INTEGER 


x K9P 


YOUBLE PRECISION 


7 ABoX 

C GMMON 
x DUM1(1065) »sPsDUM2(2) 9X 
* DUM3 (600) sAB( 200) 


EVALUATES ONC TERM OF THE ODD 
PEPTOCIC SOLUTION 
PS = AB(K+1)* 
® DSIN(DBLE(FLOAT(2*K+P))*X) 
RETURN 
END 


{PCUBLE PRECISION FUNCTION PCI(X) 
MR RO 


INTEGER 
x KeP 


DOUBLE PRECISION 


* AB »X 

7 OMMON 

x DNUM161005) »P ¥DUM2(2) 9X» 
* DUM3 (600) »AB( 200) 


EVALUATES ONE TERM OF THE EVEN 
PERIODIC SOLUTION 
PC = AB(K+1)* 
= DCOS{(OBLE( FLOAT (2*K+P))*X) 
RETURN 


DOVSLE PRECISION FUNCTIUN UPSIK) 
ALE MK KE 
TNTEGER 
¥ Kab, 


COLLECTED ALGORITHMS (cont.) 


DOUBLE PRECISION 
# ABoT oX 


c OMMON 
: DUM1 (1005) sPeDUH2(2) 9X» 
DUM2 (14) 9TsDUM4(584) » 
AB(200) 


wo 


EVALUATES ONE TERM OF THE DERIVATIVE 
OF THE ODD PERIODIC SOLUTION © 

2*K+P 

ABI K4+1)#T#DCOS( TEX) 


On 


oH 


y 


DPS 
RETURN 
ND 


SOUULE PRECISION FUNCTION DPC(K) 
Cece ce ee ee 2 


ri 


INTEGER 
* KsP 


DOLELE PRICISICON 
’ ABsToX 


2 UMMON 
DUM1 (1005) 9PsOUM2(2) 9X» 
OU%3 (14) eT »DUMG (564) » 
AB(230) 


“area 


© EVALUATES ONE TERM OF THE DERIVATIVE 
‘ CF THE EVEN Pi. RTOUVIC SOLUTION 


T = 2*K+p 

DPC = -AL(K+1)#T#OSIN(T#X) 
RETURN 

END 


Algorithm 352 (Part B.2.2) 

FJ, FY, DJ, DY (Bessel Functions and De- 
rivatives) 
(Called by DS, DC, DDS, DDC) 


Comments The following collection of fune- 
tionsubprograms produces Bessel functions 
or their derivatives for integer order n, x 
being positive or negative. This is accom- 
plished by using the already computed fune- 
tions of nonnegative order (Algorithm 352 
(Part C)) and substituting them in one of 
the following formulas: 


Jin) = (—1)"J,(u), 
Y_,.00 = (—1)"¥,(w), 


J,’ (a) = i J (a) = J nail), 
UU 


IS Lawes YQ), 
U 


whichever is appropriate. 
DJ calls on function FJ. 
DY calls on function FY. 
FJ and FY call no other subprograms. 


DOUBLE PRECISION FUNCTION FJ(N) 
C OK HER GE eR RH 


INTEGER 
* KoN 


NOUBLE PRECISION 
® J 


Cc OMMON 
* J(250) »DUM(527) 9K 


Cc PRODUCES BESSEL FUNCTIONS 
C OF THE FIRST KIND 
kK = TABS(N) 


nn 
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IF (KeGle250) GO TO 20 
FJ = J(K+1) 
IF (MOD(Ne2)elL Te) FU = ~-FU 


10 RETURN 


20 FI = 02D0 
WRITE (6999) N 
99 FORMAT(2HOJI397H NEEDED) 
GO TO 10 


END 


DOUBLE PRECISION FUNCTION FYC(N) 
Petrie r it rrer rr er ee re ES 


INTEGER 
* KoN 


DOUBLE PRECISION 
* Y 


Cc OMMON 
* DUM1 (500) »Y (250) sDUM2 (27) 9K 


PROCUCES BESSEL FUNCTIONS 
OF THE SECOND KIND 

K = [TABS(N) 

IF (KeG£e250) GO TO 20 

FY = Y(K41) 

IF (MOD(Ne2)eLT*eO) FY = -FY 
10 RETURN 
20 FY = 06D0 

WRITE (6999) N 


99 FORMAT (2HOY13»97H NEEDED) 
GO TO 10 


END 


DOUBLE PRECISION FUNCTION DJIN} 
Pere rer ere rere re ee ee ee 


INTEGER 
* N 

DOUBLE PRECISION 
* FUsFNsUL 

Cc OMMON 

* DUM1 (1000) »U1 »DUM2(26) 9 FN 


DERIVATIVES OF BESSEL FUNCTIONS 
OF THE FIRST KIND 


FN = N 
TF (N=249)- 10920940 
10 OJ = FN¥FICNI/UIL-FU(N4+1) 
GO TO 3¢ 


20 DJ FIUCN=])-FN®F SOND JUL 
30 RETURN 


40 DJ.= O-DC 
WRITE (6999) N 
99 FORMAT(3HOJ@I357H NEEDED) 
GO TO 30 
END 


DOUBLE PRECISION FUNCTION DY(N) 
SE SEER MEE ETE A IE IEA AE SEI IEE FEE EE ee 


INTEGER 
* N 


DOUBLE PRECISION 
* FNeFY U2 


Cc OMMON 
* DUM1 (1002) »U2sDUM2(24) 9 FN 


DERIVATIVES OF BESSEL FUNCTIONS 
OF THE SECOND KIND 


IF (NeGle250) GO TO 20 


COLLECTED ALGORITHMS (cont.) 


FN = N 
DY FYCN-1)-FN®FYON) JU2 
10 RETURN 


29 DY = 06bD0 
WRITE (6999) N 
99 FORMAT (3HOY@1397H NEEDED) 
GO TO 1¢ 


END 


REMARK ON ALGORITHM 352 [S822] 

CHARACTERISTIC VALUES AND ASSOCIATED 
SOLUTIONS OF MATHIEU’S DIFFERENTIAL 
EQUATION [D. 8. Clemm, Comm. ACM 12 (July 
1969), 399-407] 

ARTHUR H. J. SALE (Recd. 4 May 1970 and 28 May 1970) 

University of Sydney, Sydney, NSW, Australia 


KEY WORDS AND PHRASES: Mathieu’s differential equation, 
Mathieu function, characteristic value, periodic solution, radial 
solution 


CR CATEGORIES: 5.12 


This algorithm contains a number of syntactically incorrect 
FORMAT statements: labeled 901, 911, 921, 941, and 951 in sub- 
routine MFCV AL, and 99 in the functions FJ, FY, DJ, and DY. 
The error consists of omitting a comma separating the Hollerith 
field descriptor and the integer field descriptor, as required by 
Sections 7.2.3 and 7.2.3.2 of the Fortran standard [1, 2]. In all cases 
this may be corrected by inserting a comma immediately preceding 
the field descriptor J3 in these statements. 

It has also been pointed out by the referee and the Algorithms 
Editor that the two FORMAT statements in functions DJ and DY 
contain a character not in the standard Fortran character set. 
The standard is somewhat ambiguous on this point: any represent- 
able character is permitted in a Hollerith constant ina CALL ora 
DATA statement, and also in data to be read in with an Aw field 
descriptor (Sections 4.2.6, 5.1.1.6), but since Hollerith field de- 
scriptors are not Hollerith constants, it must be presumed that 
the prohibition of Section 3.1 applies. The ‘‘at’’ symbol (@) in 
these two statements should therefore be replaced by a blank or 
some other character in the standard set. 

There is another, more serious, error: subroutines BOUNDS 
and MFITR8 both reference a named common block which is not 
referenced by the routine that calls them (MFCV AL). According 
to Section 10.2.5 of the standard, the contents of this block will 
therefore become undefined at the moment either of these two 
routines executes a RETURN, unless this common block is ref- 
erenced by a routine which is directly or indirectly calling 
MFCVAL. This undefinition permits named common blocks to 
be overlaid, and since it is not the author’s intention to allow this 
block to become undefined, the following two statements should 
be added to MFCVAL immediately following the existing 
DOUBLE PRECISION and COMMON statements respectively: 

DOUBLE PRECISION FILL(3) 

COMMON /MF2/ FILL 

REFERENCES: 

1. ANSI Standard Fortran ANSI (USASI) X3.9-1966. American 
National Standards Institute, New York, 1966. 

2. FORTRAN vs Basic FORTRAN. Comm. ACM 7 (Oct. 1964), 
591-625. 
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Remark on: 

Algorithm 352 [S22] 

Characteristic Values and Associated Solutions of 
Mathieu’s Differential Equation 

[Donald S. Clemm, Comm. ACM 12 (July 1969), 
399-407 | 


Michael J. Frisch [Recd. 27 Jan. 1971] 
University Computer Center, University of Minnesota, 
Minneapolis, MN 55455 


Key Words and Phrases: ANSI Fortran standard 
CR Categories: 4.0, 4.22 


The following items were found during compilation of the 
algorithms written in Fortran published to date in Communica- 
tions. The MNF compiler written at the University of Minnesota 
for CDC 6000 Series machines by Lawrence A. Liddiard and E. 
James Mundstock was used to check the validity of the algorithms. 


Algorithm 352 does not conform to the standard in subroutine 
MATH which calls subroutine SUM with arguments that were in 
an EXTERNAL statement but not in a type staternent. The dummy 
argument in subroutine SUM has type DOUBLE PRECISION 
so a statement DOUBLE PRECISION DS, DC, DDS, DDC, PS, 
PC, DPS, DPC should be inserted before the EXTERNAL state- 
ment in subroutine MATH (Section 8.4.2). 

In subroutine JNS, the dummy argument M is also in blank 
common, contrary to 7.2.1.3. In the same subroutine, arrays D, G, 
P, and Q are referenced by array name instead of array element 
name as required in Section 7.2.1.4. The statement should be: 

EQUIVALENCE (A,G(1)), (D(1),G(2)), (PC),G(4)), (QM), 
G(7)), (DM,G(10)), (B,G(11)). 


COLLECTED ALGORITHMS FROM CACM 


ALGORITHM 353 

FILON QUADRATURE [D1] 

STEPHEN M. CHasE AND Luoyp D. Fospickx (Recd. 
7 July 1967 and 6 Jan. 1969) 

Department of Computer Science, University of [linois, 
Urbana, IL 61820 


KEY WORDS AND PHRASES: quadrature, Filon quadrature, 
integration, Filon integration, Fourier coefficients, Fourier 
series 

CR CATEGORIES: 5.16 


comment FSER! evaluates the integrals 


1 1 
C= i F(X) cos (MrxX)dX, S= F(X) sin (MrX) dX 
0 0 


using the Filon quadrature algorithm. The user may request an 
evaluation of C only, S only, or both C and S. FSER1 contains 
an automatic error-control feature which selects an integration 
step size on the basis of an error parameter supplied by the user. 
The Filon quadrature formulas, truncation error, rounding error, 
and automatic error control are described in a companion paper 
[1] bv the authors. 

The calling parameters for this subroutine are defined as fol- 
lows. F is the name of a FUNCTION subprogram F(X), supplied 
by the user, which evaluates F(X) appearing in the integrand, 
EPS is the name for e appearing in inequalities (45) and (46) of 
[1]. It is used in the error control portion of the algorithm. The 
error in the computed values of C and § is related to e by the in- 
equality (76) given in [1]. The user must assign a value to EPS 
before calling FSER1. MAX specifies the maximum number of 
halvings of the step size that are allowed. The minimum step size, 
hin equation (16) of [1], is 2-M4*, The user must assign a value to 
MAN before calling FSER1. M is the parameter appearing in 
the argument MrX of the cosine and sine functions. The user 
must assign a value to M before calling FSERI. C is the value 
of the cosine integral determined by FSER1. 8 is the value of the 
sine integral determined by FSER1. LC is used on entry as asignal 
that the user does want C evaluated (LC = 1) or does not want 
C evaluated (LC = 0). It is used on exit to report the value of h 
used by the subroutine to evaluate C, this value being 2-4°. The 
user must assign a value of 1 or 0 to LC before calling FSER1, 
and if LC = 1 on entry, then the subroutine will assign a new value 
to LC related to the step size by 2-'°. LS is used on entry as a 
signal that the user does want S evaluated (LS = 1) or does not 
want S evaluated (LS = 0). It is used on exit to report the value of 
h used by the subroutine to evaluate S, this value being 2%. 
The user must assign a value of 1 or 0 to LS before calling FSER1, 
and if LS = 1 on entry, then the subroutine will assign a new 
value to LS related to the step size by 2-18, 

FSER1 calls a subroutine ENDT1 which is also listed below. 
The purpose of ENDT1 is to perform the end test described by 
inequalities (45) and (46) of [1]. 

REFERENCES: 

1. Fospicx, Luoyp D., anp CuasE, STEPHEN M. An algorithm 
for Filon quadrature. Comm. ACM 12 (Aug. 1969), 453-457. 
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SUBROUTINE FSERL(F,eEPSsMAXeMy Co Sy LC, LS) 
PT = 3.1415926535898 
XM = M 

C FL = COS(M*PIT) TEMPORARY. 

FL = 1 - 2 * ( M- (M/2 ) * 2 ) 

FO = F(0.0) 

Fl = F(l.0) * Fl 


C 'CIR* WILL BE USED THROUGHOUT THESE COMMENTS TO STAND FOR *SIN* OR 
C 'COS* WHEREVER THUSE TWO SYMBOLS MAY OCCUR, 
C NOW DEFINE SUMCIR OF THE ENDPOINTS. 
SUMCOS = (FL + FO ) * 25 
SUMSIN = 0.0 
Bl = 2. / 3. 
C TMAX IS THE SWITCH-OVER POINT IN THE ANGLE T. 
C OUR ANALYSIS INDICATES THAT TMAX = 1/6 IS THE BEST FOR THE ILLIAC II 
C WHICH HAS A 44 BIT FLOATING POINT MANTISSA. 
TMAX = 0.166 
€ N IS THE NUMBER OF THE ITERATION. NOTE THAT WE START AT THE 
C FOURTH ITERATION STEP. 
C ACTUALLY, THE FIRST EVALUATIUN OF AN INTEGRAL IS AT N = Sy AND 
C THEREFORE, THE FIRST COMPARISUN GF VALUES IS AT Ne 66 
N= 4 
€ BOTH TMAX AND N MAY BE CHANGED IF THE MACHINE FOR WHICH THIS 
C ROUTINE TS INTENDED HAS GREATER GR LESS ACCURACY THAN ILLIAC II. 
C IF N IS CHANGED » THEN THE CORRESPONDING CHANGES MUST BE MADE 
C IN THE ASSIGNMENTS OF H AND NSTOP, 
H= 1. / 16. 
CH = 2 ** -N, 
NSTOP = 15 
C NSTOP = 2**N - 1 
T =H * XM 
Te = T * PI 
NST = 1 
ASSIGN 67 TO MSWTCH 
C LLC AND LLS ARE USED BY THE ROUTINE IN CUMPUTED-GO-TO STATEMENTS. 
C AS SUON AS LLS AND LLC HAVE BEEN DEFINED, WE CAN USE LS AND LC 
C AS RETURN PARAMETERS (SEE ABOVE). 
IF ( LS ) Ly ly 2 
l LLS = 2 
GO TO 3 
LLS = 1 


IF ( LO ) 4, 4, 5 


tN = 1 
C ALL GF THE ABOVE IS EXECUTED ONLY ONCE PER CALL. 
C NOW THE ITERATION BEGINS. 
10 OoDCOS = 0. 
ODSIN = 0. 
C BEGIN SUMMATION FOR ODCOS ANDO ODSIN. 
DO 65 IT = ly NSTOP, NST 
XE=st 
THA = XI * T 


C THA*PI IS THE ANGLE USED IN THES ITH TERM. 
C CIR(I*T*PI) IS CALCULATED HERE USING THE IDENTITY 
C CIR ( INTEGER MULTEPLE OF PI + FRACTIONAL MULT OF PI ) 
C = COSCINTEGER*PI) * CIR(FRAC*PI) 
C = (+ OR -) * CIR(FRAC*PI). 
FRAC = THA 
IN = THA 
THA = IN 
FRAC = (FRAC -— THA) * PI 


C THA IS A FLOATING POINT INTEGER, FRAC IS THE FRACTIONAL PART ¥*PT. 
COSIP = L — 28(IN — 2%(IN/2)) 
TEMPL = COSIP * F(XT#H) 

C TEMPL = COSCINTEGER PART) * F(I*H). 
GO TO ¢ 50 , 55 ) + LLS 

50 OOSIN = TEMP1L * SIN(FRAC) + ODSIN 


55 GO TO ( 60 , 65 ) » LLC 
60 ODCOS = TEMP] * COS(FRAC} + ODCOS 
65 CONTINUE 


GO TQ MSWTCH, (67,70) 
67 NST = 2 
C NOW HAVE MADE UP FOR THE FIRST 4 ITERATIUN STEPS, SG RESET THESE 
C TWO NUMBERS TO LOOK LIKE THE GENERAL CASE, 
NSTOP = 16 
C NSTOP = 2**N (IN CASE YOU CHANGE STARTING VALUE OF N)e 
ASSIGN 70 TO MSWTCH 
GO TO 92 
70 TSQ = TP#TP 
IF (T -TMAX) 74, 74, 75 


C 74 1S THE POWER SERIES FOR SMALL Ty, 75 IS THE CLOSED FORM USED WITH 
C LARGER VALUES OF T. 

C THE POWER SERIES ARE (WITH *TN* = TP**N) 

CA = (26./45e)*T3 - (2./315.)*TS + (20/4725, )¥*T7 

CB = (26/3 -) + (2/15 e)*T2 — (4e/105.)%T4 + (26/5674) ¥*T6 

Cc ~ (4./22275.)*T8 

CG = (46/36) — (26/156)*T2 + (1./2100)*T4 - (1./11340.)*T6 

C THE NEXT TERM IN G IS TOO. SMALL. IT IS (1./997920.)¥*T8 


T4 A = TP * TSQ * (1. —- TSQ * (Le — TSQ / 156) / Ted / 22.5 


B2 = BL * TSQO* .2 
B3 = B2 * TSQ’# 2./7. 
B4 = B3 * TSO / 10.8 
B5 = B4 * TSO * 14./275. 
B = Bl+ B2 - B3 + B4 — BS 
G = 2.*Bl = B2+ B3/ Be. - B4/40. 
CG = 2.*Bl - B2 + B3/8. —- B4/40. + 5.¥*B5/896. IF YOU WANT THE T8 
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C TERM INCLUDED IN G. 
GO TQ 80 
c eee FORM OF THE COEFFICIENTS, WHERE AGAIN 'IN# MEANS TP¥#N, 
c = 1Le/TP + COS(TP)*SIN(TPI/T2 -— 2.%(STN(TP))#*2/T3 
Cc R = 2e*((L + (COS(TP)D**2)/T2 - 2.*SIN( TH) ¥CUS(TP)/T3) 
CG = 4.2(SINCTP)/T3 - COSITP)I/T2) 
75 IN = T 
TEMPL = 1-2 * ( IN- 2 * ( IN/ 2} ) 
TEMP2 = IN 
C TEMPL 1S COS ( INTEGER PART OF TP), TEMP2 IS FRACTIONAL PART UF TPe 
TEMP2 =(T - TEMP2 ) * PI 
S1 = TEMP] * SIN (TEMP2) 
C Sl = SIN(TP) 
Cl = TEMPL * COS (TEMP2) 


Cc Cl = COS(TP) 
P= Sl * CL 
ater = Si * Sl 
= (((-2.*S1SQ/TP) + PIY/STP +1.)/ TP 
s = 20 * ((-2.% P/ TPI+ 2. -S5150) / TSO 
= 4. * (S1 / TP =~ Cl1)/ TSO 
80 GO TO (81+ 85), LLS 
C HAVE CALCULATED THE COEFFICIENTS, NOW READY FOR THE INTEGRATION 
C FORMULAS. 
81 T2 = H*¥ (A * (FO - Fl) + B® SUMSIN + G * UDSIN) 
C ENDTL 1S A SUBROUTINE WHICH CHECKS FOR THE CONVERGENCE OF THE 
C ITERATIONS. ENDT] REQUIRES THE PRESENT VALUE TO AGREE WITH THE 
C PREVIOUS VALUE TO WITHIN EPS2_ WHERE 
C EPS2 = (1.0 + ABSF(PRESENT VALUE)) *EPS 
C EPS ITS SUPPLIED BY THE USER. 
CALL ENDT1L (PVT2, T2, EPS» Sy LLSs LN) 
GO TO ( 85»y 84 )» LLS 
a4 LS = N 
85 GO TO (96,90) LLC 
C THIS IS THE COSINE INTEGRAL. 
6 Tl = H * ( B * SUMCOS + G * ODCOS) 
CALL ENDTL (PVTL, Tl, EPS, Cy LLC, LN) 
GO TO ( 90, BI )> LLC 
ag tC = N 
90 LN = 2 
C NOW TEST TO SEE IF DONE. 
IF (LLC + LLS - 3) 92, 92, 100 
92 N=N+ 1 
C THIS IS THE BEGINING OF THE ITERATION. 
TF (N=MAX) 95, 95,100 


95 05 * H 


NSTOP = 2 * NSTOP 
= SUMSIN + ODSIN 

SUMCOS = SUMCOS + OULCUS 

GO TO 10 
100 S$ = T2 

c= Tl 

RETURN 

END 

SUBROUTINE ENDT1 

GN TO € 29, 20),% L2 
20 REPS = EPS * (1.9 + ABS(QUANT)) 


(PREVOT, QUANT,EPS, VALUE, bls L2) 


23 TF (ABS{(PREVOT — QUANT) — REPS) 25, 25, 29 
25° VALUE = QUANT 
Ll = 2 


29 PREVOT = QUANT 
30 RETURN 
ENO 


REMARK ON ALGORITHM 353 [D1] 

FILON QUADRATURE [Stephen M. Chase and 
Lloyd D. Fosdick, Comm. ACM 12 (Aug. 1969), 
457-458] 

Bo Ernarsson (Recd. 8 Dec. 1969) 

Research Institute of National Defense, Box 98, 
8-147 00 Tumba, Sweden 


KEY WORDS AND PHRASES: quadrature, Filon quadrature, 
integration, Filon integration, Fourier coefficients, Fourier series 
CR CATEGORIES: 5.16 


The algorithm has been tested in double precision on an IBM 
360/75 with great success. An improvement to the algorithm to 
take care of heavily oscillating functions can easily be made. The 
starting value of the number N of iterations is chosen to give at 
least four quadrature nodes for each full period of the trigono- 
metric function. The following changes are therefore suggested: 


line 22: N = ALOG(2.+XM)/0.693 
line 27: H = 1.0/FLOAT (2«+#N) 
line 29: NSTOP = 2«4N — 1 

line 79: NSTOP = 2«+N 
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The following example shows the importance of this change at 
the computation of the sine integral for m = 64 with the function 
f(z) = 22(1—2z), which is zero at both endpoints. Entry variables 
were in both cases MAX = 20, M = 64, LC = 1, LS = 1, and 
EPS = 1.0E-10. The computation in double precision gave the 
results: 


—0.2473661710D -04 
= 0.0 

improved version LC = LS = 9 C = —0.2473661709D-04 
S —0.7381790409D -06 


original version LC =. LS: =6 C= 
Ss 


The exact values are 
C = —1/(64r)? 


ll 


—0.2473661710- 1074 
and 


S = —6/(64r)? = —0.7381790413-10-°. 

The failure of the original computation is due to the fact that 
all the inner nodes of the sine integral are multiples of 7, and the 
boundary contribution is zero. The connection with the sampling 
theorem is obvious. 

The original version of the algorithm is not valid for negative 
values of M. The use of ALOG(2.*XM) is therefore no essential 
restriction, since the algorithm is rather slow for computing an 
ordinary quadrature (with M = Q). 

It is important to observe that the present version will give 
correct values only if the maximum number MAX is larger than 
the computed value N. 
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ALGORITHM. 354 

GENERATOR OF SPANNING TREES [H] 

M. Doveuas McItroy (Recd. 29 Apr. 1966, 9 Sept. 1968 
and 6 Mar. 1969) 

Bell Telephone Laboratories, Murray Hill, NJ 07971, and 
Oxford University Computing Laboratory, Oxford, 
England 

KEY WORDS AND PHRASES: 

CR CATEGORIES: 5.32 


spanning trees, trees, graphs 


/* This procedure finds all trees that span a nondirected graph 
on n nodes. The essential step of this procedure partitions the set 
T(G) of trees which span graph G into two classes. Trees of one 
class contain a branch connecting a selected pair of nodes, ¢ and 
j; trees of the other class exclude such branches. To formalize the 
effect of partitioning with respect to nodes 7 and Jj, we let Ai; be the 
“attachment set’’ of branches between them, and G;; be the graph 
derived from G by combining 7 and 7 into aisingle node. Then 


T(G) = T(Gi;) X Ais U TG@—Ais). 


The algorithm generates T(G) by a particular combination of re- 
cursive and iterative applications of this partition. A set S of 
combined nodes is ‘“‘grown’’ by incorporating one node at each 
level of recursion. The attachment set for node 7 is the set of 
branches radiating from 7 to members of S. The recursion bottoms 
whenever S contains all nodes, and a ‘“‘family”’ of trees is then 
produced, where a familyis the Cartesian product of the attach- 
ment sets from each level. 

The basic method would work for any graph, but to simplify 
data representation, this algorithm requires that G be free of 
paralleled branches and self-loops. All computations are done in 
terms of the original graph to save actually having to combine 
nodes and, incidentally, to avoid parallels arising from combina- 
tion. A set of nodes is represented by a string of n bits, with 1’s 
for nodes present and 0’s for nodes absent. The original graph is 
represented by an array of n strings, where the 7th string indicates 
the set of nodes neighboring node 7. An attachment set for node 7 
is a suitable subset of its neighborhood. 

The algorithm maintains the graph G, the set of combined 
nodes S, and a boundary set B of nodes neighboring members of 
S. B is disjoint from S. Initially S contains only node 1; B is the 
neighborhood of node 1. The key recursive routine ‘‘grow”’ iterates 
over the nodes of B. For each node 7 in B it finds the attachment 
set (necessarily nonempty) connecting z to S. It then removes the 
attachment set from G and node i from B, and calls ‘“‘grow”’ 
recursively with S augmented by node 7 and B augmented by 
neighbors of 7 (except those in S). The recursive call thus yields 
trees which include branches from node 7 to S, while the iteration 
over succeeding nodes in B yields trees which exclude such 
branches. 

As an example, for the graph 


1(2,3,4) 2(1,3) 3(1,2,4) 4,3) 
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the algorithm generates eight trees in four families: 
1() 20) 30,2) 4(,3) 
1() 21) 34) 4(1) 
1() 28) 380) 44,3) 
1() 2(8) 34) 4() 


In these lists a set is represented by its index together with a 
parenthesized list of contained nodes. 

Unlike other algorithms in the literature [1, 2, 3], this produces 
unique trees and hence does not require storage for a checklist of 
trees already produced. An algorithm of Burstall [4] generalizes 
this strategy to a wide class of problems; however, a direct par- 
ticularization of Burstall’s algorithm for spanning trees would be 
less efficient. 

Acknowledgment is due S. C. Johnson, A. J. Goldstein, J. B. 
Kruskal, and D. M. R. Park for discussion and help, also P. 
Seaman and IBM U. K. Laboratories for testing. 
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/* Nomenclature 
G = the graph, or modified graph, 
S = set of nodes covered by growing family, 
B = “boundary” set of uncovered nodes neighboring to 
members of S, 
A = array of attachment sets, 
f(A) = routine for processing each family once it has been 
generated .*/ 
trees: 
procedure(G, f); 
declare G(*) bit(*), f entry, n fixed binary; 
n = hbound(G, 1); 
begin; 
declare A(n) bit(n), 
untt entry (fixed binary) returns (bit(7)); 
/*Start at node 1. Arguments by value.*/ 
eall grow((G), unit(1), (@(1))); 
grow: 
procedure(G, S, B) recursive; 
declare(G(n), S, B) bit(n), 7 fixed bimary; 
if “S = '0' b then call f(A); 
else 
doi = 1ton; 
if subsir(B, 1, 1) then 
do; 
substr(B, 7, 1) = '0'b; 
A(i) = Gi) &S 
Git) = Gt) & -S; 
call grow((G). Slunit(t), BIG(t)); 
if G7) = 'O' b then return; 
end; 


COLLECTED ALGORITHMS (cont.) 354-P 2- 0 


end; 
end grow; 
unit: 
procedure(z) bit(n); 
declare i fixed binary, u bit(n) initial (''b); 
substr(u, 7,1) = '1'b; 
return (wu); 
end unit; 
end; 
end trees; 
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ALGORITHM 355 
AN ALGORITHM FOR GENERATING ISING CON- 
FIGURATIONS [Z] 
J. M. 8. Stim6es Pererra (Recd. 20 Dec. 1967 and 10 
Mar. 1969) 
University of Coimbra, Coimbra, Portugal 


KEY WORDS AND PHRASES: Ising problem, zero-one se- 
quences 
CR CATEGORIES: 5.39 


procedure [sing (n, x, t,S); integer n, 2,1; integer array S; 

comment Ising generates n-sequences (S;, ---, Sn) of zeros and 
ones where z = > f=1 Ss-and t = )o?-i | Siz: — Ss | are given. 
The main idea is to interleave compositions of « and n — 2 
objects and resort to a lexicographic generation of composi- 
tions. We call these sequences Ising configurations since we 
believe they first appeared in the study of the so-called Ising 
problem (See Hill [1], Ising [2]). The number R(n, 2, t) of dis- 
tinct configurations with fixed n, x, ¢ is well known [1, 2]: 


Rn, 2,¢ = m4 1) = 2% 1)" 2 2) 

m m 

R(n, 2, t = amo FONE ‘y+ E277) 
m \ m—il m—-1 ps 


Now define a block of 1’s (or zeros) in the sequence as a set 


of a maximum number of consecutive 1’s (or zeros) eventually 


consisting of a single element. For given n, x, t, the number p 
of blocks of 1’s may easily be deduced from ¢, as well as the num- 
ber g of blocks of zeros. In fact, a block of 1’s including either 
S; or S, yields one variation and each one of the others yields 
two variations; hence we get p = q = m+ lwhent = 2m+ 1 
(t odd requires S; ~ S,) and either p = m+ 1,q =m (Si = 
S, = 1), or p=m,q=m-+i1 (Si = Sz = 0) when t = 2m. 
Clearly, there is a 1-1 correspondence between the compositions 
of x with p parts and the distributions of the x 1’s into p blocks. 
And for each distribution of 1’s, distinct distributions of the 
nm — x zeros into g blocks correspond to distinct configurations. 

The main body of the algorithm is compose, which generates 
compositions of an integer x with & parts and stores them in the 
array L. The role of sort and bisort is to form the final sequence 
(S:, °°: , Sn) from the structure of one-blocks L; and zero- 
blocks M; . , 

The Ising problem was brought to my attention by Dr. B. 
Dejon during an informal visit to the IBM Research Laboratory 
in Zurich. Thanks are also due to Prof. Paul Erdés for pointing 
out to me reference (1] and to Prof. A. A. Zykov for correspond- 
ence. The procedure was tested on the NCR 4130 of the Labora- 
tério de Célculo Autom&tico, Universidade do Porto. Thanks 
are also due to the Director and his Staff. 
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2. Istne, E. Beitrag zur Theorie des Ferromagnetismus. Z. 
Physik 31 (1925), 253-258; 
begin 
integer k; integer array L, M[1 : t+2+41]; 
procedure sort (L, M,z); integer array L, M; integer 2; 
begin 
integer 7, 7,7, m, 2b; 
for m := 1 step 1 until n do S[m] := z; 
r:=t:= 1; 2eb:= 1 — 2; 
AA: j:=r+Li] —1,; 
for m := r step 1 until j do S[m] := 2b; 
if? +1<k then 
begin r := 7+ M(t} +1; 7:=7+1; goto AA end; 
comment Insert here an output procedure such as out- 
array (1, 8); 
end sort; 
procedure bisori (L, M); integer array L, M; 
begin sort (L, M, 0); sort (M, L, 1) end bisort; 
procedure compose (z,k, L, p); value z; integer 2, k; 
integer array L; procedure 7; 
begin 
integer 7, a; 
if c < k then goto CC; 
Lili] :=2—k+1; 
for 7 := 2 step 1 until k do L{z] := 1; 
P; 
if k < 1 then go to CC; 
a:=1; 
BB: if L{fa] > 1 then 
begin 
La} := Lia] ~ 1; Lia+1) := Liat+1} +1; p; 
ifa~k—1thena:=a+1; goto BB 
end; 
Lfa] := Lifa+1]; Lflatlj:=1; a:=a—1; 
if u > 1 then go to BB; 
CC. 
end compose; 
k:=t+2+1; 
if ¢ * (t+2) X 2 then 
begin 
procedure pl; bisori (L, M); 
procedure p2; compose (n—x, k, M, pl); 
compose (x, k, L, p2) 
end 
else 
begin 
procedure 73; sort (L, M, 0); 
procedure p4; compose (n—2x, k—1, M, 13); 
procedure p5; sort (M, L, 1); 
procedure p6; compose (n—x, k, M, pd); 
compose (x, k, L, p4); 
compose (x, k—1, L, p6) 
end 
end Ising 
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A PRIME NUMBER GENERATOR USING THE 
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RIcHARD C, SINGLETON* (Reed. 28 Jan. 1969 and 11 June 
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Stanford Research Institute, Menlo Park, CA 94025 


* This research was supported by the Stanford Research Insti- 
tute out of Research and Development funds. 


KEY WORDS ANDPHRASES: prime numbers, number theory, 
sorting 
CR CATEGORIES: 3.15, 5.30, 5.31 


procedure PRIME(IP,m); value m; 
integer m; integer array [P; 

comment This procedure finds the first m > 4 elements of the 
infinite sequence 2, 3, 5, 7, 11, --- of prime numbers and stores 
them in JP[1], ZP[2], ---, [P[m]. The method of distinguishing 
primes from composite numbers is similar to that used by B. A. 
Chartres [1]. A counter value 7 is compared with the smallest 
value in a list JQ of odd multiples of primes less than or equal 
to «/n. If unequal, n is a prime and is added to the output list 
IP. Otherwise, the matching elements of JQ are incremented, 
based on the corresponding. entries. in the list JQ. Both n and 
the composite numbers in JQ are incremented so as to omit 
multiples of 2 and 3. 

This procedure differs from Algorithm 311 in the method of 

finding the smallest entry in JQ. Here the list JQ is kept partially 
ordered as a tree, i.e. 


1Qtt] 2 IQ + 2] 


thus the base element JQ[1] is always smallest. The variable 
igi holds the current value of JQ[1], and jqi the negative of 
JQ[Il]. If n = zqi, then iqi is incremented by jqi + jqi if jqi > 0 
or by —jqi if jqgi < 0. Then JQ is reordered to bring the next 
smallest element to the base and to return the new value of iq 
to the tree, using a method similar to Williams’ procedure 
SWOPHEAP [8]. The tag list JQ is permuted along with 7Q. 
The treesort principle, used in SWOPHEAP, is well suited to 
the present task of finding the smallest element of a changing 
list. 

In Algorithm 311, five working-storage arrays serve the func- 
tion of the two used here, and the information is totally ordered 
each time a prime is found. Between primes the unordered seg- 
ment of the information is searched to locate the smallest ele- 
ment. The method used here is both simpler and more efficient. 

On the Burroughs B5500 computer, this procedure finds the 
first 10,000 primes in 53 sec. For other values of m, time is pro- 
portional to m!-*4, Corresponding times for Algorithm 311 were 
91 sec for m = 10,000, with time proportional to m)-® for 
other values of m. However, another algorithm [2] finds the 
first 10,000 primes in 14 sec on the B5500 and has times propor- 
tional to m!-4 for other values of m. 

REFERENCES: 

1. Cuartres, B. A. Algorithm 311: Prime number generator 
2. Comm. ACM 10 (Sept. 1967), 570. 

2. SineteTON, R. C. Algorithm 357: An efficient prime num- 
ber generator. Comm. ACM 12 (Oct. 1969), 563-564. 


for2<71< 9, 
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3. Wiuurams, J. W. J. Algorithm 232: Heapsort. Comm. ACM 
7 (June 1964), 347; 
begin 
integer array JQ, JQ(0 : sqri(m)i; 
integer 7, 7j, inc, 1gt, j, Jj, jg, k, n; 


IP(l] := 7 := 2; 
IP{2] := k := 3; 
IP{3] := n := 5; 
jj = tqt := 25; jqt := —10; 
IQ(2] := 49; JQ[2] := —14; 
inc := 4; 
go to Le; 
La: iqi := if jgi > 0 then igi + jgi + jqi else igi — jqi; 
7:= 1; 
comment Reorder the tree, bringing the smallest element to 
the bottom; 
for 77 := 7 + 7 while 7j7 <j do 
begin 


if [Q(i7] > IQkij + 1] then 7j := 77 + 1; 
if JQ[¢j] > iqi then go to Lb; 
1Q(¢) := IQ{i7];_ JQlt] := JQleg}; 7 
end; 
if iqe < 77 then goto Lb; 37 := IQ{j); 
comment Add a new entry to the top of the tree; 
J:=j+1; 7 := IP + 2); 
1Q{f. := ay fT 2; JQ) = y+ yy; 
if ({j—(ij+3)X3) = 1 then JQ[j] := — JQUj); 
comment Return igi and jqi to the tree and fetch a new pair 
from the bottom; 
Lb: IQ[t] := tqt; iqt := IQ[1]; 
JQ] := jqts jqi := — JQU; 
if n = igi then go to La; 
comment Increment n and compare with the next smallest 
composite number; 
Le: ine :=6— inc; n:= n+ ine; 
if n = iqi then go to La; 


= aj 


k:=k+1; IPlk] :=n; 
if k ~ mthen go to Lc; 
end PRIME 


COLLECTED ALGORITHMS FROM CACM 


ALGORITHM 357 

AN EFFICIENT PRIME NUMBER GENERATOR 
[Al] 

RicHarp C. SINGLETON* (Recd. 28 Jan. 1969 and 11 June 
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integer procedure NPRIME(IP, m, jlim); 
integer m, jlim; integer array IP; 

comment This procedure finds the next m primes and stores 
them in IP[1], IP[2], ---, [P[m]. [P[m+1], I[P[m+2], --:, 
IP{jlim] are used for working storage, where jlim > m. On the 
first entry, [P[1] must have a value less than 0 as a flag to set 
initial conditions. Also, m must be greater than or equal to 2 
on first entry and greater than or equal to 1 on subsequent en- 
tries. The arrays JQ and JQ must be large enough to hold all 
primes less than or equal to the square root of the maximum 
number scanned in looking for primes. To generate the first 
million primes, approximately 550 entries are needed in each of 
these two lists. The lists are extended as needed, using a sec- 
ondary prime number generator similar to Wood’s [3], and the 
current upper index is returned as the value of NPRIME., 

The method used is the familiar sieve of Eratosthenes. The 
elements of the upper portion of array JP are set to zero, and 
correspond to a sequence of consecutive odd integers. The com- 
posite numbers are crossed off by entering the smallest prime 
factor in the corresponding cell, leaving zeros for primes. (At 
this point, the array ITP contains the equivalent of a factor 
table, i.e. the smallest factor for each composite odd integer.) 
The list of primes is then constructed by storing the consecutive 
prime numbers in the lower portion of IP. Whenever the infor- 
mation in the upper portion of IP is exhausted, a new sequence 
of odd numbers is scanned as described above. On exit, the un- 
used portion is left for use in the next call. 

As compared with another algorithm [2] based on comparing 
a counter value with the next smallest composite number, and 
not working ahead in a scratch storage, the present algorithm 
was found to be faster, even for jlim = m + 1. Efficiency im- 
proves with added working storage. The improvement is sub- 
stantial at first but is slight beyond jlim = 2m. For jlim = 2m, 
time to find the first n primes on the Burroughs B5500 or the 
CDC 6400 computer was proportional to n4, On the B5500 
computer, it took 13.5 sec to find the first 10,000 primes, gen- 
erating them 500 at a time in an array length of 1022. On the 
CDC 6400 computer, with the algorithm coded in machine lan- 
guage, it took less than 98 sec to find the first million primes, 
generating them 1000 at a time in an array of length 10,000. 
Timing within this run, with jlim = 10m, was proportional to 
n}-, It is interesting to note that Chartres estimated a time 
of 12 hours on the B5500 for this task, using Algorithm 311 [1]. 

This algorithm can be expressed in either ALaou or FortRAN, 
and gains no special advantage from machine language coding. 
However, if we plan to produce very large tables of primes for 


value m, jlim; 
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future use, machine language shift operations may be useful in 
compressing the data for storage. One method of compression 
is to use a single bit to indicate that an integer is a prime, e.g. 
0 = composite and 1 = prime. By omitting multiples of 2, 3, 
and 5 from the corresponding sequence of integers, 8 bits suffice 
to identify the primes in each 30 consecutive integers. 
REFERENCES: 
1. Cuartres, B. A. Algorithm 311: Prime number generator 
2. Comm. ACM 10 (Sept. 1967), 570. 
2. StneLeTON, R. C. Algorithm 356: A prime number gener- 
ator using the treesort principle. Comm. ACM 12 (Oct. 
1969), 563. 
3. Woop, T. C. Algorithm 35: Sieve. Comm. ACM 4 (Mar. 
1961), 151; 
begin 
own integer array JQ, JQ(0 : 600] 
own integer 7), 1k, inc, j, nj; 
integer 7, jqt, k, ni; 
k:=0; if [P[{l] > 0 then goto LJ; 
comment Set initial conditions; 
IP{1) := JQ{1] := ik := ine := 2; 
ZIQ{2] := 9; JQ[2] := IQ[1] := 77 := 3; 
IQ[3] := 25; JQ[3] := nf := 5; k:= 1; 
comment Prepare to delete a sequence of composite numbers; 
Ta: j:=k+1; ni:= QU) —-—j- 3; 
IQ(\] := jlim + jlim + nt; 
for 1 := j step 1 until jlim do IP{i] := 0; 
Lb: i:= ij; if IQ[ij] > IQ[1] then go to Le; 
comment Extend the list of primes in array JQ counting so 
as to omit multiples of 2 and 3; 
Le: nj := nj + tne;, inc := 6 — ine; 
if JQ(tk +1] 7 2 < nj then tk := ik +1; 
for j := 3 step 1 until 7k do 
if (nj + JQ[j)) X JQ[j] = nj then go to Lec; 
i= i7+1; JQ) := nj; IQ] := nj T 2; 
go to Lb; 
comment If j-+ 7+ ni is composite, enter its smallest prime 
factor in JP{j]. If 7 + 7 + ni is prime, then JP[j] = 0; 
Ld: IP[j] := jai; gj = 5+ jai; 
if 7 < jlim then go to Ld; 
Qt] :=jtj+ni; 
Le: t:=t—1; jqt:= JQUi]; 7 := (UQ[i] — nt) + 2; 
if 7 < glim then go to Ld; 
if 7 ~ 1then gotoLe; j:=k; 
comment Pack the next m primes in JP{1], ---, 7P[m]; 
Lf: j:=j7+1; if IP[j] + 0 then go to Lf; 
if j = jlim then go to La; 
k:=k+1; [P[k] :=j7+j7+ ni; 
if k ~ m then go to Lf; 
comment The current length of the tables in arrays JQ and 
JQ is returned; 
NPRIME := tj 
end NPRIME 
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Remark on Algorithm 357 [A1]| 
An Efficient Prime Number Generator [Richard C. 
Singleton, Comm. ACM 10 (October, 1969), 563] 


Richard M. De Morgan [Recd 8 August 1972], Digital 
Equipment Co. Ltd., Reading, England 


On some Algol 60 implementations, the value of #7 is destroyed 
between subsequent calls to the procedure. The second and third 
lines of the algorithm should be changed to make i an own integer: 


own integer ij, ik, inc, j, ni, nj; 


integer /, jgi, k; 
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ALGORITHM 358 

SINGULAR VALUE DECOMPOSITION 

OF A COMPLEX MATRIX [F1, 4, 5] 

Peter A. BUSINGER AND GENE H. Gotvs (Recd. 31 Jan. 
1969 and 18 June 1969) 

Bell Telephone Laboratories, Inc., Murray Hill, NJ 07974 

Stanford University, Stanford, CA 94305 


KEY WORDS AND PHRASES: singular values, matrix decom- 
position, least squares solution, pseudoinverse 
CR CATEGORIES: 5.14 


CSVD finds the singular values o1 > o2 > «++ > on of the com- 
plex M by N matrix (M > N) which is given in the first N columns 
of the array A. The computed singular values are stored in the 
array 8. CSVD also finds the first NU columns of an M by M 
unitary matrix U and the first NV columns of an N by N unitary 
matrix V such that ||A — UZV*|| is negligible relative to |All, 
where > = diag (¢:). (The only values permitted for NU are 0, 
N, or M; those for NV are 0 or N). Moreover, the transformation 
U* is applied to the P vectors given in columns N + 1, N + 2,-:-, 
N + P of the array A. This feature can be used as follows to find 
the least squares solution of minimal Euclidean length (the pseu- 
doinverse solution) of an overdetermined system Az 7 b: Call 
CSVD with NV = N and with columns N +1,N +2,---,N+P 
of A containing P right-hand sides b. From the computed singu- 
lar values determine the rank r of = and define 5+ = diag (o:7!, 
og}, «++, ot, 0, «++, 0). Now 2 = V2*+b, where 6 = U*d is fur- 
nished by CSVD in place of each right-hand side b. 

CSVD can also be used to solve a homogeneous system of linear 
equations. To find an orthonormal basis for all solutions of the 
system Ax = 0 call CSVD with NV = N. The desired basis con- 
sists of those columns of V which correspond to negligible singular 
values. Further applications are mentioned in the references. 

The constants used in the program for ETA and TOL are ma- 
chine-dependent. ETA is the relative machine precision, TOL 
the smallest normalized positive number divided by ETA. The 
assignments made are valid for a GE635 computer (a two’s 
complement binary machine with a signed 27-bit mantissa and a 
signed 7-bit exponent). For this machine, ETA = 2-% = 1.5E-8 
and TOL = 271%/2-% = 1,H-31. 

The arrays B, C, and T are dimensioned under the assumption 
that N < 100. 

The authors wish to thank Dr. C. Reinsch for his helpful sug- 
gestions. 


REFERENCES 

1. Gotus, G. Least squares, singular values, and matrix ap- 
proximations. A plikace Matematiky 18 (1968), 44-51. 

2. Gotus, G., AND Kauan, W. Calculating the singular values 
and pseudoinverse of a matrix. J. SIAM Numer. Anal. 2 
(1965), 205-224. 

3. GoLtus, G., AND Reinscu, C. Singular value decomposition 
and least squares solutions. Numer. Math. (to appear) 


aa 


aa 


aa 


ao 


aa 


aa 


358-P 1- 


SUBROUTINE C S VO 

1 (Ae MMAXe NMAXe Me Ne Pe NUs NVe 
2 Se Use V) 

COMPLEX AC MMAXo LP eUCMMAXe LI eVONMAXoL) 
INTEGER MeNePeNUeNV 

REAL S11) 

COMPLEX GeR 

REAL B(100)eC(100)*T(100) 

DATA ETAvTOL/I-5E-8r1.£-31/ 

NPIN+P 

NI=Nel 


HOUSEHOLDER REDUCTION 
C(1)=0.£0 
K=1 
10 KL=Kel 


ELIMINATION OF AC TeK)o IT=KtleoseeM 
Z=0.£0 
00 20 I=KeM 
20 Z=UZ+#REALCACT oK) De *#24AIMAGCACT 9K) ) #92 
BCK)=0.E0 
IF¢ZsLE.TOLIGOTO 70 
Z=SORT(Z) 
BIKi=2 
WOCABSTACK#K)) 
O=(1-£E020-E0) 
IF (WINES OSEOPGZACK 9K DSW 
A(KeK )=Q¥(Z+W) 
IF (K.EQ.NPIGOTO 70 
DO 50 J=KIeNP 
Q@=(0.£090.6E0) 
00 30 I=KsM 
30 Q=G+CONUGEACT eK) ) *ACTI oJ} 
G=Q/EZ40Z+W)) 
DO 40 I=Kom 
40 ACT eJIZACT oJ) -OeACI KK) 
50 CONTINUE 


PHASE TRANSFORMATION 
Q=-CONJGCATK eK) /SCABSOCACKHK)) 
DO 60 J=K1lsNP 
60 ACKe J)=Q*A Kou? 


ELIMINATION OF AlKeJ)}e J=K+2re0e0N 
70 IF(K.sEQ.NIGOTO 140 
2=0.E0 
00 80 J=KLeN 
80 Z=Z+REALCA(K ed) ) e#24AIT MAGE ACK ed) ) 042 
C(K1)=0.E0 
IFCZ.LE.TOLIGOTO 130 
Z=SQRT(Z) 
C(K1)=2 
WICABS(A(KeK1)) 
@=11.E090.E0) 
IF (WeNE* Os EO) OAK KLIS 
A(KeK1)=G4( ZW) 
DO 110 I=Klem 
6=(0.E090-E0) 
DO 90 J=KleN 
30 Q=Q+CONUG( ACK J) D*AL Ie) 
GO=GstZ*tZaW)) 
DO 100 J=KleN 
100 ACI e J) ZACT 2 JI-G#A(K 9d) 
110 CONTINUE 


FHASE TRANSFORMATION 
Q=-CONJUGCACK KL) SCABSCACK K19) 
BDO 120 I=KloM 

120 ACT eK1) =ACITeK1)*@ 

130 K=K1 
GoTo 10 


TOLERANCE FOR NEGLIGIBLE ELEMENTS 
140 EPS=0.E0 
DO 150 K=1leN 
S¢KI=B CK) 
TOK )=CCK) 
150 EPSZAMAXI(EPS eo StKD#TIK)) 
EPSZEPS*ETA 


INITIALIZATION OF U AND V 

IF(NUEG.OIGOTO 180 

00 170 JaleNU 

00 160 I=1.+M 

160 UCT e JI =(0.E6000.E0) 
170 Ul JeJI=C12E090.£0) 
180 IFCNV.EG.0)IGOTO 210 

DO 200 J=lrNV 
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C SORT SINGULAR VALUES 


™~ OO 190 I=leNn 
190 ViTeJI=(00E090-£60) 
200 Vide JSC LEO8 0.60) 


DO 450 K=1eN 
G=-1.E0 
J=K 


c DO 390 I=KeN 
C OR DIAGONALIZATION IF(SCLIILE.GIGOTO 390 
210 DO 380 KK=l oN G-=S¢tI) 

KSNI-KK J=I 

c 390 CONTINUE 

c TEST FOR SPLIT IF(JeEQ.KIGOTO 450 

220 DO 230 LL=19K StJ=S5K) 
L=Kel-te S(KI=G 
IFCABSCTEILID.LE-EPSIGOTG 29¢ IF(NV.EG@.O)IGCTO 410 
TF CABS (S(L-1)).LE¢EPS)GOTO 240 00 400 I=1eN 
230 CONTINUE QG=VC(TsJ) 
Cc VCTe J) oVOT eK) 
c CANCELLATION OF EtL) 400 VOTeKI=0 
240 C€S20.E0 410 IF(NULEGL0IGOTO 439 

SN=1.£0 00 420 I=leN 

Li=t-1 Q=UCT J) 

DO 280 T=LeK UCTeJIZUCT eK) 
FIOSN#T(T) 420 UCT eK) =0 
TELIZSCS#T(T) 430 IF(N«EGeNPIGOTO 450 
IF CABSCUF IY ALE*EPSIGOTO 290 00 440 I=Nl*NP 
H=S¢1) O=AlJrI) 

W=SQRT CF ef +HeH) ACJe TACK oT) 
S¢IT)cw a40 A(KeT)=0 
CSSH/W 450 CONTINUE 
SN=-F /W 
TF (NULEQ@.0)GOTO 260 BACK TRANSFORMATION 
DO 25C J=leN IF(NULEG.OIGOTO 516 
XTREALCUCIeL 19) DO S500 KK=leN 
YOREAL(UCJ eI)? K=N1-KK 
UlCUeLLYSCMPL XCXeCS+Y*#SN20.E0) IF(8(K)-E0.0.E0IGOTO 500 
250 UCSeTPSCMPLX(Y*CS-X*#SN202E0) QrI-A CK eK D/JCABSIACKeK)) 
260 IFQNP.EG.NIGOTO 280 DO 460 J=leNU 
DO 27C J=N1 NP 460 UCKeJI=O*UCK OJ) 
QrACL1eJ) 00 490 J=leNU 
RIA(IsJ) Q=1026£090.E0) 
A{LIsJ)}=Q*CS#ReSN 00 470 I=KemM 
270 © ACTsJI=SR*CS-Q¥SN 470 G=O+CONIG CACTI eK *LCI ed) 
280 CONTINUE G=Q/(CABSLACK eK) *B(K)) 
c DO 480 I=KeM 
€ TEST FOR CONVERGENCE 480 UCIT eJIDUCT es JI-OeACT eK) 
290 W=S¢K) 490 CONTINUE 

IF(L«E9.K)GOTO 360 500 CONTINUE 

c 510 IF(NV.EG.0)IGOTO 579 

¢ ORIGIN SHIFT IF(NZLT22)GOTO 570 

X=SCL) 00 560 KK=29N 

Y=S(K-1)} K=NI-KK 

G=Tt(K-1) K1I=IKel 

H=TCK) TF(COKL}.EQ.0.E0)G0TO 560 

FICOY-WISCY*WDO(G-H) #(G+HI ISI 2.E0e#H*Y} G=-CONJUG LACK KI) /CABSUACKsK1)) 

GISORT(F*F41.E0) DO 520 J=leNV 

IF (F uL TeOsEOG=-G §20 VOKleJIHOeVEKI ed) 

FICCX-Wre(XeW eC Y/ CF eG) =H) eH) x DO 550 J=leNV 

¢ G=(0.E090.E0)} 

Cc OR STEP 00 530 I=KleN 
CS=1.£0 530 Gr=G+A(KeT devileu? 
SN=1.E0 G=G/ICABSCACK oK1)) *C(K1)) 
Li=tel DO 540 I=KleN 
O00 350 I=Liek 540 VEDeJDZVET ee JI -G*CONUGCACK SID) 

G=T(T) 550 CONTINUE 

y=S(T) 560 CONTINUE 

H=SN#G 570 RETURN 

G=CS*G END 

WIOSQRTCH*H4F #F ) 

TEIT-1)=W 

CS=oF /W 

SN=H/W 

FOX8CS+6 9SN 

G=GeCS-X*eSN 

HOYSSN 

Y=Vecs 

IF(NV.EQ@.0)GOTO 3106 

00 300 J=1leN 
XSREALOUVGJSeT-1)9) 

WOREAL CVG SeT)) 

V¢JSel-1LIOCMPLXOX*CS+WeSN20.E0) 
300 Vd Je LT) SCMPLX(WeCS-XeSN20.E0) 
310 W=SORT (HOH SF OF) 

S(I-1) 50 

CS2F /u 

SN=H/W 

FOCS#G+SNey 

X=CS*Y-SNOG 

IF(NU.EQ.0)GOTO 330 

00 320 J=leN 
YOREAL(UCIeT=-1)) 

WIREAL (UC JeT)) 

Ut Je LT~1LI=CMPLXECY*CS+WeSNeCoEO) 
320 UCSe T)=CMPLXCW#CS-Y*SN10.E0) 
330 IF «{NOEGeNPIGOTO 350 

DO 340 J=NIeNP 
GrA(T-19J) 

R=ACTeJ) 
ACIT-1e JI =OeCSeRESN 
340 ACT sJI=R*CS-O*SN 
350 CONTYNLE 

T(LI=O.EO 

TCKIVOF 

SCK)=X 

GOTO 220 

c 

c CONVERGE NCE 

360 IFCW.GE.0.EO)GOTO 380 

StK)=-wW 


IFUNV.E0.0)G0TO 389 
DO 370 J=leN 
370 VO SeKDI-VOUeK) 
380 CONTINUE 
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ALGORITHM 359 
FACTORIAL ANALYSIS OF VARIANCE* [G1] 
Joun R. Howewy (Recd. 2 Aug. 1968 and 12 May 1969) 
Department of Biometry, Medical Center, Virginia Com- 
monwealth University, Richmond, VA 23219 
* This investigation was supported in part by Public Health 
Service Research Grant FR 00016-05, from the National 
Institutes of Health. 


KEY WORDS AND PHRASES: 
variance, statistical analysis 
CR CATEGORIES: 5.5 


factorial variance analysis, 


Comments. This subroutine transforms a vectory y, observed 
in a balanced complete t:XteX---Xtn factorial experiment, into 
an interaction vector z, whose elements include mean and main 
effects. : 

The experimental observations y,, (s = (81 , 82, °**, 8n); 8: = 0, 
1,--:, & — 1;7 = 1, 2, ---, n) are assumed to be stored in the 
array Y in increasing order by the composite base integer s. 
After the transformation, the array Z will contain the interactions 
in natural order. 

The method used is Good’s [1, 2] modification of Yates’s [5] in- 
teraction algorithm. In [1, p. 367], the interactions are expressed 
in the form z = (M; ® Mz @ -:- @Mn)y, where M; is a uxt 
matrix of normalized orthogonal contrasts and where @ denotes a 
direct (Kronecker, tensor) product. The interactions can also be 
written z = (CiC2 «++ Cn)y, where 


C; M, @11, @--: Ol, 
C. = It, @ M. @ ++: Olen 


Ca = Ley @ It, @-:- @ Mn 


and where J;; is the ¢;X¢; identity matrix. 
By performing elementary operations (row and column inter- 
changes) on the C; we get z = (D,D, --- Dn)y, where 


Ma ® ® Ma 
D; = Mp @ ® Miz 
Mi; ® @ Mi; 


and where M,; is row j of M; . The symbol © denotes a direct sum. 
For an example of this for an unnormalized matrix, see Good 
{1, p. 362]. 

Since each row of D; consists of a row of M;, and zeros, we only 
need M; for forming z. The subroutine forms first Dy, then this 
result is premultiplied by D»1, and so on until we obtain z. The 
elements of z are the required interactions. 

This method can be mechanized for hand computation in the 
following way. (The subroutine was written from this point of 

view.) Write the observations in the order specified above. Write 
row one of M, down the right edge of a strip of paper using the 
same spacing as for the observations. Now place this movable 
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strip alongside the observation vector so that the top element on 
the paper strip is opposite the top element of the observation 
vector. Multiply adjacent elements and write the sum of these 
products at the top of a new column. Now slide the paper strip 
down t, spaces. Form the indicated inner product as before and 
write the result in the new column below the previous entry. Con- 
tinue in this manner until all the observations have been used. 

Now write row two of M, on astrip of paper and proceed as before. 

If we continue this process with all the rows of Mn we will get a 

new vector zn whose elements are linear transformations of the 

observation vector y. The dimension of zn is the same as that of 

y. Similarly form zn_i from zn and M,_1 . Continuing this process 

we finally obtain z: = z which is the desired interaction vector. 

In all the foregoing we used the normalized contrast matrices; 
thus the sums of squares are the squares of the elements of z. For 
hand computation, one might prefer using the unnormalized con- 
trast matrices, since their elements are integers. But then we need 
a vector of divisors; it is obtained by performing the same opera- 
tions on a column of ones as on y, except that we use the squares 
of the elements of the contrast matrices. Then the ith sum of 
squares equals z;? divided by the corresponding divisor. 

This method might be called a “paper strip method”’ for analy- 
sis of variance and is similar to paper strip methods used for 
operations with polynomials. For examples of this, see Lanczos 
[3] and Prager [4]. 

We require 2tit, --- tn locations for storing y and z plus sup(t: , 
te , +++, tn) locations for storing a row of M; . The number of mul- 
tiplications required is ([]#:)(Qot: + 1). 

ACKNOWLEDGMENTS: The author wishes to thank Dr. A. E. 
Brandt for initiating his interest in programming analysis of 
variance. He wishes to thank Dr. W. H. Carter, Jr., and the 
referee, for helpful comments. 

REFERENCES: 
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analysis: An addendum. J. Roy. Statist. Soc. {B} 22, 2 (1960). 
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Ss E ; 

eceeetbueesiele SUBROUTINE AROW 
(ey TE EE EE EE 

*  (YoZeROwWeMSIZE sNCLSeNFCTR) * 


OIMENSION Y(1)9Z2(1)9 acnerta cours 
* ROW( 1) 6MSIZE(1) IF ROW ONE 
(a LOOP FOR NFCTR CONTRAST MATRICES IF (J=-1)30193 
00 5 NF = LeoNFCTR 1 A = NRNC 
z= 1 
c GET SIZE OF THE MATRIX 00 rN I 2 ene 
K = NFCTR#NF +l 2 ROWCT) = EL 
NRNC = MSIZE(K) ¢ AND 
po 3 J @ 1eNRNC RETURN 
¢ ROW OF A CONTRAST MATRIX C ELSE 
CALL AROW (ROW*NRNCOJ) 4 IML = Jel 
Cc PERFORM THE 'PAPER STRIP? RJ as 
¢ OPERATION FOR A MATRIX ROW - 
O09 2 K = JeNCLSeNRNC om : re ae a 
2(I) = O06 
DO 1 LF 1lsNRNC 4 eS RoWttse aa 
KL1 = K+L=1 00 5 To © JsNRNC 
1 ZT) ZED +ROW (LI MYCKLID 5 ROW(T)® O« 
2 I = [+l = - 
3 CONTINUE ReToRN ee 
Cc MOVE Z INTO Y END 
DO 4 J = lsNCLS 
4 Y(S) = 24) 
5 CONTINUE 
DO 6 J = 1leNCLS 
6 YOJ) = Y(sd#y (gd 
RETURN 
END 


REMARKS ON: 

ALGORITHM 332 [S22] 

JACOBI POLYNOMIALS [Bruno F. W. Witte, Comm. 
ACM 11 (June 1968), 436] 

ALGORITHM 344 [S14] 

STUDENT'S t-DISTRIBUTION [David A. Levine, 
Comm. ACM 12 (Jan. 1969), 37] 

ALGORITHM 351 [D1] 

MODIFIED ROMBERG QUADRATURE [Graeme 
Fairweather, Comm. 12 (June 1969), 324] 

ALGORITHM 359 [G1] 

FACTORIAL ANALYSIS OF VARIANCE [John R. 
Howell, Comm. ACM 12 (Nov. 1969), 631] 


ArtHuR H. J. SALE (Recd. 16 Feb. 1970) 
Basser Computing Department, University of Sydney, 
Sydney, Australia 


KEY WORDS AND PHRASES: Fortran standards 
CR CATEGORIES: 4.0, 4.22 


An unfortunate precedent has been set in several recent al- 
gorithms of using an illegal FORTRAN construction. This con- 
sists of separating an initial line from its continuation line by a 
comment line, and is forbidden by the standard (see sections 3.2.1, 
3.2.3 and 3.2.4 of [1, 2]). The offending algorithms are to date: 
332, 344, 351 and 359. 

While this is perhaps a debatable decision by the compilers of 
the standard, and trivial to correct, it seems a pity to break the 
rules just for a pretty layout as has been done. 

REFERENCES: 

1. ANSI Standard FORTRAN (ANSI X3.9-1966), American 
National Standards Institute, New York, 1966. 

2. FORTRAN vs. Basic FORTRAN, Comm. ACM 7 (Oct. 1964), 
591-625. 
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ALGORITHM 360 

SHORTEST-PATH FOREST WITH TOPOLOGICAL 

ORDERING [H] 

Roser B. Dia (Recd. 21 Nov. 1968, 27 Nov. 1968 and 
30 Apr. 1969) 

Alan M. Voorhees and Associates, Inc., McLean, VA 22101, 
and Department of Civil Engineering, University of 
Washington, Seattle, WA 98105 


KEY WORDS AND PHRASES: shortest path, tree, network, 
directed graph 
CR CATEGORIES: 5.32, 5.42 


procedure MOORE (INDEX, J, D, maxi, n, DIST, I, NEXT, 

LAST, mazdist, ROOT, m); 

value mazd, n, maxdist, m; 

integer array INDEX, J, D, DIST, I, NEXT, LAST, ROOT; 

integer mazd, n, maxdist, m; 

comment Given a subset (called ‘“‘roots’’) of the nodes (num- 
bered from 1 to 7) spanned by a directed graph composed of 
arcs of known length, MOORE finds for each node in the network 
the shortest path connecting it to its closest root node. The 
result is a disjoint set of shortest-path trees, referred to here as 

a “shortest-path forest.’”” MOORE’s output describes all the 

paths in the forest and gives their lengths. It also provides two 

lists which sequence the nodes spanned by the forest in forward 
and backward topological order. In the algorithm’s terminology, 

“forward topological order’ is a sequence in which any given 

node is listed after any other node which lies on the path be- 

tween it and its root node. Conversely, the ‘“‘backward topo- 
logical order’’ has the nodes arranged in decreasing distance 
from their nearest root node. 

The procedure below implements a well-known, widely-used 
algorithm by E. F. Moore [1] and is particularly suited for a 
large, sparse network whose are lengths are short and which 
have a small variance, e.g. an urban highway system. As an 
indication of its efficiency, an Assembly Language routine pat- 
terned after MOORE for the IBM 360 model 65 found all short- 
est paths from a single root node to the remaining 12,000 nodes 
of a 36,000-are network (i.e. built a minimum-path tree) in one 
(1) second. In general, for a connected graph, MOORE’s “‘run- 
ning time’’ is directly proportional to the number of arcs in the 
network and is independent of the number of roots. The me- 
chanics of the algorithm are summarized in the following 
three steps: 

0. Mark each root node r ‘“‘reached but not scanned’’ and asso- 
ciate with it a distance of zero (DIST[r]=0). Mark each 
nonroot node? ‘‘not reached”’ and associate with it a distance 
of infinity (i.e. DIST[t]=mazdist). Go to Step 1. 

1. From among the nodes marked ‘‘reached but not scanned,’’ 
select the node 7 whose distance is smallest. If there is no 
node so marked, the forest is complete. Otherwise go to Step 
2. 
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2. For each are (7, j) in the network (i.e. all arcs exiting the 
selected node 7), compare DIST{[j] with thesum of DIST{:] 
and the arc length of (z, 7). Whenever this latter sum is less 
than the former quantity, set. DIST[j] equal to it, mark 
node j “‘reached but not scanned,”’ and put the arc (7, j) in 
the forest, removing any other arc whose final node is j. 
When all arcs exiting node 7 have been so examined mark 
node ¢ “‘reached and scanned’”’ and go to Step 1. 

While Moore’s algorithm possesses the important attribute of 
examining each arc in the network only once, the speed achieved 
in its implementation depends primarily on its efficiency in 
Step 1. To facilitate this node selection, the procedure below 
uses a topological ordering of the final nodes of the arcs in the 
partial forest. It effects Step 1 by referring to a forward-order- 
ing list, NEXT, to determine which node should be selected 
next from the ‘‘reached but not scanned” category. A backward- 
ordering list, LAST, aids updating the ordering when a previ- 
ously found path to a node is superseded by a newly found, 
shorter one. Also used in this updating process are two short 
local vectors, HEAD and TAIL. HEAD{d] and TAIL{d] contain 
the first and last node of a sublist of nodes, whose associated dis- 
tance is not less than the distance of the node selected in Step 1 
and is congruent to d modulo the net’s maximum arc length. 
The use of these latter two arrays becomes clear while studying 
the Auaot below. 

Besides the m root nodes stored in ROOT[1], :-- , ROOT {m], in- 
put to MOORE consists of a network description in three vectors, 
J, D, and INDEX, together with the scalar parameters n, maxd, 
and mazdist. The array J contains the final node numbers of all 
arcs in the network stored in ascending sequence with respect 
to their initial node number. The second vector, D, is parallel 
to the array J and holds the corresponding arc lengths—against 
which paths are to be minimized. INDEX{?] points to the first 
element of J representing an arc exiting node 7. INDEX is di- 
mensioned from 1 ton + 1, where the parameter n is the highest 
node number in the network, and INDEX[n+1] contains one 
plus the total number of arcs in the network. The arc lengths 
stored in the array D must be positive integers strictly less than 
the parameter mazd. Similarly, as mazd exclusively limits the 
length of an arc, so does the other input scalar parameter 
mazdist limit the length of a path. MOORE only considers paths 
which are shorter than mazdist. 

The algorithm’s output describes the minimum-path forest 
in two vectors, IJ and DIST. I{j] contains the initial node of the 
forest’s unique are whose final node is j. Thus the sequence of 
nodes representing the shortest path from the nearest root 
to j is found in reverse order by looking at I{j], J[J[y]], etc., 
until a root node is encountered. DIST[j] returns the minimized 
distance from the closest root node to j. If j is not reachable 
from any root node via a path shorter than mazdist, MOORE 
returns with DIST{j] = mazdist and I[{j] = 0. The forest’s topo- 
logical orderings are returned in list form in the pointer vectors 
NEXT and LAST. NEXT is a circular successor list. The number 
of the node closest to its root node is stored in NEX T[ROOT(1)]. 
The next closest node is contained in NEXT[NEXT(ROOT(I]}}], 
etc., until ROOT(1] is encountered in some NEXT{j1, where j is 
the number of the node farthest from its root node. Similarly, 
LAST is a circular predecessor list. The backward topological 
order is obtained by starting at LAST[ROOT(1]], which contains 
the number of the most distant node. LAST[LAST [ROOT (1})] 
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has the next most distant, etc., until LAST[j] = ROOT{[I], 7 

being the closest node to its root. When no path shorter than 

maxdist exists between a root node and j, then 7 appears in 
neither the NEXT nor the LAST list. 
REFERENCE: 

1. Moornz, E.F. Theshortest path through a maze. In Inter- 
national Symposium on the Theory of Switching Proceedings. 
Harvard U. Press, Cambridge, Mass., Apr. 1957, pp. 285-292; 

begin 

integer procedure mod(d, maxd); value d, mazxd; integer 
d, maxd; mod := d — mazd X entier(d+mazd); 

integer array HEAD[0:mazd—1], TAIL{(O:mazd—1]; integer 
1, pt, kh, v, J, q, et; 

for 7 := 1 step 1 until mazd—1 do HEAD(i] := TAIL{t] := 0; 

for 7 := 1 step 1 until n do 

begin DIST |t] := mazdist; I[:] := 0 end; 

for 7 := 2 step 1 until m do 

begin 
ene := ROOT{(i]; LAST[ROOT(i]| := ROOT 

ta); 
DIST{ROOT{i]] := 0 


end; 

LAST|ROOT(1]] := NEXT(ROOT(m]] := DIST[ROOT{I]] := 
pt := 0; 

t:= HEAD(0] := ROOT{1]; TAIL(0] := ROOT(m); 


comment Examine all exits from selected node (Step 2 above); 
for k := INDEX{i] step 1 until INDEX[i+1] — 1 do 
begin 
v := DIST(t] + Dik]; 7 := J[k]; 
if » < DIST{[j] then 
begin 
comment Path to j via zis shortest so far—put are (i, 7) 
in forest; 
if DIST[j] ¥ macdist then 
begin 
comment Delete node j from its prior sublist; 
q := mod(DIST([j], mazxd); 
if HEAD(q] = j then HEAD{q] := NEXT{j] 
else 
begin 
if TAIL{q] = j then 
begin TAIL(q] := LAST([j]; NEXT{LAST{j]] := 0 


end 
else 
begin LAST(NEXT{j]] := LAST{(j]; NEXT(|LAST 
[j]] :== NEXT{j] end - 
end 
end; 
comment Hook j to its new sublist, and put are (i, 7) in 
forest; 


q := mod(v, maxd); 
if HEAD{q] = 0 then 
begin HEAD(q] := j; LAST{[j] := 0 end 
else 
begin LAST(j] := TAIL{q]; NEXT(TAIL{q]] := j end; 
comment Update forest and forward ordering; 
Ij] := 1; DIST(j]:= 0; TAIL{q] := 37; NEXT{j] :=0 
end 
end; 
comment Select next node whose exit arcs are to be examined 
(Step 1 above); 
if NEXT{i] ¥ 0 then 
begin 
comment Sublist containing 7 not empty—use successor of 
t1;1 := NEXT(i]; gotor 
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end; 
comment Sublist containing 1 empty—use first node in next 
nonempty sublist; 


HEAD{pt] := 0; 
for ct := 1 step 1 until mazd — 1 do 
begin 


pt := mod(pi+1, maxd); 
if HEAD[pt] ~ 0 then 
begin 
comment Found a nonempty sublist—hook it to lists; 
LAST(HEAD[pt]] := 7; « := NEXT(t] := HEAD[pi); 
gotor 
end; 
end; 
comment All sublists empty, forest built—circularize lists 
and quit; 
LAST{ROOT(\]] := 71; NEXT{t] := ROOT(I] 


end MOORE 
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ALGORITHM 361 

PERMANENT FUNCTION OF A SQUARE 

MATRIX I AND II [G6] 

Bruce Suriver, P. J. EBERLEIN, AND R. D. Drxon (Reed. 
19 Feb. 1969, 7 Mar. 1969 and 9 July 1969) 

State University of New York at Buffalo, Amherst, NY 
14226 


KEY WORDS AND PHRASES: matrix, permanent, determi- 
nant 
CR CATEGORIES: 5.30 


real procedure per1(A, n); 
integer n; array A; 

comment Let A be ann X n real matrix, n > 1. The perma- 
nent function of A, denoted per(A), is computed by H. J. 
Ryser’s [1] expansion formula: 


n—l r 
per(A) = 2) (-1 DY [I x 
r=0 x€T,_, i=l 
where Tj, 7 = n,n — 1, --- , 2,1, is theset of vectorsx = (a;), 
t = 1, 2, -+- , n which are obtained by adding j columns of A 
together in a A possible ways. To effect the sum over vectors 


in T;, n — 1 sums are computed. The natural 1-1 map from the 


binary integers to all r-combinations, r = 1, 2, +-- , n — 1, is 
used to increment the sums over the sets 7. 
REFERENCE: 


1. Ryser, H. J. Combinatorial Mathematics, Carus Monograph 

#14. Wiley, New York, 1963, p. 27; 
begin 

real sig, pera, vrod, rowsum; 

integer number, limit, mod, gen, g, 1,9, 73 

array sum(0:n— 1); 

integer array d[1:n]; 

stig := —1; pera:=0; limit := (2Tn) —1; 

for r := 0 step 1 until 2 — 1 do sumir] := 0; 

for number := 1 step 1 until limit do 


begin 
r:=0; gen := number; 
for mod := 1 step 1 until n do 
begin 


g := gen + 2; if (gen—gX2) = 1 then 
begin 7 := r +1; a[r] := mod end; 
gen := 9 
end; 
prod := 1; 
for 7 := 1 step 1 until n do 
begin 
rowsum := 0; 
for j := | step 1 until r do 
rowsum := rowsum + Alt, d[j]]; 
prod := prod X rowsum 


end; 
sum|n—r] := sum[n—r] + prod 
end; 
for 7 := 0 step 1 until n — 1 do 
begin sig := —sig; pira := pera + sig X sum[r] end; 
per := pera 


end of real procedure per1; 
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real procedure per2(A, n); 


integer n; array A; 


comment Let A be ann X nreal matrix, n > 1. The permanent 


function of A, denoted by per(A) is computed by Jurkat and 
Ryser’s [1] method of inductively generating the vectors 
Pi, °** , Pn Where p, is the vector of permanents of r by r sub- 


matrices of the first r rows of A. This vector has i components 


indexed by the r-combinations of {1, --- , n}. The natural 1-1 
map from the binary integers {1,--- ,2fm—1} to the r-com- 
binations of {1, --- , n} forr = 1, -++ , n is used to index the 


p’s and thus they are generated in an order somewhat different 

from that of Jurkat and Ryser. 
REFERENCE: 

1, Jurkat, W. B. anp Ryser, H. J. Matrix factorizations of 
determinants. and permanents. J. Algebra $ (1966), 1-27; 

begin 

integer number, limit, mod, gen, g, r, dig, sub, 7; 

array list [1:2 f n—1]; 

limit := 2fn — 1; 

comment Initialize list as accumulators; 

for j := 1 step 1 until limit do list [j] := 0; 

for 7 := 1 step 1 until n do list [2 ft G—1)] := ALL, J]; 


for number := 1 step 1 until limit do 
begin 
if list [pumber] # 0 then 
begin 
r:= 1; gen := number; 
for mod := 1 step 1 until n do 
begin 
g := gen + 2; 
if gn—- 2X g =1thenr:=7+1; 
gen := 9 
end count of 1’s in number; 
dig := 1; gen :=- number; 
for mod := 1 step 1 until n do 
begin 
g := gen + 2; 
if gen — 2 X g = 0 then 
begin 


sub := number + dig; 
list [sub] := list [sub] + list [number] A [r, mod] 
end; 
gen := 9; dig:= 2X dig 
end computations with list [number]; 
end 
end; 
per := list [limit] 
end of real procedure per2; 


Note. On the Permanent Function of a Square Matrix I and II: 
Program I is slower than Program II. However Program II uses 
approximately 2” more locations of store. The running times for 
both programs double when n is incremented by 1. 
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REMARK ON ALGORITHM 361 [G6] 

PERMANENT FUNCTION OF A SQUARE MATRIX 
I AND II [Bruce Shriver, P. J. Eberlein, and R. D. 
Dixon, Comm. ACM 12 (Nov. 1969), 634] 

Bruce Suriver, P. J. EBERLEIN, AND R. D. Drxon 
(Recd. 22 Jan. 1970) 

State University of New York at Buffalo, Amherst, NY 
14226 


KEY WORDS AND PHRASES: matrix, permanent, determi- 
nant 
CR CATEGORIES: 5.30 


The authors would like to cite the following misprints in the 
above two algorithms: 
(A) In procedure per1(A, n) 
(1) in line 48, the variable name pira should be pera 
(2) in line 44, the variable name per should be perl. 
(B) In procedure per2(A, n) 
(1) in line 47, the variable name per should be per2. 
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ALGORITHM 362 

GENERATION OF RANDOM PERMUTATIONS [G6] 

J. M. Rosson (Reed. 1 Apr. 1969) 

Programming Research Group, 45 Banbury Road, Oxford, 
England 

KEY WORDS AND PHRASES: permutation, random permu- 


tation, transposition 
CR CATEGORIES: 5.5 


procedure perm(n, r, A); value n,r; integer n,7r; integer 


array A; 
comment This procedure produces in the vector A a permuta- 
tion on the integers 1, 2, --- , n, each of the nm! permutations 


being given by one value of r between 1 and n! inclusive. It is 
thus similar in effect to the procedure given in [1] but it is con- 
siderably faster, especially for large values of n, since it uses a 
single loop rather than a double one. 

A permutation is generated as the product of n — 1 transpo- 
sitions of which the jth transposes A[n+1—j] and Al[z] for 
somex Sn+1-—j). 

If the line 
for 7 := 1 step 1 until n do A[i] := 7 

is omitted the procedure will permute the original values 
A[i], --- , Alm] in the same manner. 
REFERENCE: 
1. Rosinson, C. L. Algorithm 317, Permutation. Comm. ACM 10 
(Nov. 1967), 729; 


begin 
integer 7, x, ¥; 
for 1 := 1 step 1 until n do A[i] := 7; 
for 7 := n step —1 until 2 do 
begin 


ei=r— (rtt)Xt+1;5 rie rei; 
y:= Alz]; Alz) := Alt]; Alt] :=y 
end 
end 
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ALGORITHM 363 

COMPLEX ERROR FUNCTION?® [815] 

WaLTEeR GAUTSCHI (Recd. 11 June 1969) 

Computer Sciences Department, Purdue University, La- 
fayette, IN 47907 
* Work supported, in part, by the National Aeronautics and 
Space Administration (NASA) under grant NGR_ 15-005-039 
and, in part, by Argonne National Laboratory. 


KEY WORDS AND PHRASES: error function for complex 
argument, Voigt function, Laplace continued fraction, Gauss- 
Hermite quadrature, recursive computation 

CR CATEGORIES: 5.12 


procedure wofz(xz, y, re,im); value z,y; real x, y, re, im; 
comment This procedure evaluates the real and imaginary 
part of the function w(z) = exp(—z?)erfe(—iz) for arguments 
= x + zy in the first quadrant of the complex plane. The accu 
racy is 10 decimal places after the decimal point, or better. 
For the underlying analysis, see W. Gautschi, ‘Efficient com- 
putation of the complex error function,” to appear in SIAM 
J. Math. Anal.; 
begin 
integer capn, nu, n, npl; 
real h, h2, lambda, rl, r2, s, 81, s2, tl, (2, c; 
Boolean b; 
ify < 4.29 Az < 5.33 then 
begin 
s:= (l—y/4.29) X sqrt(l—z X 2/28.41); 
h:=16Xs; h2:=2Xh; 
capn :=6+23X s; nu:=9+21X s 
end 
else 
beginh := 0; capn:=0; nu := 8 end; 
if h > 0 then lambda := h2 7 capn; 
b:=h=0V lambda = 0; 
rl := 72 := sl := s2:= 0; 
for n := nu step —1 until 0 do 


n+1; 
<= yth+nplXrl; (2 := 2 — npl X 12; 
e:= .5/(t1 & tl + (2 X 12); 
rl:=cX tl; r2:= eX 12; 
ifh >0 An S capnthen 
begin 
tl := lambda + sl; sl := rl X #1 — 72 X 82; 
s2:=7r2X i+rl1 X s2; 
lambda := lambda/h2 


end 
end; 


re := ify = 0 then exp(—2zXz) else 
1.12837916709551 X (if b then 71 else s1); 
im := 1.12837916709551 (if b then r2 else s2) 
end wofz 
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Certification of Algorithm 363 [S15] 
Complex Error Function [Walter Gautschi, Comm. 
ACM 12 (Nov. 1969), 635] 


K.S. Kolbig* (Recd. 8 Oct. 1970) 

Data Handling Division, European Organization for 
Nuclear Research (CERN), 1211 Geneva 23, 
Switzerland. 


Key Words and Phrases: error function for complex argument, 
Voigt function, special functions, function evaluation 
CR Categories: 5.12 


As a result of an exchange of letters with W. Gautschi it became 
apparent that the following alterations simplify somewhat the pro- 
cedure wofz: 

(i) insert the statement 
lambda := h2 1 capn; 


between the statements 
capn:=6+23Xs; m:=9+21X%s 
(ii) delete the statement 

if > Othen lambda := h2 {1 capn; 


Furthermore, for clarification, a cornment could be inserted before 
the statement 6 := 4 = 0 \/ lambda = 0; namely 


comment In the following statement, /ambda = 0 covers the under- 
flow case when / > 0 is very small; 


After these slight modifications, the procedure wo/z was translated 
into Fortran and extended to the whole z-plane (z= x+y) by means 
of [1, No. 7.1.11, 12] 


w(—z) = 2e” — w(z), w(z) = w(—Z). 


It was then tested on a CDC 6500 computer at CERN. The tests 
included the following: 

(i) Calculation of the seven examples No. 12-18 for w(z), erf(z), 
and the Fresnel integral S\(z) given in [{1, No. 7.5]. At least 11 
significant digits agreed with the values obtained by 


w(z) = e-* [l—erf(—iz)] = e7erfe(—iz). (1) 


The error function erf(z) for complex z in (1) was calculated using 
Salzer's formula, which is reproduced in the NBS Handbook [1, 
No. 7.1.29]. This formula requires the computation of erf(x) for real 
x, which was done with the help of a library program based on the 
approximation given by Cody [2]. (Note that the correct value 
of ImS,[C4+) V2] in example 18 is —0.681620 instead of 
— 0.681619.) 

(it) Calculation of w(z) for 


z = 4.29 + 10°-Yp + i(5.334+10-%Q) 


with p,g = —1, 0, 1. These values of z lie near the line which sepa- 
rates two branches in the procedure wo/z. Eight to nine significant 
digits, corresponding to nine to ten figures after the decimal point, 
agreed with the values obtained from (1). 
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(iii) Calculation of w(z) along the diagonal z = (1+diu for u = 
— 27(1)100, 1000, 10000. For u < 10, the formula {1, No. 7.9] 


wid + Du] = 


aig ' + @-1) E e) — all 


was used for comparison. The Fresnel integrals C(x) and S(x) were 
computed with a library program based on the Algol procedure 
Fresnel written by Bulirsch {3]. Twelve to fourteen significant digits 
agreed. For 1 > 10, the results of wo/z were checked against the 
asymptotic expansion [1, No. 7.1.23] 


i 2 1.3+++ (2n — 1) 
wie) ~ Z(1 +2 a 


WZ n=1 
z= Beg ies 
oO, — - rg(z) — |}, 
aaa 4 


(2) 


(3) 


Thirteen to fourteen significant digits agreed. 
(iv) Calculation of w(z) along the imaginary axis x = 0 for y = 
— 27(1)100, 1000, 10000. For y < 25, the formula 


w(iy) = ev erfe(y) (4) 


was used for comparison. The complementary error function erfc(x) 
was computed by means of a library program based on [2]. For 
~—27 < y < —2and for 10 < y < 25, 12 to 14 significant digits 
agreed, whereas for —2 < y.< 10, ten to thirteen significant digits 
were found to be in agreement. For y > 25, the results were checked 
against the asymptotic expansion (3). Thirteen to fourteen signifi- 
cant digits agreed. 

(v) Calculation of w(x) — en" along the real axis y = O for 
x = 0(1)100, 1000, 10000 using the formula [1, No.-7.9] 


wx) — eo? = ee er a e dt = = F(x) (5) 


for comparison. The Dawson integral F(x) was computed with the 
help of the rational approximations given by Cody et al. [4]. For 
x < 7, 10 to 12 significant digits agreed, whereas for x > 7, 13 to 
14 significant digits were found to be in agreement. 

(vi) Calculation of w(z) forz = (1+i/./3)uandz = (1+/ /3)u for 


u = 10*,k = —10(1)4. Fork < 0, the results were compared with 
the values obtained from the power series 
2, (iz)" 
we) = 
n=0 


"5 + ') (6) 


Ten significant digits agreed. For k > 0, 13 to 14 significant digits 
agreed with the values obtained from the asymptotic expansion (3). 
(vii) Calculation of w(z) forz = x + 10-8 for x = 1(1)100, 1000, 
10000. For x <_ 5, the results were compared with the values ob- 
tained from formula (1). Six to eight significant digits, correspond- 
ing to at least nine to ten decimals, agreed for the real part. However, 
the accuracy of wofz may be higher, since the values from formula 
(1) are possibly inaccurate. The imaginary part agreed to ten to 
twelve significant digits. For x > 5, the asymptotic expansion (3) 
was used for comparison. For 6 < x < 8, ten to twelve significant 
digits, and for x > 8, thirteen to fourteen significant digits agreed 
in both the real and imaginary part. For x = 5, it was not possible 


to calculate accurate values for the real part of w(z) either by means 


of formula (1) or from the asymptotic expansion (3). 


References 

1. Gautschi, W. Error function and Fresnel integrals. Chap. 7 in 
Handbook of Mathematical Functions, M. Abramowitz and J.A. 
Stegun, Eds. NBS Appl. Math. Ser. 55, U.S. Govt. Printing Office, 
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ALGORITHM 364 

COLORING POLYGONAL REGIONS [Z] 

Ropert G. Herriot (Recd. 30 Jan. 1967, 31 Oct. 1968 
and 2 July 1969) 

University of Wisconsin, Computer Science Department, 
Madison, WI 53706 


KEY WORDS AND PHRASES: coloring polygonal regions, 
coloring planar surfaces, drawing pictures, shading enclosed 
regions 

CR CATEGORIES: 4.9 


procedure drawarea (x, y, firstpoint, lastpoint, section, numrows, 
numseats, regcolor, paintflag, paintcolor, sgn, dir, edge); 


value firstpoint, lastpoint, mnumrows, numseats, regcolor, 
paintflag, paintcolor, sgn, dir, edge; 
integer firstpoint, lastpoint, numrows, numseats, regcolor, 


painicolor, sgn; 

real edge; 
Boolean patntflag, dir; 
real array 2, y; 
integer array section; 

comment This procedure is a part of a large program which 
produces the card stunts for the Stanford University football 
game half-times. The initial development was done by L. Breed, 
L. Tesler, and J. Sauter. The author (a Stanford student at the 
time) made many further developments on this program which 
included producing an algorithm for coloring in polygonal re- 
gions. Prior to the development of this algorithm, there were 
many cases which did not work. The larger program takes as 
input an English description of the stunts and produces as out- 
put an image of each flip (similar to a frame in a movie film), 
as a rectangle that has 45 rows with 77 seats in each row. The 
main program, which will be considered the driver program for 
the purpose of the procedure drawarea, does all of the handling 
of the definition of regions and also the printing of the images. 
It should be mentioned that the procedure drawarea in the actual 
program is just part of a larger procedure and that all of the 
parameters are global in order to increase efficiency. The pur- 
pose of drawarea is to take the current regions and draw them in 
the two-dimensional array section, which is to be declared as 
section [1: numrows, 1: numseats] (the array is 45 by 77 for Stan- 
ford). Each completed picture in section is then printed and also 
written out on tape. Another program later takes this tape and 
processes it to produce an instruction card for each student 
holding a set of colored cards in the rooters section. 

The larger program allows objects of any shape to be defined 
by a series of x, y-coordinates. It will accept a series of points 
which are given an identifying name by the user and which can 
then be used as (1) a group of points, (2) a series of connected 
line segments, (3) a polygonal region enclosed by the points 
(with the first and last point connected by a straight line). It 
also allows ellipses to be defined. Once an object is defined, it 
can be expanded and contracted in size, rotated about any fixed 
point, or moved anywhere, including all or partially out of 
sight. As soon as all objects are in place, the user can ask that an 
image of the picture be made. Except for polygonal regions, 
producing the image of these objects is trivial. The procedure 
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drawarea is the routine which places the polygonal regions in the 
array section. 

The array section is presumed to have a background color 
associated with it. All objects, which also have an associated 
color, are then drawn into the array in a specified order so that 
the objects which are to be superimposed over other objects are 
drawn last. The procedure drawarea takes the coordinates of 
the point (which may not be integral) from arrays x and y with 
subscript values ranging from firstpoint to lastpoini and decides 
which seats in array section will form the left and right bound- 
aries of this new region. After the boundary is determined, the 
interior must be colored in. The algorithm colors the region by 
taking each row and then examining each seat from left to right. 
For optimization, only the area of a miniroal circumscribing 
rectangle is examined. At the beginning of each row the variable 
count is set to leftcount (row, 0]—righicount [row, 0], which will be 
zero unless the object is partially out of sight on the left. Then 
as long as cownt remains zero, the seat is on the exterior and is 
not colored. As each seat is encountered, leficount [row, seat] 
is added to count. When count is positive, the seat is in the in- 
terior or on a boundary and is colored. After each seat is proc- 
essed, rightcount [row, seat] is subtracted from count. When 
count returns to zero, the seat is an exterior seat and is not col- 
ored. In any row it is possible to have the color turned on and 
off several times. Arrays leftcount and rightcount contain twice 
the number of left and right boundaries which pass through each 
individual seat. These two arrays solve the problem created by 
having several boundaries passing through one seat. 

A further complication to the routine is added by allowing a 
region to be gradually changing color. Thus each region always 
has a color (regcolor) associatecl with it, and if the region is 
being swept with a new color, then painiflag is true and paint- 
color, sgn, dir, and edge are used to determine the section of 
the region which is to be of the new color (paintcolor). The roles 
of the parameters for painting are: sgn and dir indicate the direc- 
tion in which the imaginary paintbrush is moving. dir = true 
means the direction is horizontal and dir = false means ver- 
tical. sgn = —1 means the direction is left or down and sgn = 1 
means the direction is right or up. edge is the row or seat (col- 
umn) where the new color (paintcolor) ends and the old color 
(regcolor) begins. The driver program is expected to change 
edge with each new image so that the region looks as if it is 
being swept by a new color. 

A related algorithm which determines whether a point is 
inside a polygon is presented in Algorithm 112 [1, 2]. 

REFERENCES: 
1. Hacker, Ricwarp. Certification of Algorithm 112, Position 
of point relative to polygon. Comm. ACM 6 (Dec. 1962), 606. 
2. Suimrat, M. Algorithm 112, Position of point relative to 
polygon. Comm. ACM 6 (Aug. 1962), 434; 
begin 
integer row, seat, toprow, rightseat, rit, lef, top, bot, tox, toy, 
inz, iny, sdx, sdy, j, tx, ty, count; 
real oz, oy, nx, ny, dz, dy, dudy, const; 
integer array leftcount, righicount [0: numrows+1, 

0: numseats+1]; 
integer procedure maz(z, y); 

mas := if zs > y then z else 4; 
integer procedure min(x, y); 

min := if x < y then « else y; 


value 2x, y; integer x, y; 


value xz, y; integer x, y; 


COLLECTED ALGORITHMS (cont.) 364-P 2- 0 


toprow += numrows + 1; lef := mazx(1, lef); rit := min(rit, numseats); 
rightseat := numseats + 1; bot := mazx(1, bot); top := min(top, numrows); 
for row := 0 step 1 until toprow do comment Color the area. It is only necessary to look within 
for seat := 0 step 1 until righiseat do the circumscribing rectangle; 
leftcount {row, seat] := rightcount [row, seat] := 0; for row := bot step 1 until top do 
ox := xllastpoint]; rit := left := tox := oz; begin 
oy := yllastpoint]; top := bot := toy := oy; count := leficount [row, 0] — rightcount [row, 0]; 
comment Draw the boundary by iterating through the points; for seat := lef step 1 until rit do 
for j := firstpoint step 1 until lastpoint do begin 
begin Bee count := count + leficount [row, seat]; 
ees alg]; a ec a if count > 0 then 
ny := ylj]; tiny = ny; section [row, seat] := if paintflag then 
dix i= nu — on; (if sgnX((if dir then seat else row)—edge) >0 
dy := ny — oy; then 
sdx := if dx < 0 then —1 else 1; regcolor 
sdy := if dy < 0 then —1 else 1; else paintcolor) 
if 1oy = iny then else regcolor ; 
poem Woy a : count := count — righicount [row, seat]; 
comment The line is horizontal, or almost so; end coloring of une seat; 
comment min and maz keep the point in the section; end coloring of one row; 
row := max(min(ioy, toprow), 0); end draisaren: : 
seat := max(min(max(iox, inx), righiseat), 0); ; 
rightcount [row, seat] := rightcount [row, seat] + 1; 
seat := max(min(min(tox, tnx), rightseat), 0); 
leftcount [row, seat] := leftcount [row, seat] + 1; 
end horizontal line 
else 
begin 
comment The line is not horizontal; 
dady := dx/dy; 


const := if abs(dx) < abs(dy) 
then ox — dxdy X oy 
else or — dady X (oy — sdx/2) — sdy/2; 
comment Draw line between two points by stepping 
through each row and determining which seat should be 
marked as the boundary; 
for iy := ioy step sdy until iny do 


begin 
tx := dxdy X ty + const; 
row := max(min(iy, toprow), 0); 
seat := max(min(ix, rightseat), 0); 


comment Because end points are each processed twice, 
we add only 1 to them instead of the usual 2; 
if dy > 0 then 


begin 
comment Boundary on right side of area; 
rightcount[row,seat] := rightcount|row,seat] 
+ (if ty=ioy\Viy=iny then 1 else 2) 
end 
else 
begin 


comment Boundary on left side of area; 
leftcountlrow,seat] := leftcount[row,seat] 
+ (if iy=toy\/iy=iny then 1 else 2) 
end 
end drawing of line; 
end sloping line; 
comment Move on to next line segment; 
0x := NX; 10% := 0%} 
Oy := NY; toy := oy; 
comment Find rectangle which circumscribes the area; 
if rit < tox then rit := tox 
else if lef > ioz then lef := t0z; 
if top < ioy then top := toy 
else if bot > toy then bot := toy; 
end bordering area; 
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ALGORITHM 365 

COMPLEX ROOT FINDING [C5] 

H. Bacu (Reed. 18 Apr. 1968 and 15 July 1969): 

Laboratory of Electromagnetic: Theory, Technical Uni- 
versity of Denmark, Lyngby, Denmark 


KEY WORDS AND PHRASES: downhill method, complex 
relaxation method, complex iteration, complex equation, 
transcendental complex equation, algebraic complex equation 

CR CATEGORIES: 5.15 


Comment. The present subroutine determines, within a cer- 
tain region, a root of a complex transcendental equation f(z) = 0, 
on which the only restriction is that the function w = f(z) must be 
analytic in the region considered. The iterative method used, 
the downhill method, was originally described in [2] and is dis- 
cussed and modified in [1]. 

The program uses a complex function subprogram FUNC(Z) 
for the computation of f(z). From a given complex starting point 
ZS, the iteration is performed in steps of initial length HS. The 
iterations stop at the root approximation ZE when either the 
function value DE at the end point is less than the prescribed 
minimum deviation DM or when the step length HE has become 
less than the prescribed minimum step length HM. For reference, 
the subroutine also returns DS, the function value at the starting 
point ZS, and N, the number of iterations used. There are thus 
four input parameters, namely the starting point ZS, the initial 
step length HS, the minimum step length HM, and the minimum 
deviation DM. 

ACKNOWLEDGMENT. Thanks are due to Mr. Frank Jensen, 
M.S8c., who helped in the testing of this algorithm. 

REFERENCES: 
1. Bacu, H. On the downhill method. Comm. ACM 12 (Dec. 
1969) 675-677. 
2. Warp, J. A. The downhill method of solving f(z) = 0. J. 
ACM 4 (Mar. 1957), 148-150. 


SUBROUTINE CRF (ZS sHSsHMsDMsFUNC sDSeZE sHEsDE oN) 


THE SUBROUTINE DETERMINES A ROOT OF A TRANSCEN- 
DENTAL COMPLEX EQUATION F(Z)=0 BY STEP=WISE ITE- 
RATIONe (THE DOWN HILL METHOD) 


TNPUT~PARAMETERS. 


zs START VALUE OF Ze (COMPLEX) 
HS LENGTH OF STEP AT START. 


=x 
HM = MINIMUM LENGTH OF STEP, 
DM = MINIMUM DEVIATION, 


SUBPROGR AMe 


FUNC(Z)s A COMPLEX FUNCTION SUBPROGRAM FOR THE 
CALCULATION OF THE VALUE OF F(Z} FOR A COMPLEX 
ARGUMENT Ze 


OUTPUT=P ARAME TERS. 


NANDA ANDNAIANAND ANNDNAANAANAN 


DS = CABS(FUNC(ZS))=DEVIATION AT START. 
ZE = END VALUE OF Ze (COMPLEX) 

HE = LENGTH OF STEP AT ENDe 

DE = CABS(FUNC(ZE))=DEVIATION AT ENDe 
N = NUMBER OF ITERATIONSe 


NANANAD 


NAN 


AND 


AaAN 


NAN nan 


Aan AaAN 


ANA 
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RESTRICT IONSe 


THE FUNCTION W=F(Z)} MUST BE ANALYTICAL IN THE 
REGION WHERE ROOTS ARE SOUGHT. 


REAL W(3) 

COMPLEX 209ZS9ZEoZD0Z2 9213) sCWeAsVoUl 7) sFUNC 
UCT ellesO0e)} 

U2) = (0 286602549025000090} 
U03) = (02000000041 20000090) 
U4) # (009659258 2062588190) 
U5) 2 (067071068 109.7071068) 
U(6)=(06258819090.9659258) 
UT) ® (-062588190 029659258) 
H=HS 

Z0=Z2S 

N=0 


CALCULATION OF DSe 


1 


2 


CW=FUNC (Z0) 
WOZABS(REAL (CW) )+ABS(AIMAG(CW)) 
NS=wo 

IF ¢(WO-DM) 1891891 

K=1 

120 

Val=1e900) 


EQUILATERAL TRIANGULAR WALK PATTERNe 


3 


Az(-0.590.866) 


CALCULATION OF DEVIATIONS W IN THE NEW TEST POINTSe 


4 


Z(1)=ZO+HEVEA 

CW=FUNC(Z(1)) 
W(1)2ABS(REAL (CW) +ABS1ATMAG(CW) ) 
Z(2)2Z0+H#V 

CWHFUNC(Z(2)) 
W(2)2ABS(REAL (CW) )+ABS(ATMAG(CW)) 
204) eZO+H*¥CONIG(IA) FV 
CW=FUNC(Z(3)) 
W(3)=ABS(REAL (CW) )+ABSIATMAG(CW)) 
N=N+1 


DETERMINATION OF WINR)» THE SMALLEST OF W(I)6 


6 
7 


8 


9 
10 
11 
12 


TF CWC 1) -W03)) 59596 
{FUWCL)-WO2)) 79898 
IF (W02)-W03)) 85899 
NR=] 

GOTO 10 

NQ&=2 

GOTO 10 

NRz3 

IF C(WO-W(NR)} Llsl2912 
GOTO (13914915) 0K 
K=1 

1=0 


FORWARD DIRECTED WALK PATTERNe 


13 


A=(06707906707) 
Ve(Z(NR)-Z0)/H 
WO=zW(NR) 

Z0#Z (NR) 
IF(WO-DM) 1891854 
Ka2 


REDUCTION OF STEP LENGTHo 


14 


IF(HeLTeHM) GOTO 18 
H=H#0 025 

GOTO 3 

K=3 


RESTORATION OF STEP LENGTH. 


15 


H=H*¥4e 
GOTO 2 
t=I41 


ROTATION OF WALK PATTERNes 


16 


IFCI-7) 16916917 
veut.) 
GOTO 3 


REDUCTION OF STEP LENGTH. 


17 


18 


IF (HeLToHM) GOTO 18 
HsH#0 625 

1=0 

GOTO 2 

ZEsZ0 

HEsH 

DE=wo 

RETURN 

END 


0 
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ALGORITHM 366 

REGRESSION USING CERTAIN DIRECT 

PRODUCT MATRICES [G2] 

P. J. CLARINGBoLD (Recd. 10 May 1968 and 8 July 1969) 

Division of Animal Genetics, C.S.I-R.O., P.O. Box 90, 
Epping, N.S.W., Australia, 2121 


KEY WORDS AND PHRASES: analysis of variance, analysis 
of covariance, regression analysis, experimental design, matrix 
direct product, projection operator, orthogonal matrix 

CR CATEGORIES: 5.14, 5.5 


procedure regressor (vec, kobs, levs, code, kfac, nfac, ndf); 
value nfac; 
integer kobs, levs, code, kfac, nfac, ndf; 
real vec; 

comment The mathematical basis of the algorithm which forms 
the kernel of a very general analysis of variance and covariance 
procedure (Algorithm 367) is set out in [5, 6]. An overwhelming 
majority of the experimental designs in [2] may be analyzed in 
this way. Statistical nomenclature is given in parentheses. 

A vector vec, of nobs elements (observations) traced by kobs, 
is replaced by ndf S nobs elements (regression coefficients) 
obtained by the matrix product C7-vec, since the matrix is 
semiorthogonal. The number of initial elements is implied as 
the product of the nfac values of the variable levs which are 
traced by kfac. Values of code, similarly traced, specify matrices 
which enter a direct product [4] to form the transforming matrix 
CT (independent variates transposed). As code takes the values 0, 
1, or 2, the matrices selected are J, j, or V, i.e. the unit matrix 
of order levs, the unit vector of levs equal elements, or a matrix 
made up of levs — 1 mutually orthogonal unit vectors which are 
also orthogonal to the previous vector (V7.7 =0 and V?-V = 1). 
A direct product of the transposes of the selected matrices forms 
the transforming matrix. An example of an actual call is shown 
to illustrate tracing: example: regressor (vec[kobs], kobs, 
levs[kfac], code|kfac], kfac, nfac, ndf). 

The squared length of the resultant vector (swm of squares on 
ndf degrees of freedom) is equal to the squared length of the 
projection of the original vector in the subspace spanned by an 
idempotent symmetric matrix (idiz) P. Eigenvectors associ- 
ated with unit eigenvalues of this projection operator [1] com- 
prise the rows of the transforming matrix. 


I? = vecT-P-vec = vecT-C-CT-vec, (1) 


The cosine of the angle between two similarly transformed vec- 
tors (correlation coefficient) is obtained in an analogous manner 
from a scalar product (sum of cross products). 


Iveclweccos(Q) = vec™- P-wec. (2) 


Prior evaluation of direct products is very wasteful of opera- 
tions [8], and use is made of an identity which involves ordinary 
(-) and direct (X) products: 


(AXBXC)-y = (AXIXD): CXBXD: IXIXC)+y. (3) 


Although shown for a triple product the identity obviously 
holds for any number of factors. The identity, however, is only 
valid for square matrices and the rectangular j or V factors 
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must therefore be bordered by zeros to satisfy. In the algorithm 
multiplication by these zeros is bypassed, and after each trans- 
formation the vector is packed ready for the next. 

Another identity: 


(AXB)-(CXD) = (A-C) X (BD), (4) 


implies that the ordinary products in (8) may be taken in any 
order, since the direct product factors commute. The trans- 
- formations should therefore be taken in the order which achieves 
the largest reduction in the number of elements. Since j-factors 
achieve a reduction in the ratio levs:1, while V-factors merely 
achieve levs:levs — 1, the transformations are arranged in de- 
scending order of levels for j-faciors followed by an ascend- 
ing order of levels for V-factors. Transformations requiring 
the unit matrix are, of course, skipped. 
REFERENCES: 

1. Banernser, K.S. A note on idempotent matrices. Ann. Math. 
Statist. 35 (1964), 880-882. 

2. Cocuran, W. G. and Cox, GERTRUDE M. Experimental De- 
signs (2 Ed.) Wiley, New York, 1957. 

3. Goon, I. J. The interaction algorithm and practical Fourier 
analysis. J. Roy. Statist. Soc. {B} 20 (1958) , 361-373. 

4. Marcus, M. Basic theorems in matrix theory. Nat. Bur. 
Standards Appl. Mathl Ser. 67 (1960), Washington, D.C. 

5. Netpsr,J.A. The analysis of randomised experiments with 
orthogonal block structure. I. Block structure and the null 
analysis of variance. Proc. Roy. Soc. {A} 283 (1965), 147-162. 

6. Nevper,J.A. The analysis of randomised experiments with 
orthogonal block structure. II. Treatment structure and the 
general analysis of variance. Proc. Roy. Soc. {A} 283 (1965), 
163-178; 

begin 

integer ifac, jgo, nift, nrgt, jfac, jump, wlft, irgt, jumphold, tlev, 

jumpo, jumper, tup, idown, nlev, maxp; 

real x, v; 

integer array ranks(1:nfac]; 

maxp := ndf := 1; 


for kfac := 1 step 1 until nfac do 

begin 
comment Transmit levels and determine largest factor; 
ranks|kfac] := nlev := levs; ndf := ndfXnlev; 


if nlev > maxp then mazp := nlev 
end with degrees of freedom set in null case; 


maxp := —(maxp+l1); 
for jgo := 1, 2 do 
begin 
comment Averaging before differencing transformations; 
mfac: 
begin 
comment Search for best remaining factor; 
nlev := maxp; tfac := 0; 
for kfac := 1 step 1 until nfac do 
begin 


tlev := (8—2Xjgo) X ranks{kfac}; 
if code = jgo /\ ranks{kfac] = levs A tlev > nlev then 
begin 
nlev := ilev; ifac 
end if a better factor 
end search; 
if tfac > 0 then 


:= kfac 
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begin 
comment Process a factor; 
kfac := ifac; nlev := levs; nlft := nrgt := 1; 


for jfac := 1 step 1 until nfac do 
if ifac ~ jfac then 
begin 
comment Determine orders of unit matrices to left 
and right; 
if jfac < ifac then nlft := nlft * ranks[jfac] 
else nrgt := nrgt X ranks{[jfac] 
end products; 
begin 
comment Evaluate normalization constants; 
array root[jgo :if jgo=1 then 1 else nlev]; 
if jgo = 1 then root[1] := sqrt(1/nlev) 
else 
for ilev := 2 step 1 until nlev do 
root{tlev] := sgrt(1/(ilevX (ilev—1))); 
comment Begin transformation of vector; 
jump := 0; 
comment Loop over all combinations to the left; 
for zift := 1 step 1 until nift do 


begin 
jump := jump +1; 
comment Loop over all combinations to the right; 


for irgt := 1 step 1 until nrgt do 
begin 
jumphold := jump; jump := jump — nrgt; x :=0; 
comment Loop over active factor; 
for ilev := 1 step 1 until nlev do 
begin 
comment Form sum; 
jumpo := jump; kobs := jump := jump + nrgt; 
if jgo = 2 A ilev > 1 then 
begin 


comment Form difference when appropriate; 


vy := vec; kobs := jumpo; 
vec := (a—(tlev—1)Xv) Xrootfilev] 
end now do sum; 
kobs := jump; x := % + vec 
end sum and difference loop; 
if jgo = 1 then 
begin 
comment Insert normalized average; 
kobs := jumphold; vee := x X root{l] 
end insertion; 
jumper := jump; jump := jumphold + 1 
end loop over all combinations to the right; 
jump := jumper; 
end loop over all combinations to the left 


end block; 

tup := nrgt X nlev; idown := if jgo = 1 then nrgi else 
wup — nrgt; 

for ilft := 2 step 1 until nlft do 

begin 


comment Compact vector; 
for irgt := 1 step 1 until nrgt do 


for ilev := 2 step 1 until nlev do 
if tlev < 3 V jgo = 2 then 
begin 
kobs := tup := iup +1; v := vec; 
kobs := idown := idown + 1; vec := v 


end within block moves; 
iup := if jgo = 1 then iup + (nlev—1) X nrgt else 
tup + urgt 
end block moves; 


366-P 2- 0 


comment Adjust dimensions of pseudoarray; 
ranks{ifac] := if jgo = 1 then 1 else nlev — 1; 
ndf := idown; 
go to mfac 
end 
else go to end jgo 
end labeled compound statement; 
end jgo: 
end loop over factor types 
end regressor 
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ALGORITHM 367 

ANALYSIS OF VARIANCE FOR BALANCED 

EXPERIMENTS [G2] 

P. J. CLARINGBOLD (Recd. 27 May 1968 and 8 July 1969) 

Division of Animal Genetics, C.S.I.R.O., P.O. Box 90, 
Epping, N.S.W., Australia, 2121 


KEY WORDS AND PHRASES: analysis of variance, analysis 
of covariance, regression analysis, experimental design, bal- 
anced experiment, missing data, interblock estimate, intrablock 
estimate 

CR CATEGORIES: 5.14, 5.5 


integer procedure balanced anova (y, missing y, x, fixed effect, esti- 
mate, error level, error code, all y, all x, length y, length x, pooled 
beta, se beta, normalized beta, error, df total, df error, tolcor, 
ltolength, tolmpss, ispace, nspace, ires, jres, nres, ttrt, ntrt, tobs, 
nobs, tfac, nfac, max cycle, check diagonality, projector, putpy, 
getpy, putpx, getpa) ; 
value tolcor, tolength, tolmpss, nspace, nres, ntrit, nobs, nfac, maz 
cycle, check diagonality; 
real y, x, ally, all x, length y, length x, pooled beta, se beta, normal- 
tzed beta, error, tolcor, tolength, tolmpss; 
integer error level, error code, df total, df error, ispace, nspace, 
tres, jres, nres, tirt, ntrt, iobs, nobs, ifac, nfac, max cycle; 
Boolean missing y, fixed effect, estimate, check diagonality; 
procedure projector, putpy, getpy, putpx, getpz; 
comment The algorithm provides analyses of variance, covari- 
ance, and regression for data collected according to a wide 
variety of experimental designs. The vector of elements compris- 
ing either a response (y or dependent) or a treatment (zx or inde- 
pendent) variate forms a conceptual complete array of nfac di- 
mensions. The implied subscripts are a set of discrete variables 
which define an error classification. Designs of this type include 
the fully randomized, randomized block, incomplete block, split (to 
any order) plot, Latin (and higher) squares, lattices, et cetera, and 
make up the overwhelming majority in use [3]. By means of an 
appropriate transformation the frequency data of contingency 
tables may be processed to provide partitions of chi-square [1]. 
A comprehensive account of the mathematical basis is given in 
(4, 5]. 

In this implementation extensive use is made of the call-by- 
name facility so that generators and routines involving auxiliary 
store may freely be used for all input variables. Usually data 
sets are quite small and storage of intermediate quantities 
within the immediate access store is possible. In the following 
notes on the formal parameters relevant tracer variables are 
shown in brackets. An arrow (—) indicates that the variable is 
used only as a source of information. 

balanced anova: If the projection of x-variate numbered jért 
has a correlation coefficient exceeding tolcor with the projection 
of x-variate numbered kért in subspace ispace of the design, then 
abnormal termination is forced with balanced anova = 10° X 
ispace + 10° X jirt + kirt. Zero is returned as the value of the 
procedure in the case of normal termination. Note that this 
time-consuming check of the balance of the treatment model 
with respect to the error model is only performed if check diago- 
nality is set true. 
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y, missing y (tres, iobs) — : The y-variaie generator or array 
must provide trial values, e.g. the average of present elements 
for the variate, for any missing data. These elements are flagged 
by true in the Boolean missing y which may take the form of 
an expression in terms of tres, obs, and integer constants. 

z (itrt, iobs) — : A complete specification of the orthogonal 
decomposition of the total sum of squares (and products) using 
polynomials or some other form of contrast representation is 
required. In the case of treatment classifications (for example 
factorial experiment) the x-variate values may be generated as a 
direct product (or as a selection of elements from such a matrix) 
of a number of small contrast matrices, i.e. orthogonal matrices 
with first column having elements greater than zero (usually 
constant). 

fixed effect (itrit, ispace) > : By setting this variable true the 
flagged regression coefficients, i.e. beta number Ztrt in estimation 
subspace number ispace, are declared to be error free or invari- 
ants. In most practical cases this facility is only relevant to the 
constant term of the regression model. 

estimate (dirt, ispace) + : By setting this variable false the 
flagged regression coefficients are declared to be zero and are 
not estimated in the indicated subspaces. Usually this facility 
is not required, and the constant true is used as actual parame- 


ter. 

error level (ifac) —> : The variable sets the number of levels 
of the error classifications. If it is assumed that the conceptual 
subscripts have unit lower bounds, then the upper bounds are 
set. Variates (traced by iobs) must be in lexical order by the 
implied subscripts, and use of a permutation array or function 
may be required to achieve this end. 

error code (ifac, ispace) —> : Error sources of variation (esti- 
mation or error subspaces) are specified by integer codes 0, 1, 
or 2. The codes could be generated by means of a procedure 
which interpreted a string of input characters denoting the 
error structure of the experimental design, see [4, 5]. A set of nfac 
integers specifies a projection operator which spans a subspace. 
The operator is formed as the direct product of (0) tdentity 
matrix I, (1) averaging matrix J, or (2) differencing matriz K = 
I — J. Every element of the averaging matrix is equal to the 
reciprocal of the order, 
eg.: 2,0,1,2,.10KiXhXJsX KiXJs = Pi, say. 
It is required that the error subspaces be mutually orthogonal, 
PiPj = 5ijPi. 

Code Sets for Some Common Designs 


Design Codes PitP? 
Fully randomized 1 2 0 
Randomized or 11 21 02 01 
incomplete 
block 
Split plot 111 211 021 002 011 
Split split plot 1111 2111 0211 0021 0002 0111 
Square or 11 21 12 22 01 
rectangle 
Revlicated square 111 211 021 O12 022 011 


or rectangle 
Three-way crossed 111 211 121 112 221 212 122 222 O11 


error 
In certain circumstances it may be desired to work 
mod(JXJX-++:XJ), that is the y-variates are adjusted to have 
zero mean. In this case the first code is omitted from the analy- 
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sis. Usually it is convenient to pool the subspaces defined by 
JXIX%+:+xXJandKXJX-:-: XJ yielding (by addition) 
IXJxX:-: XJ, and if this is required the first two columns 
of the table are replaced by the rightmost auxiliary column. 

all y {tres], all x [ttrt], length y [tres, ispace], length x [itrt, ispace]: 
The lengths of the y, x, projected y, and projected x vectors are 
returned. Null variates (which have zero length) should be indica- 
ted in, or excluded from, analysis of variance tables (et cetera) 
derived from an activation of the procedure. 

pooled beta, se beta [ires, trt]: The weighted mean regression 
coefficient relating y-variate number ires to z-variate number itrt 
is returned in pooled beta, and the standard error of the estimate 
in se beta. 

normalized beta [ires, tirt, ispace]: Within each subspace the 
regression coeflicients are scaled so that it may be assumed that 
the sum of squares of each (nonnull) projected x-variate is unity. 
The dyad obtained by forming all pairwise products over the 
tracer ires (fixing the other tracers) is a single degree of freedom 
contribution due to treatment (x-variate) number iirt to sub- 
space number space of the analysis of variance (and covariance 
if nres > 1). 

error [ires, jres, ispace]: For each subspace an error covari- 
ance matrix is computed. This is the only variable bearing the 
tracer jres which is constrained so that jres S ires. The calling 
program may make provision to pack the matrices in triangular 
form using a subscript function: pack[ires] + jres, where 
pack{ires}] = (tresX (tres—1)) + 2. 

df total, df error {tspace]: The variables return the total and 
error degrees of freedom for each subspace. 

tolcor: If the activation calls for a check of the orthogonality 
of projected x-variates, then this constant sets the value of the 
correlation coefficient, which should not be exceeded in the test. 
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ments of a transformed y-variate to auxiliary store, while getpy 
performs the reverse transmission. Similar actions on the 
z-variates are carried out by the other two procedures, All four 
routines have similar calling sequences: (vec{ielm], ielm, nelm, 
ivar, ispace), where vec identifies the vector to be moved, zelm 
traces the elements of the vector, nelm (returned by projector) 
specifies the number of elements to be moved, ivar gives the 
variate number, and ispace gives the subspace number. The ele- 
ments to be moved are in the leading position in vec, and an 
appropriate instruction begins for telm := 1 step 1 until 
nelm do. The last two formal parameters may be used to index 
an array listing the starting positions of the vectors in auxiliary 
storage. 
REFERENCES: 

1. Cuarinapotp, P. J. The use of orthogonal polynomials in 
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analysis of variance. Proc. Roy. Soc. {A} 288 (1965), 147-162. 

5. Nevppr,J.A. The analysis of randomised experiments with 
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begin 


array yy, cz{1:nobs]; real s, ¢, v, ssmp; 
integer 7 cycle, ndf, jirt, kirt, kres, nelm, nmis; 


tolength: A projected vector is assigned zero length if the 
ratio of the computed length to that of the unprojected vector, 
multiplied by the square root of the ratio of the number of ob- 
servations to degrees of freedom of the subspace, fails to exceed 
this criterion. 

tolmpss: As a single measure of all missing data a sum of 
squares is computed. If the ratio of the absolute value of the 
difference between this sum and that of the previous iteration 
(or 0), to the current sum, fails to exceed this constant, no 
further iterations are made. _ 

ispace, nspace, tres, jres, nres, itrt, nirt, iobs, nobs, ifac, nfac: 
The identifiers with initial letter 7 or j are tracers mnemonically 
related to the remaining identifiers which define the number of 
subspaces, y-variates, x-variates, observations and error factors, 
respectively. 

max cycle: An upper limit to the number of iterations re- 
quired for the convergence of estimates of missing data is pro- 
vided by this parameter. 

check diagonality: If this parameter is true then the projected 
x-variates are checked for orthogonality. While computing time 
is saved by the opposite setting, incorrect results are computed 
if an invalid assumption of orthogonality is made. 

projector: In order to compute the consequences of projection 
of variates, a choice between at least two procedures is made: 
P-% = C:-CT-z or CT-x. The idempotent symmetric projection 
operator P (see [4, 5]), or the rectangular matrix made up of 
the eigenvectors corresponding with unit eigenvalues (see [2]) 
is used. The second alternative is preferred since the transform- 
ing matrix is then thin, and Algorithm 366 is an implementation 
of this approach. 

pulpy, getpy, putpx, getpzr: These procedures are concerned 
with the transmission of transformed variates between arrays 
internal to the algorithm and auxiliary store. While immediate 
access store may be used as auxiliary store with small problems, 
backing media such as magnetic drum, disk, or tape are required 
for large problems. The procedure putpy transmits all nelm ele- 


real procedure sigma (x, 1, n); 
value n; 
real x; integer 7, 7; 
begin 
real zz; xx := 0; 
for 7 := 1 step 1 until n do wz := 2x + 2; 
sigma := 2% 


end sigma; 
comment Count missing data items; 
nmis := 0; ssmp := 0; 


for tres := 1 step 1 until nres do 
for iobs := 1 step 1 until nobs do 
if missing y then nmis := nmis + 1; 
begin 
comment Get space for estimates of missing data; 
array y missing|l : if nmis=0 then 1 else nmis]; 
comment Set up loop for missing data iteration; 
for 7 cycle := 1 step 1 until maz cycle do 
begin 
comment Analyze data in various error subspaces; 
for ispace := 1 step 1 until nspace do 
begin 
comment Determine subspace degrees of freedom; 
if < cycle = 1 then 


begin 
comment Only compute degrees of freedom once; 
nof :=1; 


for ifac := 1 step 1 until nfac do 
ndf := ndf X (if error code=0 then error level 
else if error code=1 then 1 else error level—1); 

df total := ndf 
end 
else ndf := df total; 
comment Project response vectors; 
nmis := 0; 
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for tres := 1 step 1 until nres do 
begin 
comment Fetch a vector, and possibly fit missing 
data; 
for iobs := 1 step 1 until nobs do 
if missing y then 
begin 


nmis := nmis +1; 
if ispace = 1 then y missing[nmis] := if i cycle = 1 
then y 
else sigma (pooled betaXzx, dirt, ntrt); 
yyliobs] := y missing[nmis] 
end 
else yy[iobs] := y; 
if ispace = 1 thenall y := sqrt(sigma(yyliobs] f 2, iobs, 
nobs)); 
projector (yy[tobs], iobs, error level, error code, ifac, nfac, 
nelm); 
jres := tres; 
error = sigma(yy{iobs] T 2, obs, nelm); 
length y := if sqrt((errorXnobs)/ndf)/all y > tolength 
then sgrt(error) else 0; 
putpy (yyliobs], tobs, nelm, jres, ispace); 
for jres := 1 step 1 until ires — 1 do 
begin 
comment Determine sums of cross products; 
getpy (xa[iobs}], tobs, nelm, jres, ispace); 
error := stgma(yy{iobs]X-xz[tobs], tobs, nelm) 
end cross products 
end dependent variates; 
comment In the first cycle project treatment vectors; 
if < cycle = 1 then 
for jirt := 1 step 1 until nirt do 
if estimate then 
begin 
comment Only work on variates included in regres- 
sion; 
ttrt := ftrt; 
for iobs := 1 step 1 until nobs do zz{iobs] := 2; 
if space = 1 then all x := sqrt(sigma(za[tobs] fT 2, 
tobs, nobs)); 
projector (xx[iobs], tobs, error level, error code, ifac, nfac, 
nelm); 
t := sigma(xz[iobs] Tt 2, cobs, nelm); 
s:= lengthx := if sqrt((tXnobs)/ndf)/all x > tolength 
then sgri(t) else 0; 
if s > 0 then 
begin 
comment Null variates are skipped; 
putpxz(a2z[iobs], iobs, nelm, iirt, ispace); 
if check diagonality then 
for kirt := 1 step 1 until jiri — 1 do 
if estimate then 
begin 
comment Orthogonality checked for variates 
in regression; 
tirt := kirt; ov := length a; 
if v > 0 then 
begin 
comment Null variates are skipped; 
getpx(yy|tobs], tobs, nelm, tirt, aspace); 
if abs(sigma(ez[iobs]xXyy{iobs], iobs, nelm))/ 
(sXv) > tolcor then 
begin 
comment Force termination since ex- 
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; cessive correlation; 
balanced anova := 1000 X (1000Xispace+ 
jirt) + kere; 
go to exit 
end large correlation 
end if secondary variate has projection 
end secondary variate loop 
end if primary variate has projection 
end primary variate loop; 
comment Compute normalized regression coefficients; 
for iirt := 1 step 1 until niri do 
if length x > 0 /A estimate then 


begin 

comment Skip null or not in regression independent 
variates; 

naf := ndf — 1; 

getpx(xa{iobs], tobs, nelm, tirt, ispace); 

for tres := 1 step 1 until nres do 
if length y > 0 then 
begin 


comment Skip null dependent variates; 

getpy (yy[iobs], cobs, nelm, tres, space) ; 

normalized beta := sigma(xx[iobs]Xyy[iobs], tobs, 
nelm)/length x 


end 
else normalized beta := 0 

end 

else for tres := 1 step 1 until nres do normalized beta 
:= 0; 


? 
df error := ndf; 
comment Reduce sums of squares and products for 
regression; 
for tirt := 1 step 1 until niri do 
if length x > 0 A estimate then 
begin 
for kres := 1 step 1 until nres do 
for jres := 1 step 1 until kres do 
begin 
tres := jres; s := normalized beta; 
ires := kres; error := error — s X normalized beta 
end dyad reduction loops 
end normalized regression coefficient computation; 
comment Determine true regressions and information; 
for ires := 1 step 1 until nres do 
begin 
for jres := 1 step 1 until tres do 
error := if lengthy = 0 \V ndf = 0 then 0 else error/ndf; 


jres := tres; 
for ttrt := 1 step 1 until nirt do 
begin 


comment Clear areas at start; 
if ispace = 1 then pooled beta := se beta := 0; 
if estimate then 
begin 
comment Set information as unity for fixed 
effects; 
t := if fixed effect /\ length x > 0 then 1 else 
if ndf = 0 then 0 else length x71 2/ (if error=0 
then 1 else error); 
se beta := se beta + t; 
pooled beta := pooled bela + t X (if length x=0 
then 0 else normalized beta/length x) 
end of addition to pools 
end independent variate loop 
end dependent variate loop 
end error subspace loop; 
for ives := 1 step 1 until nres do 
for tirt := 1 step 1 until nirt do 
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if se beta > 0 then 
begin 
comment Compute weighted means and standard 
errors; 
pooled beta := pooled beta/se beta; 
se beta := sgri(1/se beta) 
end average; 
if nmis > 0 then 
begin 
comment Check convergence of missing items; 
8s := sigma(y missing[iobs] T 2, cobs, nmis); 
if abs(s—ssmp)/s > tolmpss then ssmp := s 
else go to finish 
end missing data convergence test 
end cycle; 
finish: balanced anova := 0; 
exit: 
end block 
end balanced anova 
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ALGORITHM 368 

NUMERICAL INVERSION OF LAPLACE 

TRANSFORMS [D5] 

HaRatD STEHFEST* (Recd. 29 July 1968, 14 Jan. 1969 
and 24 July 1969) . 

Institut f. angew. Physik, J. W. Goethe Universitat, 
6000 Frankfurt am Main, W. Germany 


* The work forms part of a research program supported by the 
Bundesministerium fiir wissenschaftliche Forschung and the 
Fritz ter Meer-Stiftung. 


KEY WORDS AND PHRASES: Laplace transform inversion, 
integral transformations, integral equations 
CR CATEGORIES: 5.15, 5.18 


procedure Linv(P, N, T, Fa, V, M); 
value N, T; 
integer M, N; real T, Fa; array V; real procedure P; 
comment If a Laplace transform P(s) is given in the form of a 
real procedure, Linv produces an approximate value Fa of the 
inverse F(t) at 7. Fa is evaluated according to 


In2~ In2. 
Fa = 5 ye (“ ‘). 
N must be even. Since the V; depend on N only, in case of re- 
peated procedure calls with the same N the array V is to be 
evaluated only once. That is why the formal parameter M has 
been introduced: that part of the algorithm which computes the 
V; isrun through only if M # N, and after every call of Linu M 
equals N. At the first call M may be any integer different from N. 

The calculation method originates from Gaver [2], who con- 
sidered the expectation of F(t) with respect to the probability 
density 


= (2n)! 
IG) = 8 ay 


il * F(i)fa(a, t) at (1) 
0 


a — e7at)ne-nat a>o: 


"y 
Fy 
il 


“nln — D! a1 \4 


aS, (") (=1)'P(n + i)a. 


Sn(a, t) has the following properties: 

lL. fo’ fa(a, t) dt = 1, 

2. modal value of f,(a, t) = In 2/a 

3. var(t) = 1/a?> oto 1/(n + 1)%. 

They imply that F, converges to F(In 2/a) for n > o. F, has 


the asymptotic expansion [2] 


In 2 
ad eee ae 
a n n ni 
For a given number WN of P-values a much better approximation 
to F(in 2/a) than Fy_; is attainable, and that by linear combi- 
nation of F,, Fs, ++: , Pj: : requiring 
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# a | 
2, ©) Gey i- op = 8, 
k=0,1,---,K—-1, K<N/2, 
we find 


a(K) = — (*) é(N/2 +1 — 1)* 


K! 
and thus 
K oe In 2 kt (N/2 — K)! 
> t(K)Pynui = F ( a + (=D) a (N /2)! 


feng (N/2 — K)! 
(N /2)! 
Setting K = N/2,a = In(2)/T, and using (1) we get the ex- 
pression the procedure evaluates: 


wie In? In2. 
Fa = »» t(N/2)F wei = T > V;P TT ‘) . 
with 
Min(i,N/2) Ne 
V,=(-1)" k (2h)! a 
-[F] (N/2 — k)iki(k — Iie — k)M2k — 1)! 
TT. 


(The method of “extrapolation to the limit,’’ which Gaver [2] 
used, leads to less accurate results for the same N, because not 


' g§0 many powers of n cancel out. Moreover, with this method NV 


must be a power of 2, so that in general one cannot make the 
best use of the available computer precision.) 

Theoretically Fa becomes the more accurate the greater N. 

Practically, however, rounding errors worsen the results if N 
becomes too large, because V; with greater and greater absolute 
values occurs. (This reflects the unboundedness of the inverse 
Laplace operator.) For given P(s) and T the N at which the 
accuracy is maximal increases with the number of significant 
figures used. For fixed computer precision the optimum value 
of N is the smaller, i.e. the maximum accuracy is the greater, the 
faster fF’, (see eq. (1)) converges to F(T). In the following the 
term ‘‘smooth”’ is used to express that the rate of convergence 
is sufficiently great. An oscillating F(t) certainly is not smooth 
enough unless the wavelength of the oscillations is large com- 
pared with the half-width of the peak which fy;2(n 2/T, t) has 
at T. No accurate results are to be expected, too, if F(t) has dis- 
continuities near 7. If F(t) behaves equally in the neighborhood 
of two different 7'-values the result at the smaller 7'-value will 
be the better one, because the peak of f,(In 2/7, t) broadens 
as T increases. 
- The only way to sharpen these qualitative statements is to 
apply [inv to many Laplace transforms the inverses of which 
are known. This was done with 50 transforms. The numbers of 
significant. figures used ranged from 8 to 17 (IBM 7094, single 
and double precision, CDC 3300). The 7'-values lay between 0 
and 50. It was found that with increasing N the number of cor- 
rect figures first increases nearly linearly and then, owing to 
the rounding errors, decreases linearly. The optimum N is 
approximately proportional to the number of digits the machine 
is working with. Table I was calculated using 8-digit arithme- 
tic and N = 10. 
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TABLE I 
T F(T) Fe F(T) Fa 
1 1 ee 
F() = Jai , P(s) = ai POS) ace in 
1.0 0.56419 0.56555 —0.57722 —0.57782 
2.0 0.39894 0.39912 —1.27036 — 1.27084 
3.0 0.32574 0.32655 — 1.67583 —1.67544 
4.0 0.28209 0.28278 — 1.96851 —1.96392 
5.0 0.25231 0.25174 —~ 2.18665 —2.18727 
6.0 0.23333 0.22989 — 2.36898 —2.36870 
7.0 0.21324 0.21322 — 2.52313 — 2.52270 
8.0 0.19947 0.19956 — 2.65666 — 2.65740 
9.0 0.18806 0.18814 —2.77444 — 2.77390 
10.0 0.17841 0.17796 — 2.87980 —2.88091 
F(t) = 8/6, P(s) = 1/s4 F(t) = o*, P(s) = 1/(s + 1) 
1.0 0.16667 0.16568 0.36788 0.36798 
2.0 1.33333 1.32543 0.13534 0.13557 
3.0 4.50000. 4.47354 0.04979 0.05043 
4.0. 10.66667 10.60342 0.01832 0.01849 
5.0 20. 83333 20. 70845 0.00674 0.00640 
6.0 36.00000 35 .78832 0.00248 0.00195 
7.0 57.16667 56 .82535 0.00091 0.00036 
8.0 85 .83333 84.82735 0.00034 —0.00006 
9.0 121.50000 120 .78473 0.00012 —0.00047 
10.0 166 .66667 165 .66759 0.00005 —0.00020 
PW) = sin (VB, POs) = /Z, oN FW = Lap, PQ) = ESB 
1.0 0.98777 0.98775 — 0.66667 —0.66533 
2.0 0.90930 0.91001 —0.33333 ~0.32531 
3.0 0.63816 0.63826 1.00000 1.02575 
4.0 0.30807 0.30968 2.33333 2.39533 
5.0 —0.02068 —0.02119 2.66667 2.78844 
6.0 —0.31695 —0.31927 1.00000 1.21092 
7.0. ~— 0.56470 ~ 0.57254 —3.66667 —3.32956 
8.0 —0.75680 —0.76869 — 12.33333 —11.82953 
9.0 —-0.89168 —0.91049 — 26 .00000 — 25 .28393 
10.0 --0.97128 —0.98949 — 45.66667 —44.88511 


With double precision arithmetic and N = 18 the number of 
correct figures doubles. The chosen N-values are about the op- 
timum N for all functions of the table. Evaluating an unknown 
function from its Laplace transform, one should, nevertheless, 
compare the results for different N, to see whether the function 
is smooth enough, what accuracy can be reached, and what the 
optimum JN is. Even then it is risky to rely solely on the results 
of Linv. One ought to be sure a priori that the unknown function 
F(t) has not any discontinuities, salient points, sharp peaks, 
or rapid oscillations. Moreover, the accuracy should be checked 
by employing other inversion techniques. 

The inverses of the 50 test functions were also evaluated ac- 
cording to the inversion technique of Bellman et al. [1], which 
is based on the approximation of F(t) by a polynomial in e~*. 
It appeared that the algorithm Linv generally produces better 
results, i.e. the condition ‘“F(¢) is everywhere smooth (in the 
sense described above)”’ is less restrictive than the condition 
‘“(F (—In(r)) can be well approximated by a polynomial in r = e~* 
for0 sr s 1’. The evaluation of the function F(t) = #/2 from 
its Laplace transform P(s) = 1/s* illustrates the difference be- 
tween the two conditions: using Liny the inverse is correct 
within 0.1 percent, using the inversion technique described in 
{1] errors of hundreds of percents occur (VN = 10, 0.1 < 7 < 10). 

The algorithm was successfully applied to renewal equations, 
differential-difference equations, and systems of partial differ- 
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ential equations. Reference [1] includes many other problems 
to which the algorithm can be applied. 
REFERENCES: 

1. Beuuman, R. E., Kauasa, R. E., anp Locxert, J. Numerical 
Inversion of the Laplace Transform. American Elsevier, 
New York, 1966. 

2. Gaver, D. P. Observing stochastic processes, and approxi- 
mate transform inversion. Oper. Res. 14, 3 (1966), 444- 
459; 

begin 

integer 7, th, k, Nh, sn; reala; 

if M = N then go to C; 

G[0] := 1; Nh:= N/2; 

for 7 := 1 step 1 until N do G[t] := Glz—1] X 7; 

A[l1] := 2/G[Nh—-1]; 

for i := 2 step 1 until Nh do 

A(t} := 7 Tt Nh X Gl2x2]/G[Nh—-1] XG) XGt—-1)); 

sn := 2X sign (Nh—Nh+2X2) — 1; 


array G[0:N], H[1:N/2]; 


for 7 := 1 step 1 until N do 
begin 
V(t] := 0; 
fork := (+1) + 2step1 untilif 7 < Nh then? else Nh do 


V[e] := Vit] + Alk]/(Gii—k]xG[2xk—1])); 
V{t] := sn X V[t); 
sn := —sn 
end; 
M :=N; 
C: Fa := 0; a := In(Q2)/T; 
comment In(2) should be replaced by its actual value 
0.69314... ; 
for 1 := 1 step 1 until N do 
Fa := Fa + Vit] X P(tXa); 
Fa := aX Fa 
end 
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Some errors have crept into the comment of the procedure af- 
ter proof-reading: 
The formula following ‘‘and thus’’ should read 


K In 2 N/2—K)! 
> a(K) Puy = F ( + (-1)FHag S aie 
(N/2—K)! 
+o \ 721 ): 


The formula following ‘‘with’”’ should read 


as kN?2(2k)! 
(N/2—k) lk kb—1)(t—k) (2kh—1)1 


Min(i,N/2) 
Vy = (—1)" 24 


i-[F] 
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ALGORITHM 369 poissrn := k 
GENERATOR OF RANDOM NUMBERS end 
SATISFYING THE POISSON DISTRIBUTION [G5] end porssrn 
Henry E. Scuarrer* (Recd. 27 Jan. 1969 and16 July 1969) 
North Carolina State University, Genetics Department, 

Raleigh, NC 27607 


* This work was supported by Grants PR-00011 and GM-11546 
of the National Institutes of Health. 


KEY WORDS AND PHRASES: Poisson distribution, random 
number generator 
CR CATEGORIES: 5.5 


integer procedure poissrn (lambda); 
value lambda; real lambda; 

comment At each call this procedure returns an observation 
from a Poisson distribution with parameter lambda. The rejec- 
tion method discussed by Kahn [1] is used. It requires an aver- 
age of lambda + 1 (pseudo) random numbers (uniformly dis- 
tributed on the 0, 1 interval) per call. For efficiency the random 
number generator should be coded in-line. 

This procedure is especially suitable when a small number of 
random numbers are needed from each of a large number of 
different Poisson distributions. This can occur when the Poisson 
parameter used in each call is itself chosen according to some 
probability distribution. Algorithm 342 [2] is more efficient for 
repeated use of the same value of the Poisson parameter. 

A value of —1 is returned to signal a value of lambda which 
is not positive. A value of —2 is returned to signal a value of 
lambda which is too large for the significance of the computer. 

I thank the referee for his suggestions and comments. 
REFERENCES: _ 

1. Kaun, H. Applications of Monte Carlo. RM-1237-AEC, Rand 

Corp. 1956 (revised version). . 

2. Snow, R. H. Algorithm 342, Generator of random numbers 
satisfying the Poisson distribution. Comm. ACM 11 (Dec. 


1968), 819; 
if lambda < 0.0 then poissrn := —1 
else 
begin 
real z; 
z:= exp (—laamdb); 
if z = 0.0 then poissrn := —2 
else 
begin 


real ¢; integer k; 
real procedure random; 
begin 
comment The body of this procedure must be provided 
by the user to generate the uniformly distributed random 
numbers required by poissrn. The random number gen- 
erator is placed here rather than called as a global pro- 
cedure to decrease the time taken to obtain each random 
number. For the same reason a fast generator should be 
chosen. It is also important that this generator should 
have negligible serial correlation; 
{procedure body); 
end random; 
k:=0; ¢t:= 1.0; 
for t := t X random while i > zdok :=k-+1; 
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KEY WORDS AND PHRASES: random number generator, prob- 
ability density function, transformation, cumulative density 


function 
CR CATEGORIES: 5.18, 5.5 


Introduction. The algorithm below will generate random num- 
bers from any probability density function, whether it be ana- 
lytical, hypothetical, or experimentally acquired. Although there 
are in existence some fast and some general routines, the fast ones 
are for specific densities whereas the general algorithms are slow. 
As an example of a general algorithm, IBM’s GPSS [7] uses the 
transformation theory of random deviates [4] to generate random 
numbers from any density function which can be described by data 
points. The GPSS algorithm is simple, and its precision is de- 
pendent upon the degree of interpolation and the number of points 
used for estimating the transformation function. 

The program below has made the transformation method more 
accurate than the GPSS routine by using 257 points and linear 
approximation to the probability density function. Speed was 
acquired by appropriate organization of necessary tables. A time 
estimate for the performance of an assembly language program of 
the algorithm RANDG on an IBM 360/65 is about 33usec for 
each generation. 

Inttialization. The operation of RANDG is based on vectors 
@ and R& which can be derived by RANDGI as indicated below. 
An explanation of the routine RANDGI will give the reader some 
insight into the theory of RANDG. 

1. Let (a, ys), 7 = 1, 2,--+ , n be coordinates describing the 
probability density function, y = f(x). 

2. Using the trapezoidal rule, find p; = (3 f(x) dx so that p(z) 
approximates the cumulative density function of f(z). 

3. Let = p~(v), the inverse cumulative density function. 

4. Find q; = p7!(v;) by using Lagrange’s quadratic interpola- 
tion formula on p~!(v) for values of v; = 7/256 andj = 0,1,2,--- 
256 [5]. 

5. Compute f(q;) and let rj = (f(q541) -F(as))/ OF (@i41) +4 (as) 
for 7 = 0, 1, 2, --- , 255. The | 7; | is the ratio of the triangular 
area to the total area of a trapezoid approximating the probability 
density function between x = q; and z = q;4: (Figure 1) and the 
sign of r; is the sign of the derivative. If the vectors Q and R are 
available to the experimenter, it is not necessary to use RANDGI. 
It should also be noted that RANDGI need be used only once for a 
given density function and, therefore, does not usually affect the 
speed of generation. 

Program. The routine RANDG then uses Q and R to generate 
the random ordinates in the following manner: 

1. Select the jth interval with probability 1/256. 

2. Let LZ, and Iz be uniform random numbers on the interval (0, 
1). It follows that€Y: = Q; + (Qj41—Q,)*L: is uniformly random 
over the interval (Q; , Qj41) and Y2 = Q; + (Qji1 —Q;)* max (Li, 
L,) is triangularly distributed on the same interval and is skewed 
left. 

3. Let PIY=¥i] = |R; and P[Y=Y.] = 1—|R; . Then 
Y is trapezoidally distributed with 
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TRIANGULAR AREA + RECTANGULAR AREA 


TOTAL AREA = 


F (x) 


Fic. 1. Trapezoid approximating area under the probability den- 
sity function from Q; to Qs. 


2R(Y —Q;)/(Qj41—-Q3)? 
+ (1—RB;)/(Qj1—Q5), Q;5< Y < Qin, 


0, otherwise. 


fY) = 


4. lf R; <0 then use Yo = Q; + (Qy41--Q;)* min (Z, , Zs). 
The use of 256 intervals was arbitrary. For speed in assembly 
language on a binary machine a power of 2 should be used. It is 
possible that 128 or 64 values are adequate and the use of fewer 
than 256 would certainly save storage. (Note: Any good uniform 
random number generator may be used for selecting the interval 
and finding ZL; and Zz [1, 2, 3, 6].) 
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SUBROUTINE RANDG 


PURPOSE 
COMPUTE RANDOM NUMBERS FROM ANY GENERAL DISTRIBUTION. 


qaaoaaaaa 


USAGE 


aa 


CALL RANDG (L,X,R,¥) 


COLLECTED ALGORITHMS (cont.) 
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DESCRIPTION OF PARAMETERS 
INPUT 
L -A NON ZERO ODD RANDOM INTEGER 
X -VECTOR OF LENGTH 257 CONTAINING ORDINATE POINTS 
SEPERATED BY EQUAL PROBABILITY ON DESIRED DISTRIBUTION. 
(CAN BE CALCULATED IN RANDGI). 
R -VECTOR OF LENGTH 256 CONTAINING RATIOS OF DERIVATIVE*DX 
TO AREA/DX FOR EACH ORDINATE POINT IN X. 
(CAN BE CALCULATED IN RANDGI). 
OUTPUT 
Y -RANDOM NUMBER 


REMARKS 
QUADRATIC APPROXIMATION OF CDF (CUMULATIVE DENSITY FUNCTION) 


WHICH IMPLIES LINEAR APPROXIMATION OF PDF (PROBABILITY DENSITY 


FUNCTION). 


SUBROUTINES AND FUNCTION SUBPROGRAMS REQUIRED 


NONE DIRECTLY. RANDGI MAY BE USED FOR INITIALIZATION. 


METHOD 
TABLE LOOK UP PLUS UNIFORM AND TRIANGULAR DISTRIBUTION 


VARIABLES ARE USED. 


kee eR RR KK KR KEKE RRR KR KK KEK RHEE RK KEK KEKE KEKE KEE 


SUBROUTINE RANDG (L,X,R,Y) 


DIMENSION X(257),R(256) 


GENERATE TWO JNIFORM RANDOM NUMBERS ON INTERVAL (1 - 2**31) 


ANY GOOD GENERATOR MAY BE SUBSTITUTED. 


L1=IABS (65539*L) 
L=IABS (65539*L1) 


L2=L 

CALCULATE TWO UNIFORM’ RANDOM NUMBERS 

Kl INTEGER ON INTERVAL (1 ~ 256) 

AK2 REAL ON INTERVAL (0 -— 1.0) 
K1=L1/8388608+1 
AK2=FLOAT (MOD (L1, 8388608) )*1.192093E-7 
IF (AK2~-ABS (R(K1))) 8,8,30 

8 IF(R(K1)) 20,10,10 


CALCULATE TRIANGULAR RANDOM SKEWED LEFT 


10 Y=X(K1) + (X(K1L+1) -X(K1) ) *AMAXO (L1,L2) *4.656613E~10 


RETURN 


CALCULATE TRIANGULAR RANDOM SKEWED RIGHT 


20 Y=X(K1)+(X(K1+1) -X (KL) ) *AMINO (L1,L2) *4.656613E-10 


RETURN 


CALCULATE UNIFORM RANDOM 


[ome EEE e EEK o SEE o EK o SEE > EE o MEE o EE c TEE o SEE o EEE o EE o SEC o EE o EE o EE o EE o EE o IEC o EE o EC o ME o EEC o ME o EE o EE o EE o MEE o IEC c EEC o EE o IE o ME o HE o ME o ME o MEE o EE o IEE » EL o EE o 
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30 Y=X (K1) + (X(K1+1) -X(K1) ) *FLOAT (L2) *4.656613E-10 
RETURN 
END 


2 


SUBROUTINE RANDGI 


PURPOSE 


COMPUTE INITIALIZING VECTORS FOR RANDG 


USAGE 
CALL RANDGI (N,X,Y,P,Q,R,IER) 


DESCRIPTION OF PARAMETERS 
INPUT 
N -NUMBER OF (X,Y) POINTS OF APPROXIMATION TO PDF 
(PROBABILITY DENSITY FUNCTION) 
X VECTOR OF LENGTH N CONTAINING ORDINATE OF PDF 
Y -VECTOR OF LENGTH N CONTAINING ABSCISSA OF PDF 
OUTPUT 
P -WORK VECTOR OF LENGTH N 
Q -VECTOR OF LENGTH 257 CONTAINING ORDINATE POINTS 
SEPERATED BY EQUAL PROBABILITY ON DESIRED DISTRIBUTION. 
R -VECTOR OF LENGTH 256 CONTAINING RATIOS OF DERIVATIVE*DX 
TO AREA/DX FOR EACH ORDINATE POINT IN Q. 
IER-ERROR INDICATOR 
1 - ERROR IN SCALING. I.E. TOTAL AREA OF PDF NOT EQUAL TO 
1. ASSUMING ESTIMATION ERRORS A FUDGE FACTOR IS USED 
TO SCALE A RESULT. 
2 - DENSITY NOT POSITIVE. I.E. SOME Y¥(I) LT 0. ABORT 
3 - NOT IN SORT. I.E. SOME X(I) LT X(I-1). ABORT 
4 -— SEARCH ERROR. SHOULD NEVER OCCUR BECAUSE OF FUDGE 
FACTOR USED. ‘THIS MEANS SOME P(I) NOT LARGE ENOUGH 
FOR SEARCH OF PROPER Q. INVESTIGATION IS NEEDED. 
REMARKS 


NONE 


SUBROUTINES AND FUNCTION SUBPROGRAMS REQUIRED 


NONE 


METHOD 
LINEAR APPROXIMATION OF PDF TO FIND CDF (CUMULATIVE DENSITY 
FUNCTION) AND ICDF (INVERSE CDF). QUADRATIC INTERPOLATION ON 
ICDF TO FIND Q AND R. 

ek RK RK KR KK KR KEK KR KR ER KR KEK KR KEK KEKE KEKE KEK KEKE KEE 


SUBROUTINE RANDGI (N,X,Y,P,Q,R,IER) 


DIMENSION X(300) ,¥(300) ,P (300) ,Q(257) ,R(256) 


CALCULATE CUMULATIVE PROBABILITIES 


IER=0 


IF (¥(1)) 5,10,10 


ERROR 2 


COLLECTED ALGORITHMS (cont.) 


c 
5 IER=2 
RETURN 
10 P(1)=0.0 
Do 15 I=2,N 
IF(W(I)) 5,11,11 
11 IF (X(I)-X(1I-1)) 6,12,12 
c 
C ERROR 3 
c 
6 IER=3 
RETURN 
1200 P(T)=(¥(I) +¥ (IL-1) ) * (X (I) -X(I-1) ) *0.5+P (I-1) 
15 CONTINUE 
IF (P (N)-0.996094) 7,7,16 
16 IF(P(N)-1.003906) 3,7,7 
c 
C ERROR 1 
c 
7 IER=1 
3 F=1.0/P (N) 
DO 4 I=2,N 
4 P(I)=P(I)*F 
c 


C CALCULATE X POINTS FOR EQUAL-DISTANT CUMULATIVE PROBABILITIES 


¢c 
V=0.0 
Q(1) =x (1) 
Tl=y(1) 
Jl=2 
100 DO 150 I=2,257 
IF (1-257) 102,103,103 
102 V=V+3.90625E-3 
Cc 


C LOCATE BEST POINT FOR INTERPOLATION 


c 
DO 101 J=J1,N 
IF(P(J)-v) 101,104,105 
101 CONTINUE 
Cc 
C ERROR 4 
Cc 
TER=4 
103 J=N 


104 = Q(I) =x(J) 
T2=¥ (J) 
GO TO 125 
105 IF(J-3) 113,108,107 
107 IF(J-N) 108,111,111 
los) = IF((P (3) -v) -(V-P(J-1))) 110,110,111 
11009 gl=J-1 
GO TO 120 
111 9 Jl=d-2 
GO TO 120 


113 Jgi=l 
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C QUADRATIC INTERPOLATION OF P INVERSE FOR Q 
Cc 
120 =XT2=P(J1+2) -P(J1) 
XT3=P (J1+2) -P (J1+1) 
XTL=P (J1+1) -P (J1) 
XV1=V-P (J1) 
XV2=V-P (J1+1) 
XV3=V-P (J1+2) 
Q(L) =(XV3*XV2*xX (J1) ) / (XTL*XT2) -: (XV3*XVLEX (TLL) )/ (XTL*XT3) + 
1 (XV2*XVL*X (J1+2) ) / (XT2*XT3) 
c 
C LINEAR INTERPOLATION OF Y FOR T2 AND R 
c 
T2=(¥ (J) -¥(T-1) ) * (Q (LZ) -K(S-1) )//(K (J) -K (J-1) )+¥ (T-1) 
125 R(I-1)=(T2-T1) /(T2+T1) 
Tl=T2 
J1l=s 
150 CONTINUE 
RETURN 


END 


Remark on Algorithm 370 [G5] 
General Random Number Generator [Edgar L. Butler, 
Comm. ACM 13 (Jan. 1970), 49--52] 


L.G. Proll* (Recd. Nov. 1970) 
Department of Mathematics, University of 
Southampton, U.K. 


Key Words and Phrases: random number generator, 
probability density function, transformation, cumulative 
distribution function 

CR Categories: 5.13, 5.5 


Algorithm 370 was translated into Algol and run on an ICL 
1907 computer. Tests revealed that, in several instances, the sub- 
routine RANDGI generated incorrect values for the vector Q and, 
consequently, for R. In particular, RANDGI does not guarantee 
that Q(J) increases with J as clearly should be the case. For ex- 
ample, a selection of the results for Q and R, rounded to four 
decimal places, obtained by RANDGI with 


N=4 

X = (0.0,0.5,1.0,2.0) 

Y = (0.0,0.5,1.0,0.0) 

corresponding to a symmetric triangular distribution {1] on [0, 2], 
is as follows: 


I Q(1) RW) 


78 0.9211 0.0031 
79 0.9268 0.0030 
80 0.9322 0.0029 
81 0.7232 — 0.1262 
82 0.7284 0.0036 
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Similar results were obtained for several other distributions. 

The error lies in changing the interpolating quadratic between 
two interpolation points and will always arise when, for some J, 
(i) interpolation takes place at points between P(/) and P(J+1), 
(ii) the interpolating quadratic on the points P(J—1), P(J) and 
P(J+1) is convex, 

(iii) the interpolating quadratic on the points P(J), P(J+1) and 
P(J+2) is concave. 

Alteration of the interpolating quadratic only at an interpola- 
tion point will avoid this error; an appropriate alteration to the 
algorithm is given later. 

In addition, the following remarks can be made about the 
algorithm: 

(i) The statements labeled 105, 110, and 120 in the subroutine 
RANDGI imply that N > 4. However only three points are needed 
for quadratic interpolation, and moreover, it is meaningful to 
specify a probability distribution by only three points, e.g. any 
triangular distribution. 

(ii) A trivial alteration would allow the subroutine RANDGI to 
trap the condition X(J) = X(/—1) which would otherwise cause an 
overflow in calculating an element of Q. 

(iii) The usefulness of Algorithm 370 can be enhanced by allowing 
the vector Y to represent either a probability density function or a 
cumulative distribution function as required. The experimenter 
may, for instance, have directly available the cumulative polygon (2] 
of an empirical distribution. 


The following alterations to the subroutine RANDGI incor- | 


porate the above correction and remarks: 
(i) In the opening comment, 
(a) replace line 9 by 


C CALL RANDGI (N,X,Y.P,Q,R,K,IER) 
(b) replace line 14 by 


C (PROBABILITY DENSITY FUNCTION) OR CDF 
C (CUMULATIVE DISTRIBUTION FUNCTION), N.GE.3. 


(c) add the words OR CDF to lines 15 and 16 
(d) insert after line 16, 


Cc K -K SHOULD BE SET TO 1 IF Y REPRESENTS 
Cc A CDF, OTHERWISE Y WILL BE INTER- 
Cc PRETED AS A PDF 


(e) replace line 28 by 
C 3-NOT IN SORT, I.E. SOME X(D LE X(I-1).ABORT 
(ii) Change the subroutine statement to 
SUBROUTINE RANDGI (N,X,Y,P,Q,R,K,IER) 
(iii) Change the statement labeled 11 to 
11 IF (X(I) —X(UI—1)) 6,6,12 
(iv) Delete the statement labeled 12 and insert 
12 IF (K.EQ.1) GO TO 13 


PI) = (¥()+Y(I—1))*(X()—X(I-1))*0.5 + 
1 P(I-1) 
C YISAPDF 
GO TO 15 
13. PO) = YD 
C YISACDF 


(v) Delete the five statements commencing at label 105 and insert 
105 IF J.LE.3) GO TO 113 


With these alterations to the subroutine RANDGI and with the 
incorporation of locally available routines for generating uniform 
and triangular deviates [3] into RANDG, satisfactory results were 
obtained for the first two moments of several distributions including 
the beta, symmetric triangular, nonsymmetric triangular, and vari- 
ous empirical distributions. Table I contains a selection of the results 
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obtained for various values of N for samples of size 1000 from a 
beta(4, 3) distribution. In each case, the distribution was specified at 
the points 


XQ) = U-1)/(N-1), T= 1,2--N. 


With the exception of the case when N = 5, the true mean and 
variance lie within the appropriate 95 percent confidence intervals 
obtained from the samples. 

In addition to tests on the first two moments, the samples were 
also subjected to Q - Q plots [4]; i.e t.he ordered observations were 
plotted against the quantiles of the parent distribution. The proce- 
dure indicates a perfect match by a straight line of slope 1 passing 
through the origin and is especially sensitive to differences in the 
tails of the distributions. The quantiles of the beta distribution were 
calculated by interpolation in values of the beta c.d.f. obtained by 
the method of Hill and Pike [5]. Serious departures from the desired 
shape were observed for N = 5, 10 in both the cases K = 1 and 


Table I. 
K #1 = | 
N Mean Variance Mean Variance 
5 0.605 0.037 0.663 0.026 
10 0.576 0.031 0.580 0.028 
20 0.578 0.029 0.580 0.029 
- 50 0.571 0.030 0.574 0.030 
100 . 0.565 0.029 0.573 0.029 
True value 0.571 0.030 0.571 0.030 


K ~ 1. The results obtained for N > 20 were satisfactory for both 
cases. z 
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ALGORITHM 371 

PARTITIONS IN NATURAL ORDER [A1] 

J. K. 8. McKay (Recd. 28 Apr. 1967) 

California Institute of Technology, Mathematics Division, 
Pasadena, CA 91109. 


KEY WORDS AND PHRASES: partitions, number theory 
CR CATEGORIES: 5.39 


procedure partition (p, k, last); integer n, k; 
integer array p; Boolean last; 
comment Partition may be used to generate partitions in their 
natural (reverse lexicographical) order. On entry the first k 
elements of the global integer array p[1:n] should contain a parti- 
tion, p{l] > p[2] > --- > p[k], of n intok parts. In order to ini- 
tialize m, the first entry must be made with last set true: this will 
result in p[{1], p[2], --- , p[k] and k remaining unaltered and last 
set false on exit. On all subsequent, entries with last false, k is 
updated and p[1], p[2], --- , p{k] will be found to contain the 
next partition of n with parts in descending order. On returning 
with the last partition, p{l] = p[2] = --- = pin], last is set 
true. To generate all partitions of n, p[1], k, last should be set 
to n, 1, true, respectively for the initial call: these variables 
must not be altered between successive calls for partition; 
begin 
own integer m; integer {; 
if last then 
begin 
last := false; 
for m := 1 step 1 until k do 
if p[m] = 1 then go toc; 
m:=k; gotoc 


end; 
t:= k— m; 
k:=m; 


p[m] := p[m] — 1; 
a: if p{k] > t then go to b; 
t:= t — pik]; 
k:=k+1; 
pik] := plk—1; 
go toa; 
b: k:=k-+1; 
plik] := t+ 1; 
if p[m] ~ 1 then m := k; 
c: if p[m| = 1thenm:=m—1; 
if m = 0 then last := true; 
end partition 
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ALGORITHM 372 

AN ALGORITHM TO PRODUCE COMPLEX 

PRIMES, CSIEVE [A1] 

K. B. Dunuam (Reed. 29 July 1968 and 7 Oct. 1968) 

Georgia Institute of Technology, School of Information 
Service, Atlanta, GA 30332 


KEY WORDS AND PHRASES: primes, complex numbers 
CR CATEGORIES: 5.39 


procedure CSIEVE (m, Pk, PI); 
value m; integer m; integer array PR, PJ; 

comment Primes can be defined in the complex domain, a + 01, 
where a and b are integers. A unity is +1 or +7. A unity times a 
prime is its associate. Primes are not unique among associates; 
but except for that ambiguity, all the ordinary rules of real 
primes, such as the unique factorization law, apply to complex 
primes. 

It can beshown that a complex integer is prime if and only if its 
conjugate is prime. Therefore it is sufficient to search for primes 
in the one-eighth plane area with a closed bound along y = 0 
and an open bound along x = y, where z is positive and y is less 
than x but nonnegative. Any prime found in that area has 
seven more associated primes: —x% + yi, +a — yt, +y + xi, 
+y — zt. A discussion of complex primes can be found in [1]. 
It should be pointed out that numbers prime in the real domain 
are not necessarily prime in the complex domain, e.g. 2 = 
2+ 0¢ = (1+7)(1—-7). 

Algorithms 35 [2], 310 [3], and 311 [4] generate real primes. 
The simplistic technique used by Algorithm 35 applies equally 
well to generating complex primes. Unfortunately the more 
efficient techniques of Algorithms 310 and 311 cannot easily be 
translated into complex prime sifters. This algorithm, CSIEVE, 
uses the result that a complex integer is prime if the square of 
its modulus is relatively prime to the square of the moduli of 
all previous primes. The procedure is called with a value m, the 
number of complex primes to generate, PR and PI, the real and 
imaginary parts of the prime list generated where PR > PI > 0 
for each prime. The seven other associated primes must be gen- 
erated externally to CSIEVE. 

REFERENCES: 

1. Harpy, G. H., anp Wriaut, E. M. An Introduction to the 
Theory of Numbers. Clarendon Press, Oxford, 1954, Ch. 12. 

2. Woop, T. C. Algorithm 35, Sieve. Comm. ACM 4 (Mar. 1961), 
151. 

3. CHartres, B. A. Algorithm 310, Prime number generator 1. 
Comm. ACM 10 (Sept. 1967), 569. 

4, Cuartres, B. A. Algorithm 311, Prime number generator 
2. Comm. ACM 10 (Sept. 1967), 570; 

begin 
integer dn, nr, nt, sq, root, 1, j, k; 
integer array PM|2:m]; 


dn := PR(I] := PI{l] := P12] := 1; PM[2] := 5; 
j i= PR[2) := 2; 
for nr := 3 step 1 until m do 
begin 
dn := 1 — dn; 
for ni := dn step 2 until nr — 1 do 
begin 


sg:= nar X nr+ ni X nt; 
root := entier (1.5Xnr); 
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for 1 := 2 step 1 until j do 
begin 
if ((sq+ PM{i]) X PM[i]) = sq then go to C; 
if root < PM {i] then go to A; 
end; 
A: fori := 2 step 1 until j do 
begin 
if PM{i] > sq then 
begin 
for k := j step —1 until 7 do 
PM[k+1] := PM{k); 
go to B; 
end 
end; 
B: PM{t] := sq; j:=974+ 13 
if 7 = m then go to D; 
C: end 
end; 
D: 
end CSIEVE 


PR(j)] := nr; PI[j] := m; 


REMARKS ON 

ALGORITHM 372 [Al] 

AN ALGORITHM TO PRODUCE COMPLEX 
PRIMES, CSIEVE [K. B. Dunham. Comm. ACM 13 
(Jan. 1970), 52-53] 

ALGORITHM 401 [A1] 

AN IMPROVED ALGORITHM TO PRODUCE COM- 
PLEX PRIMES [P. Bratley. Comm. ACM 13 (Nov. 
1970), 693] 

Pauu Bratuey (Recd. 25 Feb. 1970) 

Département d’informatiqué, Universite de Montréal, 
C.P. 6128, Montréal 101, Quebec, Canada 


KEY WORDS AND PHRASES: 
bers, complex numbers 
CR CATEGORIES: 5.39 


number theory, prime num- 


Algorithm 372 was run on the CDC 6400 at the University of 
Montreal. The variable 7 is undefined if the for-loop at label A is 
completed. The statement 


Er a a 


should be added immediately before label B. Algol purists may 
also care to remove redundant semicolons after go to A and goto 
B, and the redundant parentheses in one if-statement. With these 
changes the algorithm produced correct results for several values 
of m. 

The comment in Algorithm 372 is slightly inaccurate. The first 
prime generated by the algorithm is 1 + 7, which does not have. 
PR > PI, and which has not seven but three associated primes. 

It is not possible to compare the speeds of Algorithm 372 and 
Algorithm 401 directly since they generate primes in a different 
order. However, the following test was run. A value of m was 
chosen, and Algorithm 401 was used to list all the complex primes 
with modulus less than m. The time taken and the number of 
primes produced were noted. Then Algorithm 372 was used to 
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produce an equal number of primes, the time taken again being 
noted. Times observed are shown in Table I. 


TABLE I 


Algorithm 401 


Bs , Time taken by Algorithm , 
modus, 2rgsweed ns | Tigh 372 to braduce the came simes taken 
25 60 0.278 0.331 1.2 
50 189 1.577 2.140 1.4 
75 373 4.217 7.602 1.8 
100 623 8.618 20.214 2.4 
150 1266 23.732 79.481 3.4 
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The conclusion from the figures in Table I is that if the speed 
with which the complex primes are generated is of. paramount 
importance then Algorithm 401 should be preferred to Algorithm 
372. 

As written Algorithm 401 will use more memory than Algorithm 
372 since it is convenient and perspicuous to use steve2 in an un- 
modified form, which makes it necessary to store temporarily all 
the rational primes less than m?. However, if space is tight then 
sieve2 can easily be modified so as to generate rational primes one 
at a time on successive calls, and in this way the use of the long 
array P2 can be avoided. If this modification is made Algorithm 
401 will in fact use less store than Algorithm 372, which wastefully 
stores many useless values in PM. It is also to be noticed that the 
factors 0.7 and 1.4 occurring in the declarations of P2 and P3 may 
be diminished for large m: all that is necessary is that P2 should 
be long enough to hold the rational primes less than m?, and that 
P3 should be long enough to hold the rational primes which are 
not greater than m and which are of the form 4n + 3. Some space 
may be saved similarly in sieve2, which is called from Algorithm 
401. 
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ALGORITHM 373 

NUMBER OF DOUBLY RESTRICTED 

PARTITIONS [A]] . 

Joun S. Wuits (Recd. 4 Mar. 1969) 

University of Minnesota, Department of Mechanical 
Engineering, Minneapolis, MN 55455 


KEY WORDS AND PHRASES: partitions, restricted parti- 
tions, sums of integers, restricted sums 
CR CATEGORIES: 5.39 


procedure setk (P, N, K); value N, K; 
integer VN, K; integer array P; 
comment The number of partitions of Z with parts greater than 
or equal to K and less than or equal to M is set in P[L, M] for 
all L, M such that N > L > M > 0. This algorithm is a general- 
ization of [1] which treats the case K = 1. 
REFERENCE: 
1. McKay, J. K. 8S. Algorithm 262, Number of restricted par- 
titions on N. Comm. ACM 8 (Aug. 1965), 493; 
begin integer L, M; 
for L := 0 step 1 until N do 
for M := 0 step 1 until L do P[L, M] := 0; 
P[0, 0} := 1; 
for L := K step 1 until N do 
for M := K step 1 until L do 
P[L, M] := P[L, M—1] + P[L—M, if L-M<M then L—M 
else M] 


end 


0 
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ALGORITHM 374 

RESTRICTED PARTITION GENERATOR [A1] 

JoHN 8. WuiteE (Recd. 4 Mar. 1969) 

University of Minnesota, Department of Mechanical 
Engineering, Minneapolis, MN 55455 


KEY WORDS AND PHRASES: partitions, restricted parti- 
tions, sums of integers, restricted sums 
CR CATEGORIES: 5.39 


procedure gen (P, N, K, position, pin, len); 
value N, K, position; 
integer N, K, position, len; integer array P, pin; 
comment The partitions of N with smallest part greater than 
or equal to K are mapped in their natural order, one-one, onto 
the consecutive integers from 0 to P[N, N] — 1, where P[N, N] 
is the number of partitions of N with smallest part greater than 
or equal to K. The array P is set by the procedure setk. On entry, 
postition contains the integer onto which the partition is mapped. 
On exit, len contains the number of parts of the partition and 
ptn[1 :len] contains the parts of the partition in descending order. 
This algorithm is a generalization of [1] which considers the 
case K = 1. 
REFERENCE: 
1. McKay, J. K. 8. Algorithm 263, Partition generator. Comm. 
ACM 8, (Aug. 1965), 493; 
begin integer L, M, psn; 
L:= N; psn := position; len := 0; 


A: 

len:=len+1; M:=K; 
B: 

if P[L, M] < psn then 

begin 

M:=M+41; gotoB 

end 

else if P[L, M] > psn then 
C: 

begin 


pin[len] := M; psn := psn — P[L, M—1]; 
L:=L—M; ifL <Kthen gotoD; goto A 
end 
else M := M+1; 
if M = L then go to C else go to B; 
D: 
end; 
begin integer N, I, J, K, len, position; 
integer array P[(0:20, 0:20], pin[0:20); 
comment driver for setk and gen; 
Nezt: 
outstring (1, ‘*’’); outstring (1, ‘’’); 
outstring (1, “partitions of N, N=’’); ininteger (2, N); 
outsiring (1, ‘with parts > K, K="’); ininteger (2, K); 
for J := 0 step 1 until N do 
for J := 0 step 1 until N do P{/, J] := 0; 
setk (P, N, K); 
outstring (1, ‘‘P array’’); 
for I := 0 step 1 until N do 
begin 
for J := 0 step 1 until N do outinteger (1, PZ, J]); 
outstring (1, ‘*’’) 
end; 


outstring (1, ‘*’’); 
outstring (1, ‘‘pos. partition’); 
for position := 0 step 1 until P[N, N] — 1 do 
begin 
gen (P, N, K, position, pin, len); 
outinteger (1, position) ; 
for I := 1 step 1 until len do outinteger (1, ptn{I]); 
outstring (1, ‘’’); 
end; 
go to Next 
end 
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ALGORITHM 3875 

FITTING DATA TO ONE EXPONENTIAL [E2] 

H. Spatu (Reed. 23 Oct. 1967) 

Institut fiir Neutronenphysik und Reaktortechnik, Kern- 
forschungszentrum Karlsruhe, Germany 


KEY WORDS AND PHRASES: 
CR CATEGORIES: 5.15 


nonlinear least squares fit 


procedure abfit (x, y, p, n, eps, a, b, ab, eb, bool, exit); 
value n, eps; integer n; real eps, a, b, ab, eb; 
label exit; array 2, y, p; Boolean bool; 

comment If you want to fit data points (a; , y:) @=1, +++ , n) 
with associated weights p; to f(z) = ae? the usual approach is 
to do a linear fit in the sense of least squares with In(f(z)) = 
In(a) — bz to the data (x; , In (y,;)) that is to minimize 

. n 


S* = pa gi (In (y;)—1n (a) +2;)?. (1) 
In [1] it is shown that this approach for finding a and b that are 
minimizing 


S(a, b) = Dy piyr—aem)?  @) 


is in general bad if you do not choose 

gi = py? (=1,-->, 7). (3) 
Proceeding similarly as in [2] from the necessary conditions for 
S having a minimum 


== = Q, (4) 


we eliminate a = a(b) from the first equation of (4) and put this 
into the second one. We result in an equation 


F(b) = 0. (5) 


If we have found a zero b of (5) then (a(0), b) is asolution of (4). 

The procedure abfit has two possibilities to do this. For bool = 
false we use the result b* from minimizing (1) with weights (8) 
to set up the intervals 


E ( - >) , b* ( - Ie = 0, +1, +2, +++, +19) (6) 


and to look if F has opposite signs at the endpoints of one of 
these intervals [ab, eb]. Experience has shown that for realistic 
data this method is a good one. If we do not find such an interval, 
abfit is left through exit and we can deliver ab and eb as input 
parameters to abfit with bool = true. 

In both cases a global procedure Rooifinder must be made 
available to find an existing zero b with relative accuracy eps 
in the calculated or given interval otherwise leaving to the label 
exit, 

The label exit would further be used if for bool = false the 
condition y; > 0 fori = 1,--- ,n is not fulfilled. 

REFERENCES: 

1. Borresr, H. Uber Gewichtsverteilung beim Fit mit Expo- 
nentialfunktionen. ZfK-TPh 22 (1966). 

2. SpAtH, H. Algorithm 295, Exponential curve fit. Comm. 
ACM 10 (Feb. 1967), 87; 

begin integer &; 

real hl, h2, h3, h4, hb, h6, h7, h8, 61, 62, Fl, F2, F8, F4, h; 

procedure Fb(b, F); value 6; real b, F; 

comment For given } this procedure calculates F = F(b); 
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Al := h2 := h3 := hA := 0; 
for k := 1 step 1 until n do 


begin 
h5 := exp(—bXalk]); 26 := pik] xylk]; 
h8 := h5 X 6; = h7 := pik] X hb X& hd; 
hl := Al +8; h2 := h2 + :h7; 
h38 := h8 + afk] X h8; h4 := h4 + alk] X A7 
end; 
a:= hl/h2; F := h3 X h2 — hl X hh 
end Fb; 


if bool then go to ROOT; 
Al := h2 := h3 := 4 := hd := 0; 
comment. The linear fit is done to get the estimate b*; 
for k := 1 step 1 until n do 
begin 
if y{k] < 0 then go to ezit; 


h8 := In(ylk]); h6 := pik] X ylk] X ylk]; A7 := h6 X afk); 
hl :=h1 + h6; h2:=h2 + h7 XK afk]; hB = 3 + A7; 
hA = FA +7 X 18; 15 = hd + hO X h8 

end; 


h8 := 1.0/(h1Xh2—h3Xh3); b r= —hB X (ALKA4—h3XKAhS); 

b1 := 02 :=b; k:=0; h:=0; Fb, F1); F2:= Fl; 
SEARCH: k:=k+1; if k > 20 then go to ezit; 

h:=h-+ .05; ab := b1 X (1.0—h); Fb(ab, F3); 

if F1 X F3 < 0 then begin eb := b1; go to ROOT end; 

eb := b2 X (1.0+h); Fb(eb, F4); 

if F2 <x F4 < 0 then begin ab := 62; goto ROOT end; 

bl :=ab; b2:= eb; Fl:= F3; F2:= F4; goto SEARCH; 
ROOT: Rooifinder (Fb, ab, eb, eps, b, exit) 
end abjit 
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ALGORITHM 376 

LEAST SQUARES FIT BY f(z) = A cos (Bz+(C) [E2] 

H. SpAru (Recd. 26 June 1967 and 28 Oct. 1968) 

Institut fiir Neutronenphysik und Reaktortechnik, Kern- 
forschungszentrum Karlsruhe, Germany . 


KEY WORDS AND PHRASES: nonlinear least squares fit 
CR CATEGORIES: 56.15 


procedure cosfii(x, y, p, n, beginB, endB, eps, A, B, C, fB, 8, fx, 
exit); 
value n, beginB, endB, eps; integer n; 
real beginB, endB, eps, A, B, C, fB, 8; 
array t, y, p, ft; label exit; 

comment Let (%, y:) be n given data points with associated 
weights ». . We want to find the three parameters A, B, and C 
of a curve f(z) = Acos(Bz+C) such that f fits the data in the 
least squares sense. Introducing the parameters a = —Asin(C), 
8 = Acos(C), y = B, we have f(z) = asin(yx) + Bcos(yx) and 
thus only one nonlinear parameter ~. Now we can use the same 
method as in [1]. From the necessary conditions for 


s(a, 8, y) = > De (Xk — f(xx))? 


having a minimum we eliminate a and 8 getting one equation in 
one nonlinear parameter 7, F(y) = 0. If we obtain a root 7* of 
F then the triple (a@(7*), B(7*), y*) isa stationary point of s and 
we finally get the desired parameters by 

B=y, 

C = arctan(a/8), 

A = —sign(a) X sign(sin(C)) X (a? + 6%}. 

A global procedure naraed Rootfinder must be made available 
to cosfii which is able to get a zero y = B of a function F(y) in 
a given interval [beginB, endB] with relative accuracy eps, if 
sign (F(beginB)) ~ sign(F(endB)) otherwise leaving to the 
global label exzt. A bisection routine is possible, but an interpo- 
lation method like that in [2] is to be preferred. 

By setting beginB equal to endB, the procedure cosfit can 
be used to tabulate the functions fB = F(B) = F(beginB), s = 
s(beginB), A = A(beginB), and C = C(beginB) and thus allows 
to get all minima in a given range. Often, the tabulation is made 
superfluous by proceeding as follows. In a rough graph we 
gather two intervals (2:*, 7:**) and (22*, z2**) including two 
successive zeros x1 and x2 of the desired function f. Then the 
two values beginB = 2nr/(a*—2x,**), endB = 2n/(a2**—2,*) in 
general form an interval that contains the value B for which 
s has the absolute minimum. As s has in general infinitely many 
minima, our method is superior to general purpose minimizing 
methods. If the found zero of F is not a minimum of s in the 
sense that the Jacobian s” is numerically not positive definite, 
the program puts s equal to —s. As rounding errors may cause 
here a wrong decision it is recommended to look also at the 
magnitude of s. 

The label exit is further used if, during the zero locating 
process, it would happen that the elimination of a and § were 
not possible. Variables fB and s finally have the values F'(B) 
and s(B) at the found zero. The array fx will contain the fitted 
values fz[k] = A X cos(BX2[k]+C). 

REFERENCES: 

1. SpAta, H. Algorithm 295, Exponential curve fit. Comm. 
ACM 10 (Feb. 1967), 87. 

2. Kristiansen, G. K. Contribution No. 6, Zero of arbitrary 
function. BIT 3 (1963), 205-207; 
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begin 


integer k; real hl, h2, h3, h4, h5, h6, h7, h8, h9, hil, h12, h13, 
hl4, hh, alpha, beta, gamm, t, u, v, w, 2, g, 7, h, d, e, f; 
procedure Fgamma(gamm, Fgamm); 
value gamm; real.gamm, Fgamm,; 
begin 
if gamm = 0 then go to exit; 
Al := h2 := 3 := WA = 15 z= AG := hh? := 18 i= 19 := 0; 
for k := 1 step 1 until n do 
begin 
t := afk]; u:= gamm Xt; v := sin(u); wu := cos(u); 
w:i=vXo; 2:=uXu; gq:= pik]; r:=vX u; 
h:= y[k]; d:=qXh; ei= 
Al:=hl+qXw; h2:=h2+qX% 2; AB :=h +9 X71; 
hMA:=h4+dXo; MH:=hb +dXu; h:=h6+e Xr; 
h7 :=hT +eX (e—w); AB:=AB+fRXu; AV:=hO+f Kv 
end; 
hh := hl X h2 — h3 X hd; 
if hh = 0 then go to exit; h = 1/hh; 
alpha :=hX (RAXh2—h3Xh5); beta :=hX (hlXh5—h3Xh4); 
Fgamm := fB := h6 X (alpha-tbeta) X (alpha— beta) 
+ alpha X beta X hi — alpha X h8 + beta X h9 
end Fgamma; 
if beginB = endB then begin Fgamma(B,fB); go to CC end 
Rootfinder (Fgamma, beginB, endB, eps, gamm, exit); 
B := gamm; 


CC : if beta =0 then C := — 1.5707963 else C := —arclan 
(alpha/beta); 
A := —sign(alpha) X sign(sin(C)) X sgrt(alphaXalpha-+betax 
beta); h:= 0; 
for k := 1 step 1 until n do 
begin 


v := falk]) := A X cos(BXa{k]+C); 
vi=v—ylkl; h:=h+ pik] XvXv 


end; 
s:=h; if beginB = endB then go to END; 
if hl < 0 V hh < 0 then begin s := —s; go to END end; 


hill := hl2 := hl3 := Al4 := 0; 
for k := 1 step 1 until n do 
begin 
u:= BX alk]; v := sin(u); u:= cos(u); 
e:= alk] X z[k]; r:= pik] Xe; f:=rX yk); 
All := AlL + 7 X (uXu—vXv); ALZ:= AI2@+ 7 Ku XK 9; 
hl3B :=hIB+f Xv; hl4:=h14+f Xu 
end; 
hll := hll X (alpha+beia) X (alpha—beia) 7 
— 4 X alpha X beta X h12 + alpha X h13 + beta X h14; © 
hi2 := 2 X alpha X h6 + beta & h7 — h8; h13 := alpha X h7 
— 2X beta X h6 + AY; 
if hll X hh — h18 X (Al X A138 — 13 X Al2) 
+ h1l2 X (h8 X h13 — h2 X h12) < O then s := —s; 


END: end 
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ALGORITHM 377 

SYMBOLIC EXPANSION OF ALGEBRAIC 

EXPRESSIONS [R2] 

MicwHare.u J. LEVINE* 

Department of Physics, Carnegie-Mellon University, 
Pittsburgh, PA 15213 

AND STANLEY M. Swanson{ (Reed. 27 Jan. 1969) 

89 Mid Oaks Lane, St. Paul, MN 55113 


*This work was done in part at the Division of Theory, 
CERN, Geneva, Switzerland. 

+ This work was donein part at the Institute of Theoretical Phy- 
sics, Stanford University, Stanford, California. 


KEY WORDS AND PHRASES: algebra, symbolic algebra, 
symbolic multiplication, algebraic distribution, algebraic multi- 
plication, distribution algorithm, multiplication algorithm, prod- 
uct algorithm, polynomial distribution, polynomial expansion 
Ck CATEGORIES: 3.10, 3.17, 3.20, 4.13, 4.90 


procedure EX PAND(M); integer M; 

comment This algorithm algebraically expands arbitrarily 
parenthesized expressions into monomials. Distribution is direct, 
without intermediate expansion of lower level expressions. The 
algorithm has been used as a part of algebra programs in the- 
oretical physics [2, 3]. It was devised by H. J. Kaiser [1] and re- 
constructed by M. J. Levine. Expansion proceeds in two steps: 
First, parsing an input expression into a sequence of variable- 
operator pairs with associated parenthesis-level information, 
and then picking out the variables which belong together as 
factors of monomial terms. HX PAND accepts an abbreviated 
Aua@ot-like syntax: 

(variable) ::= A|B|C|D|E|F|G 


(primary) ::= (variable) | ((expression)) 
{term) ::= (primary) | (term) X (primary) 
(expression) ::= (term) | (expression) + (term) 
REFERENCES: 


1. Katsur, H. J. Trace calculation on electronic computer. Nu- 
clear Physics 48 (1963), 620. 
2. Levine, M. J. Dirac matrix and tensor algebras on a compu- 
ter. J. Computat. Phys. 1 (1967), 454. 
3. Swanson, 8. M. Computer algorithms for Dirac algebra. 
J, Computat. Phys. 4, 1 (1969), 171; 
begin 
integer LVL, N,T,U; Boolean array MULT(0:M}; 
integer array V, VL, OPL, INDEX(0:.M]; 
integer procedure CHAR; 
begin 
integer C; 
A: insymbol (2, ‘X) + (ABCDEFGu;’, C); 
to A; 
CHAR :=C 
end CHAR; 
procedure DISTRIBUTE(N); integer N; 
comment There are two problems in distribution: first, to se- 
lect the variables in an expression which belong together as 
factors of the current monomial, and then to alter the reference 
marks in USED to indicate the next monomial. A Boolean 
value in USED is associated with each variable-operator pair. 
The expression is scanned from the left to select the first un- 
used variable, and then any variables in an additive relation 


if C = 12 then go 


NEXT: ALTER := true; 
FACTOR: I :=I1+1; 


SKIP: 


R: 
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to the selected variable are skipped before continuing the 
scanning for other factors. For the next monomial, the first 
selected variable followed by a ‘‘+-+” is marked used, and the 
marks on all the variables to the left are altered, depending on 
their operator type and level relation to the “‘+’’. Distribu- 
tion is from left to right (initial factors change most often); 


begin 
integer J, J, K, L, LEVEL; 
Boolean ALTER, PRODUCT, TERM; 
Boolean array USED(0:N}; 
for K := 0 step 1 until N do USED[K] := false; 
J:=I[:=—-1; 
if USED[I] then go to FACTOR; 
J:=J+1; INDEX{|J):= 1; 
if MULT{I] then go to FACTOR; LEVEL := OPL(I); 
if LEVEL > 0 then 
begin 
if ALTER then 
begin 
L:= LEVEL; LEVEL := 
USED|I\ := PRODUCT := 
ALTER := false; 
for K := I— 1step —1 until 0do 
begin 
if OPL{K] < LEVEL then 
begin 
LEVEL := OPL{|K]; PRODUCT := MULT([K\; 
if PRODUCT then LEVEL := LEVEL + 1; 
if LEVEL < L then TERM := false 
end; 
if PRODUCT then USED[K] := TERM 
end 
end 
else 
begin 
I:=1+1; 
end; 
go to SKIP 
end; 
PROCESS(J); if — ALTER then go to NEXT; 
end DISTRIBUTE; 
procedure PROCESS(J); integer J; 
comment Askeletal output routine (normally, monomials are 
further manipulated, sorted, and accumulated); 


VL] + 1; 
TERM := true; 


if LEVEL < OPL{I)| then gotoR 


begin 
integer I; outstring (1, ‘+’); 
for J := 0 step 1 until J do 
begin 
outsymbol (1, “X) + (ABCDEFG”’, V{IINDEX[I}}); 
if I ~ J then outstring (1, ‘“X’’) 
end 
end PROCESS; 
comment The following statements parse the input. A full- 
fledged input routine would extend (primary) to include num- 
bers and would class both ‘“‘—”’ and ‘“‘-+-” together as (adding 
operators). DISTRIBUTE still works with only ‘‘+”’ and ‘‘“x”’ 
since a ‘‘—”’ is either absorbed into a following unsigned num- 
ber or replaced by the string ‘‘—1X’’. Only asingle subexpres- 
sion, followed by an unparenthesized ‘‘+-’’, is expanded at a 
time. M limits the size of this subexpression. A syntax error or 
a semicolon terminates the processing of input; 
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LVL := N:=0; U:= CHAR; if U <4 then goto ERR; 
A: T:=U; if U = 13then 7 := 3 else U := CHAR; 
if U = 4 then 
begin 
if 7’ = 1 then 
hegin 
MULT({N) := true; OPL[N] := LVL; N:=N+1 
end 
else if T = 3 then 
begin 
MULT[N] := false; OPL[N] := LVL; 
if LVL = 0 then begin DISTRIBUTE (N); N := O0end 
else VN := N+ 1 
end 
else if T = 4then LVL := LVL+1 
else go to ERR 
end 
else 
begin 
if! =2/A LVL >Othen LVL := LVL — lelse 
if 7 = 5then begin V[N] := 7; VL[N] := LVL end 
else go to ERR; 
end; 
if U ~ 13 then go to A elseif LVL = 0 then go to B; 
ERR: outstring (1, ‘syntax error’); 
B: end EXPAND 
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ALGORITHM 378 

DISCRETIZED NEWTON-LIKE METHOD FOR 

SOLVING A SYSTEM OF SIMULTANEOUS 

NONLINEAR EQUATIONS [C5] 

W. Panxinwicz (Recd. 24 May 1967, 13 July 1967 
and 2 Oct. 1968) 

Warszawa-90, Wyszatycka 28, Poland 

KEY WORDS AND PHRASES: Newton’s method, nonlinear 


equations, interpolating polynomials 
CR CATEGORIES: 6.15, 5.18 


integer procedure nielin (n, h, w, eps, psi, y, 2); 
value n, h, w, eps, pst; 
integer n; real h, w, eps, pst; array y, 2; 

comment Functional procedure nielin, of the integer type, 
solves a system of simultaneous nonlinear algebraic or trans- 
cendental equations. 


Let us consider a given system of n equations with n variables: 
fily » Y2 2 °°, Yn) = 0, T= 1,2, 0+, 0, (1) 


A kth approximation of the solution of the system (1) is 
supposed to be given: 


re = (yi, yr”, wf) ye). (2) 


If for every 2, 
Ifi(Yo")| <, (3) 


where e > 0 is a given number, then the approximation (2) is 
considered as a solution of the system (1), otherwise a further 
approximation is calculated. 

Let h > 0 be given and construct the n new points: 


(k) (k) k) (k) k) . (k) ke . 
ee (yi pry yd Yi + hi » Yit1 poet y 8), d= 1,2, 00+, m. (4) 


el 


For every function of the system (1) a new interpolating poly- 
nomial of the first order is constructed on the points (2) and 
(4) such that: 


wil¥;”) = $Y), fF =O 1, n,6 = 1,2, 0,0 6) 
A solution of the linear system: 


wily 1Y2,%%*, Yn) = 0, 7=1,2,---,n, (6) 


is used as the (k+1)-th successive approximation. 

The special choice of the interpolation points (2) and (4) 
assures existence and uniqueness of the interpolating poly- 
nomials w; (5). Namely, the kth approximation has for the ith 
function the form: 


wP(Y) = fH) + oP Os — vs, @) 
where 
ge = (fi) — FY). (8) 


The solution of the system (6) where w, is given by (7) can 
be written in the form (see [2]): 
yh = YP — Aa)? XK, 4G = 1, 2 +, 2, (9) 


k k k ke . : ° 
where z™ = (zs 2 zs M, tee, zi») is a solution of the following 
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linear system: 
De LAY) X 2 = FV), 6 = 1% om, (0) 
— 


and 


a® =o 1— >> 2%, (11) 
m=1 
If the sequence { Y“} is convergent whenk — © and, {h®} +0 
then the solution of the system (1) is the limit of the sequence. 
The algorithm described above is realized by means of the 
procedure nielin, which in turn uses the following two addi- 
tional procedures: 
(1) nonlocal procedure f(y, z), which calculates for a given 
vector y values of the left-hand sides of the system (1), and 
(2) local procedure gauss (u, a, y), see [1]. 
Input parameters: 
nm number of equations in the system (1), 
h number which is used for the construction of auxiliary 
points (4), . 

w factor multiplying the number A in every iteration, 

eps number used in the checking of condition (2), 

pst maximal admissible absolute value of the left-hand 

sides of the system (1). 
Input/output parameters: 

-y vector of dimension [1:n]. Initially this vector must 
contain the starting approximation; subsequently y will 
contain the successive approximations to the solution. 

Output parameters: 

z vector of dimension [1:n] which contains the values of 
the equations in (1) evaluated at y, 

nielin assigned one of the following values: 

—1 if any left-hand side exceeds the given value psi, 
—2 if the linear system (10) is singular, 
—3 if the sum of the roots of the system equals 1, i.e. if 
alpha = 0 (11), 
m number of iterations, if the required accuracy eps is 
attained. 
To solve the system 


yr? + y? —1=0, 
0.75y:3 — Y2 + 0.9 = 0, 


the procedure (see footnote*) was applied. 
For eps = 10-7, pst = 1038 and w = .1 the following results 
were obtained: 


Example. 


40 h k y r4 k y z 
—.4 .1 7 — 9817026 — 40-9 

—.1 . 1904203 910-9 

—.7 1 6 — .9817026 0 

—.2- 1904203 0 

—-.7 1 —-2 .3081622 —Tio-l 4 .3569699 —41-8 
—.2 .9866243 dio-l -9841159 —410-8 


*The procedure applied was: 
procedure f(y, z); 
array Y, 2; 


begin 
2{1] := yl] 12+ yf2] 72-1; 
2(2] := .75 X y[i] T 3 — y[2] + 9 
end 
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The second result of the third set was obtained from a ALARM: 
repeated call as indicated below. nielin := —1; goto END; 
Procedure nielin was tested on many 2 X 2 and 3 X 3 systems. error: 
If, from a given starting guess the process was divergent, the mielin := —2; goto END; 
divergence was apparent after two or three iterations. ALPHA: 
In the case when the auxiliary linear system (10) was singular nielin := —3; 
or a = 0 (11), the obtained approximation was close to the END: end nielin 


required approximation. Then the repeated call of the proce- 
dure with the obtained approximation and the starting value h 
gave the desired result after 3-4 iterations. The last remark 
suggests the following construction of the call of procedure 
nielin: 

REPEAT: k := ntelin (n, h, w, eps, pst, y, 2) 


ee 


REFERENCES: 


1. Counts, J. W. Algorithm 126, Gauss’ method. Comm. ACM 6 
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2. Panxtewicz, W. About some method for solving a system of 
simultaneous nonlinear equations. Proc. of the Symposium: 
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begin 

integer m, 1, k; real alpha, r; 
Boolean b1, 62; array A[l :n, 1 :n+1], of] :n]; 
procedure gauss (wu, a, -y); 
integer U; array a, Y; 
begin 
comment At this point the body of a procedure named 
Gauss (see [1]) must be supplied by the user to solve a 
u X wu linear system whose coefficient matrix is stored in 
the first w rows and u columns of a, whose vector of con- 
stants (right-hand side) is stored in the (u + 1)-th column 
of a, and whose solution is given as y. If the system is singu- 
lar it should execute go to error; 
end gauss; 
m:=0; 
POCZATEKR: 
b1:= true; 062 := false; f(y, z); 
for z.:= 1 step 1 until n do 


begin 
Ali, n+1) := r := 2[z]; 
r t= abs(r); 
b1 := bL Ar < eps; 
b2 := b2 Vr > psi 
end: 


if 61 then go to KONIEC; 

if 62 then go to ALARM; 

for 7 := | step 1 until n do 

begin 
ri=yltl yltl:=r+h; fly, 2); 
for k := 1 step 1 until n do 

Afk, 7] := z[k]; 

yt] i= 34 

end; 

gauss (n, A, v); 

alpha := 1; 

for 7 := 1 step 1 until n do 
alpha := alpha — v{t]; 

if alpha = 0 then goto ALPHA; 

alpha := h/alpha; 

for 7 := 1 step 1 until 2 do 
y{t] := y{t] — o[t] * alpha; 

h:=hxwjym:=m+1; 

go to POCZATEK; 

KONIEC: 
nielin := m; goto END; 
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ADAPTIVELY—NOISE KILLED)* [D1] 
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DESCRIPTION: 

Purpose. SQUANK is an automatic numerical quadrature 
routine. The user provides a = A, and b = BIG, the lower and 
upper limits of integration, the tolerance e1.; = ERROR he re- 
quires, and a function subprogram FUN(X) for the integrand 
f(x). The routine returns Rf = SQUANK, where Rf is an expres- 
sion of the forra Rf = >°%1 w.f(x:) which is an approximation to 
the integral If = f° f(x) dz. 

Hopefully, this approximation is within the claimed accuracy 
étot, 1. | Rf — If| = | eactl < evor. 

The routine returns three other quantities, as arguments. 
These are 

FIFTH—the fifth-order adjustment term. This may be used as 

an error estimate in cases in which round-off error is not sig- 
nificant. 
NO = N—the number of calls to the function subprogram. 

RUM = e¢o1—the claimed accuracy. This is normally the same 

aS €tot , the required tolerance, except in cases in which round- 
off error is significant, when it is higher than eo; . 
Like many other routines, SQUANK is a special purpose routine. 
It is designed to treat efficiently integrands f(x) having both the 
following properties: 
(a) f(z) and its first four derivatives are continuous in the open 
interval (a, 6). 

(b) f(z) does not have high frequency oscillations. 

By experiment the routine has been found efficient for the wider 
class of functions 

(c) g(x) = f(a)|zx — a2, a > 0, where m = a or x = bora = 
(a+b)/2 and f(x) satisfies both (a) and (b) above. 

Construction. The construction of this routine is described in 
detail in [3]. Briefly, it is based on the ideas of the Adaptive 
Simpson Quadrature routine [5-8], referred to below as ASQ, but 
embodies four major modifications: 

(1) a different assignment of allowed error to interval and a 
different interval convergence criterion; 

(2) interval bisection in place of trisection; 

(3) inclusion of an adjustment term to give a result of poly- 
nomial degree 5 in place of degree 2; 

(4) a round-off error guard (which guards against the effects 

of excessive round-off error in function values). 
The first three modifications are of a standard nature. Their 
effect is described below under Comparisons. The fourth mod- 
ification is somewhat unusual and is described by means of an 
example below. 

Round-off Error Guard. The accuracy attainable by any quad- 
rature routine is clearly limited by the accuracy to which the func- 
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tion is evaluated. The effect in an automatic routine of requesting 
an accuracy in excess of the accuracy of the function evaluation 
is described elsewhere [4] and can be catastrophic. SQUANK 
contains a ‘“‘round-off error guard’’ which is Modification 4 of [3]. 
Thus the user may request any tolerance €1o1 , even €to: = 0. The 
routine provides a result which may reflect different accuracies 
over different ranges of xz, the local tolerance level being con- 
strained to remain above the level of the apparent local round-off 
error. The overall estimated accuracy e;o1 is returned as argument 
RUM. 

As an example, the same problem was treated using SQQUANK 
on two different computers. These have machine accuracy param- 
eters ey = 10-" and ey = 10°’, respectively. The problem was to 
evaluate 


If = fii (2? + 10°) dx ~ 3 X 10° 


with various tolerances ¢€:.;. A selection of the results is tabu- 
lated below. 


€m = 1071 €m = 1077 
Etol €tol €act N Eiol €act N 
10-3 10-3 —2.4 * 10-° | 1081 |2.4 * 107-4/9.1 & 10-4 889 


10-* 19.9 K 10-8 |—3.2 XK 107” |12057 |1.5 K 1074/9.1 X 1074/2513 
0 |9.9 X 10-8 |—-3.2 X 107” |14809 {1.5 K 10-3/9.1 XK 1074/2513 


Here éacz is the difference between Rf and If. 

It should be borne in mind that the peak of the integrand is of 
magnitude 10°. Thus the accuracy in function evaluation near the 
peak is about 10“ or 107, respectively. Naturally, the machine 
with smaller word length produced a less accurate result, but at a 
lower cost in function evaluation. No intervention by the user 
was necessary. For a further comparison, the round-off error 
guard was disabled. For e:.: < 10-7, the routine then required 
577,197 function values, but the resulting value Rf was about the 
same. Thus in this example, the round-off error guard cut the 
computation time by a factor of 40. 

The inclusion of this round-off error guard has one serious 
drawback. If the routine is used with an integrand which is discon- 
tinuous, or has a low order discontinuous derivative, SQQUANK 
may take this to be evidence of round-off error and may adjust 
the tolerance. In these cases, the result may have a much lower 
accuracy than requested. However, this value of the accuracy is 
estimated and returned in argument RUM. The number of func- 
tion values required for such a less accurate result is correspond- 
ingly lower. 

Comparisons. Besides the testing carried out by the author, 
SQUANK has been subjected to two independent sets of extensive 
tests in comparison with other quadrature routines [1, 2]. The 
respective authors have kindly made some of their results avail- 
able to me. These tests involve a set of routines, a set of functions, 
and eight different tolerances, all large enough so that round-off 
error is not significant. 

Restricting attention only to functions of type (c) and to the 
two routines SQUANK and ASQ, the following information is 
reported. Of a set of 47 functions, both routines are equally reli- 
able; ASQ is more economic than SQUANK for only one of these. 
For the other 46, SQUANK is more economic, generally by factors 
of about two [1]. Of a set of 14 functions, in all cases SQQUANK is 
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more economic, by factors ranging from 1.4 (at high accuracies) 
to 3 or 4 (at low accuracies) [2]. 


Turning to a general comparison with other routines, certain 


trends are apparent, although there are no clear simple conclu- 
sions. In some cases SQUANK is more economic than other 
routines; in other cases it is obviously much worse. 


REFERENCES: 


1. CasauertTo, J., Picket, M., anp Ricg, J. A comparison of 


2. 


3. 


some numerical integration programs. CSD TR 37, Purdue 
U., Lafayette, Ind., June 1969, and “SIGNUM Newsletter’”’ 


4,3 (Oct. 1969), 30-40. 


Kauaner, D. K. Private communication. See also Compari- 
son of numerical quadrature formulas, LA-4137, Los Alamos 
Sci. Lab., Los Alamos, N.M., June 1969. 

Lyness, J. N. Notes on the adaptive Simpson quadrature 
routine. J. ACM 16 (July 1969), 483-495. 

Lyness, J. N. The effect of inadequate convergence criteria 
in automatic routines. Comput. J. 12 (1969), 279-281. 

McKerrman, W. M. Algorithm 145, Adaptive numerical in- 
tegration by Simpson’s rule. Comm. ACM & (Dec. 1962), 604. 


6. ——. Certification of algorithm 145, Adaptive numerical inte- 
gration by Simpson’s rule. Comm. ACM 6 (Apr. 1963), 167-168. 

7. ——, aNnD Tester, L. Algorithm 182, Nonrecursive adaptive 
integration. Comm. ACM 6 (June 1963), 315. 

8. ——. Algorithm 198, Adaptive integration and multiple inte- 
gration. Comm. ACM 6 (Aug. 1963), 443. 

ALGORITHM: 


NMAODDHAADANDAANDA AANDAAADAANDANDAAADAANGBAADAHANAADAANADAARANDAAADAADAAGAAAANAGBDANAAAAANAANAG 


SeQeUeAenek 


NCTES ON USEe (1) 


NCTES ON USE. (2) 


NCTES ON USE. (3) 


FUNCTION SQUANK (A,B1G, ERROR »FIFTH»RUM»NOSFUN) 


STANDS FOR ® SIMPSON QUADRATURE USED ADAPTIVELY. NOISE KILLED.® 


CALLING PROGRAM REQUIRES 
EXTERNAL FUN 


THIS IS FUNCTION TO 8E INTEGRATED 
4 THE LOWER LIMIT OF INTEGRATION 
BIG THE UPPER LIMIT 


ERROR THE REQUIRED TOLERANCE (ABSOLUTE ERROR) 


OUTPUT 


SQUANK THE FIFTH ORDER RESULT = THIRO + FIFTH 

FIFTH THE FIFTH ORDER ADJUSTMENT TERM 

RUM THE CLAIMED TOLERANCE { ADJUSTED FOR ROUNOOFF ERROR) 
NO THE NUMBER OF FUNCTION EVALUATIONS REQUIREO 


DISCONTINUOUS FUNCTIONS 


THIS ROUTINE IS BASED ON DEGREE 3 ANO DEGREE 5 LOCAL POLYNOMIAL APPROX 
IMATION. CONSEQUENTLY IT SHOULD NOT BE USED WITH FUNCTIONS WHICH HAVE 
DISCONTINUITIES IN THE FOURTH OR LOWER DERIVATIVES WITHIN THE INTERVAL OF 
INTEGRATION. IF THERE ARE SUCH DISCONTINUITIES: THIS ROUTINE WILL TAKE 
THIS TO BE EVIDENCE OF ROUND OFF ERROR IN FUNCTION VALUES AND WILL ADJUST 
THE TOLERANCE. 

IF TRE LOCATIONS OF SUCH DISCONTINUITIES ARE KNOWN, THE ROUTINE MAY 
BE USEC SEPARATELY FOR EACH INTERVAL BETWEEN CONSECUTIVE OLSCONTINUITIESs 
WHILEs LIKE ALL SUCH ROUTINES,» IT DISLIKES DISCONTINUITIESs IT CAN HANDLE 
THEM IF THEY ARE LOCATED AT THE END POINTS OF THE INTEGRATION INTERVALS 


FUNCTIONS WITH HIGH=FREQUENCY OSCILLATIONS.» 


THE ROUTINE WILL RETURN UNRELIABLE RESULTS FOR FUNCTIONS LIKE G(X) 


TIMES COS(100#X). IF THE HIGHEST PERIOO LIKELY TO BE ENCOUNTEREO IS 
KNOWN, THE INTERVAL SHOULO BE SUB-DIVIDEO IN SUCH A WAY THAT» 

(ONE) THERE ARE NOT MORE THAN THREE PERIOOS PER INTERVALe AND 

(TWO) THE PERIOO #P*# OIVIOED BY THE SUB-INTERVAL» #(8 = A}® IS NOT A 


-A SIMPLE FRACTION #N/M# WITH N OR M LESS THAN 9. 


INTERVAL SUB-DIVISION 


THE FAILURES DESCRIBED ABOVE ARE GENERALLY WORSE FOR *SQUANK® THAN FOR 
OTHER RCUTINES BECAUSE *SQUANK* TAKES THE INCONVENIENT BEHAVIOR AS AN 
INDICATION OF ROUND OFF ERROR. IN GENERAL SUB-DIVISION OF THE INTERVAL IS 
ADVOCATED. ESSENTIALLY THE USER CARRIES OUT, UNDER DRIVING PROGRAM 
CONTROL, & SEQUENCE OF CALCULATIONS WHICH SHOULD HAVE BEEN CARRIED QUT 
IN THE SUBROUTINE IN ANY CASE. IN THIS WAY HE PREVENTS CHANCE LOW ORDER 
FALSE CCNVERGENCE AT VIRTUALLY NO ADDITIONAL COST. NOTE THAT THE SUM OF 
THE PARAMETERS #ERROR® FOR THE SUB-INTERVALS SHOULO CORRESPOND TO THE 
VALUE REQUIRED FOR THE WHOLE INTERVALeS 


NIM NUMBERING SYSTEM AND LOGIC 


THE INTERVAL {AsB) IS OEFINED NIM = 1 5 LEVEL = O- 


THE INTERVAL NIM = Ne LEVEL = L IS BISECTED, IF NECESSARY, INTO 
THO INTERVALS, NIM = 2¢N AND NIM = 20N + 1p BOTH AT LEVEL = Ltle 

IF INTERVAL NIM = Ne LEVEL = L DOES NOT CONVERGEs THE NEXT INTERVAL 
CONSICERED IS NIM = 2€Ny LEVEL = Ltle 

IF INTERVAL NIM=Ny LEVEL = L DOES CONVERGE, THE NEXT INTERVAL CONSIDERED 
TS NIM = M(R) + 1 - LEVEL = L-R » WHERE MIR) IS THE FIRST 
EVEN MEMBER OF THE SEQUENCE M(O) = Ne M(S#1} = (MO9)-1)/2. 

GIVES LEVEL = UG» THE CALCULATION IS COMPLETE. 


TF THIS 


SCALING TC AVOIO EXCESSIVE OIVISION BY TWO. 


AAADAAAAAAAADAAAADADADAAAAAAADAAAA ANDANAAAAAAAAADAANNADAANAADAGBAAGAAAAHANIIAANAAAANGDAAAANAAANAO 


aaaa 


anan 


aan 
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THE INTERVALC(X1LeX5) TS OF LENGTH H = X5~-X1le THE POINTS XL eX29X3sX4eX5 
ARE THE POINTS OF QUARTERSECTION OF THES INTERVAK ANO FXLZFAZSFX3,FX4e 
FX5 ARE THE CORRESPONDING FUNCTION VALUES. 

EST IS APPROXIMATION TO (6.0/H)® INTEGRALOX1L»X5)~ 

CESTL*EST2) ES APPROXIMATION TO {12.G/H) © INTEGRAL (X1»X5)~ 

SUM IS APPROXIMATION TO £1260) # INTEGRAL(AsX1) 0 


STORAGE 
M3ST(L) = Oe5e(X5STIL} + XL)- THUS X3ST(L) COULD BE RECALCULATED 
AT EACH STAGE TO AVOID STORAGE. ESTST(L) IS SAME IN THIS RESPECT. 


THE RESULTS OF ABOVE RECALCULATION ARE IDENTICAL MACHINE NUMBERS. 
XSST(L) = XL + (BeA)eC2ee(-L)) © THIS COULD ALSO BE RECALCULATED. BUT 
IN THIS CASE CALCULATION IS EXCESSIVE AND THERE IS A POSSIBILITY OF 
ROUNC OFF ERROR ARISING BECAUSE THE SAME POINT IS BEING CALCULATED 

IN TwO OR MORE OIFFERENT WAYS. 


AVOIDANCE OF ROUND OFF ERROR TROUBLE 


IF INTERVAL DOES NOT CONVERGE, FOLLOWING INTERVAL SHOULO HAVE ADIEF 
VALUE APPROXIMATELY EQUAL TO (1/16) TIMES PREVIOUS AOIFF VALUE, CALLED 
ADIFFL IN THE COOE. THERE IS A THEOREM WHICH STATES THAT» UNLESS THE 
FOURTH DERIVATIVE OF FUNIX) VANISHES IN THE PREVIOUS INTERVAL, ADOIFF 
TS LESS THAN OR EQUAL TO ADIFFL. IF THIS OOES NOT HAPPEN, {T IS TAKEN 
TO BE AN INDICATION OF POSSIBLE ROUND OFF LEVEL. IN THIS CASE, 
UNLESS LEV IS LESS THAN FIVE. THE CURRENT TOLERANCE LEVEL, CEPS») 
TS APPROPRIATELY ADJUSTED. HOWEVER CEPS 1S RESET AS ANO WHEN 
APPEARS THAT IT SHOULD BE ADJUSTED EITHER UP OR DOWN. IT 1S REDUCED 
IF CONVERGENCE OCCURS WITH A NON-ZERO AOIFF STAICTLY LESS THAN 
O.25*CEPS. AN INVOLVED SECTION OF CODING GUARDS TQ SOME EXTENT AGAINST 
AN UNREALISTIC VALUE ARISING AS A RESULT OF A ZERO IN THE FOURTH 
DERIVATIVE. A FACTOR EFACT IS CALCULATED WHICH ADJUSTS THE CLAIMED 
TOLERANCE TO TAKE INTO ACCOUNT THESE ALTERATIONS IN THE TOLERANCE LEVEL es 
THE ROUTINE ENTERS THESE INVOLVED SECTIONS OF CODING ONLY IF ROUND 
OFF ERRCR APPEARS TO BE PRESENTs IN A NORMAL (ROUNO OFF ERROR FREE) 
RUN, THESE SECTIONS ARE SKIPPED AT A COST OF A SINGLE COMPARISON PER 
ITERATICN (TWO FUNCTION EVALUATIONS )o 


ARBITRARY CONSTANTS 


(1) NO CONVERGENCE IS ALLOWED AT LEVEL = @#@#ZEROee, 


(3) PHYSICAL LIMIT. 


THE FOLLOWING CONSTANTS HAVE BEEN ASSIGNED IN THE LIGHT OF EXPERIENCE 
WITH NC THEORETICAL JUSTIFICATIONS 
THIS MEANS THAT THE 


ROUTINE IS CONSTRAINED TO BASE THE RESULT ON AT LEAST 9 FUNCTION VALUESe 


£2) NO UPWARD ADJUSTMENT OF THE TOLERANCE LEVEL IS CONSIOERED AT LEVELS 


LOWER THAN LEVEL = ##FIVE*#. THE POINT SPACING IS THEN (BIG-A)/128.0- 


HIGHEST LEVEL ALLOWED IS LEVEL # e@THIRTYee,. HERE 


CONVERGENCE IS ASSIGNED WHETHER OR NOT THE INTERVAL HAS CONVERGEOe THE 
POINT SPACING IS THEN ABOUT (BIG~A)#2.08 10ee-1U, 


(4) UPWARO ADJUSTMENT OF TOLERANCE LEVEL IS LIMITED IN GENERAL TO 


A FACTOR ##2.08% OR LESS. 


(5) DOWNWAKC ADJUSTMENT OF TOLERANCE LEVEL IS INHIBETEO IN GENERAL UNLESS 


BY A FACTOR GREATER THAN ##4,.088, 


SOME NOTATION 


205 


210 


215 


220 
295 


SUM ANC SIM ARE RUNNING SUMS, INCREASED AT STAGE EIGHT. THEY ARE 
RESPECTIVELY 212.0 * ( THIRO ORDER APPROXIMATION TG THE INTEGRAL } 
AND -180.0 © ( FIFTH ORDER ADJUSTMENT TO THE INTEGRAL). 

CEPSF IS THE REQUIRED (SCALED) TOLERANCE. 

CEPS IS THE RUNNING VALUE OF THE ADJUSTED TOLERANCE. 

QcEPS = 0.25 « CEPS 

LEVTAG = -1 OR 04203 INDICATES WHETHER TOLERANCE IS NOT OR IS 

CURRENTLY ADJUSTED. ¢ SEE COMMENT IN STAGE SEVENe) 

EFACT IS RUNNING SUM CORRESPONDING TO 180.0 # RUM 

FACERR = 1560 OR 146 OEFPENDING ON WHETHER TOLERANCE IS OR IS NOT 
CURRENTLY ADJUSTED. IF IT 1S, THERE IS NO JUSTIFICATION FOR THE 
FIFTH ORCER ADJUSTMENT ANDO ACCURACY IS NOT EXPECTED TO BE (1/15) TIMES 
OIFFERENCE OF APPROXIMATICNS. FACERR = 15.0 REMOVES THE BUILT IN 15.0 
FACTCR FOR CALCULATION OF EFACT. 

EPMACH THE MACHINE ACCURACY PARAMETER. THE ROUND OFF ERROR GUARD DOES 
NOT REQUIRE THIS NUMBER. IT IS MACHINE INDEPENDENTe THIS IS ONLY 
USED TO HELP IN AN INITIAL GUESS IN STAGE TWO IF THE VALUE OF ERROR 
HAPPENS TO BE ZERO. ANY NON-ZERO NUMBER MAY BE USED INSTEAD, WITH A 
VERY SMALL PENALTY IN NUMBER OF FUNCTION EVALUATIONS IF A COMPLETELY 
UNREASONABLE NUMBER IS USED. 


DIMENSION FX3ST(30)eX3ST(30) sESTSTUA0) sFXSST(30) »XSST(30) 
DIMENSICN PREOIF(36) 

DOUBLE PRECISION SUMsSIM 

EPMACH = 0.0000v00000075 


sone STAGE ONE euee 
UNITIALISE ALL QUANTITIES REQUIREO FOR CENTRAL CALCULATION (STAGE 3)6 
SUM = 0.0 
SIM = 0.6 


CEPSF = 180.0#ERROR/(BIG — A} 
CEPS = CEPSF 


AOIFF = 0.6 
LEVTAG = =1 
FACERR = 1.€ 
XZERO = A 
EFACT = €.0 
NIM = 1 

LEV = 0 

FIRST INTERVAL 
Xl =A 

X5 = BIG 


X32 = O.5#(A+BIG) 

FXl= FUN(XL) 

FX3= FUN(X3) 

FXS= FUN(X5) 

NO = 3 

EST = FX1 + FX5 + 4,0@FX3 
eeee STAGE TwO seen 

SET A STARTING VALUE FOR TOLERANCE IN CASE THAT CEPSF = 0-0 


IF(CEPSF) 295,205,295 
LEVTAG = 0 

FACERR = 15.u 

CEPS = EPMACHeABS (FXL) 
TFCFXE) 295-4210,295 
CEPS = EPMACH#ABS {(FX3) 
LEVTAG = 3 

TF(FX3) 295,5215,295 
CEPS = EPMACH®ABS (FX5) 
TFCFX5) 295,220,295 
CEPS = EPMACH 

QCEPS = G.25eCEPS 
INITIALESING COMPLETE 


STAGE THREE 


eens 
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300 


305 


aaganaaqaaaa 


400 


ag 


410 
415 


aanaa 


500 


aaanaaa 


700 


705 


N10 


a ane anaanaa a ananana 


715 
729 
725 


730 
135 


740 


150 
765 
770 


775 
780 


aooan 


CENTRAL CALCULATION. 
REQUIRES X1eX3_X5,FXL sFX3eFXSsEST ,ADIF Fe 


CONTINUE 

X2 = Oo5Se(X1L + X3) 

X4 = 0050(X3 + X5) 

FX22 FUNC(X2) 

FX4= FUN(X4) 

NO = NO + 2 

ESTL = FXL + 4.G8FX2 ¢ FX3 

EST2 = FX3 + 400®FX4 # FXS 

AOIFFR = ADIFF 

OIFF = EST + EST = ESTL — EST2 

IF (LEV - 30) 305,800,800 

AODIFF= ABS (OIFF) 

CRIT = ACIFF - CEPS 

TF(CRIT) 7002700.400 

ENO OF CENTRAL LOOP 

NEXT STAGE IS STAGE FOUR IN CASE OF NO NATURAL CONVERGENCE 
NEXT STAGE IS STAGE SEVEN IN CASE OF NATURAL CONVERGENCE 


eous STAGE FOUR aees 
NO NATURAL CONVERGENCE. A COMPLEX SEQUENCE OF INSTRUCTIONS 
FOLLOWS WHICH ASSIGNS CONVERGENCE AND / OR ALTERS TOLERANCE 
LEVEL IN UPWARD OIRECTION IF THERE ARE INDICATIONS OF ROUND OFF 
ERROR. 


CONTINUE 
TFCADIFFL - ADIFF) 4105 41G-4 500 
IN A NORMAL RUN WITH NO ROUND OFF ERRGR PROBLEM, AOIFFL 1S GREATER THAN 
ACIFF ANO THE REST OF STAGE FOUR IS OMITTED. 
TFC LEV ~ 5) 500+415,415 
EFACT = EFACT + CEPS #{X1 = XZEROJ@FACERR 
XZERO = XL 
FACERR = 15.0 
THE REST OF STAGE FOUR DEALS WITH UPWARD ADJUSTMENT OF TOLERANCE (CEPS) 
BECAUSE OF SUSPECTEO ROUND OFF ERROR TROUBLE. 
TFCADIFF-2.08CEPS) 42094204425 
SMALL JUMP IN CEPS. ASSIGN CONVERGENCE 
CEPS = ADIFF 
LEVTAG = 9 
GO TO 78C 
I1F( ADIFFL - ADIFF) 4355430,435 
LARGE JUMP IN CEPS 
CEPS = ADIFF 
GO TO 445 
FACTOR TwO JUMP IN CEPS 
CEPS = 2,.0eCEPS 
TFOCLEVTAG = 3) 44054459445 
LEVTAG = 2 
QCEPS = G.25eCEPS 


eave STAGE FIVE anne 
NO ACTUAL CONVERGENCE. 
STORE RIGHT HAND ELEMENTS 


CONTINUE 

NIM = 20NIM 

LEV = Lev +1 
ESTST(LEV) = EST2 
X3ST(LEV)= Ke 
X5STCLEV)= Xx§ 
FRIST(LEV) =FX4 
FXSST(LEV)=FXS 
PREDIFCLEV) = ADIFF 


aese STAGE SIX anee 
SET UP CUANTITIES FOR CENTRAL CALCULATION. 


READY TC GO AHEAD AT LEVEL LCWER WITH LEFT HAND ELEMENTS 
Xl AND FX1 ARE THE SAME AS BEFORE 


x5 = X3 
X32 = X2 
FAS = &X3 
FX3 = FX2 
EST = EST1 
GO TO 30¢ 
eese STAGE SEVEN sane 


NATURAL CONVERGENCE IN PREVIOUS INTERVAL. THE FOLLOWING COMPLEX SEQUENCE 
CHECKS PRIMARILY THAT TOLERANCE LEVEL IS NOT TOO HIGHe UNDER CERTAIN 
CIRCUMSTANCES NON CONVERGENCE IS ASSIGNED AND / OR TOLERANCE LEVEL 

IS RE-SET. 


CONTINUE 
CHECK THAT IT WAS NOT LEVEL ZERO INTERVALeIF SO ASSIGN NON CONVERGENCE 
IFC LEV } 400.400, 705 


LEVTAG =-1] CEPS = CEPSF, ITS ORIGINAL VALUE. 

LEVTAG = oO CEPS IS GREATER THANCEPSF. REGULAR SITUATION. 

LEVTAG = 2 CEPS IS GREATER THANCEPSF.CEPS PREVIOUSLY ASKED FOR A BIG 
JUMP, BUT DID NOT GET ONE. 

LEVTAG = 3 CEPS IS GREATER THANCEPSF.CEPS PREVIOUSLY HAD A BIG JUMP. 


IF(LEVTAG) 800+710,719 
IN A NCRMAL RUN WITH NO ROUNO OFF ERROR PROBLEM, LEVTAG = -1 ANDO THE 
REST OF STAGE SEVEN IS OMITTED. 

CEPST = 15.C#CEPS 
CEPST HERE IS FACERR*CURRENT VALUE OF CEPS 
TF(CRIT) 715.900, 800 

IF(LEVTAG = 2) 720,740,750 
LEVTAG = 0 

IFCADIFF) 869,800,725 
IFCADIFF = QCEPS) 7309800866 

LFCAOIFF - CEPSF) 77097704735 

LEVTAG = 0 

CEPS = ADIFF 

EFACT = E€FACT + CEPST#(XL = XZERO) 

XZERO = Xt 

GO TO 445 
LEVTAG = 2 

LEVTAG = 0 

IE CADIFF) 76557659725 
LEVTAG = 3 

LEVTAG = 0 

LF(AOIFF) 77557759730 

CEPS = ACIFFL 

GO TO 775 

LEVTAG = -k 

FACERR = 1.0 

CEPS = CEPSF 

EFACT = EFACT + CEPST#(X1 — XZERO) 

XZERO = X1 

CONTINUE 

QCEPS = C.25eCEPS 

ones STAGE EIGHT anee 
ACTUAL CONVERGENCE IN PREVIOUS INTERVAL. INCREMENTS ADDED INTO 
RUNNING SUMS 


AOC INTG SUM ANDO SIM 
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800 CONTINUE 

SUM = SUM #4(ESTI+EST2)0(X5-X1) 

TF(LEVTAG) 805,810,810 

WE AOD INTO SIM ONLY IF WE ARE CLEAR OF ROUND OFF LEVELe 
805 SIM = SIM + OLFFS(X5=x1) 
B10 CONTINUE 


oO 


sees STAGE NINE enee 
SORT OLT WHICH LEVEL TO GO TOs THIS INVOLVES NIM NUMBERING SYSTEM 
DESCRIBED BEFORE STAGE ONE. 


aAaAaaAg 


905 NUM = NIM/2 
NOM-= NIK = 2@NUM 
TF(NOM) 910,915,910 
910 NIM = NUM 
Lev = Level 
GO TO 9¢5 
915 NIM = NIP ¢ 1 


c NEW LEVEL IS SET. IF LEV=0 WE HAVE FINISHED | 
IF( LEV ) 1100,1100,1000 

Cc 

Cc eoen STAGE TEN eece 

c SET UP CUANTITIES FOR CENTRAL CALCULATIONS 

c 

1000 CONTINUE 

XL = X5 
Fxis FXS 


X3 = X3ST(LEV) 

x5 = XS5ST(LEV) 

FX3=2 FX3ST(LEV) 
FX5= FXSST(LEV) 
EST= ESTST{LEV) 
ADIFF = PRECIF(LEV) 
GO TO 360 


weoue STAGE ELEVEN seee 
CALCULATION NOW COMPLETE. FINALISEe 


DAA 


1100 CONTINUE 
EFACT = EFACT + CEPS #(BIG= XZERO)*FACERR 
RUM = EFACT/160.0 
THIRDS SUM/12.0 
FIFTH ==SIM/1860.0 
SQUANK = THIRO + FIFTH 
RETURN 
c ENO OF SQUANK 
ENC 
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Squank (Simpson Quadrature Used Adaptively— 
Noise Killed) [J.N. Lyness, Comm. ACM 13 (Apr. 
1970), 260-263] 


P. Hallet and E. Mund [Recd. 18 Jan. 1971 and 27 


Apr. 1971] 
Service de Métrologie Nucléaire, Université Libre de 


Bruxelles, Brussels, Belgium 
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Key Words and Phrases: numerical integration, integration rule, 
adaptive integration, automatic integration, Simpson’s rule, 
numerical quadrature, quadrature rule, adaptive quadrature, 
automatic quadrature, round-off error control 

CR Categories: 5.16 


The algorithm was compiled and run without corrections on a 
CDC-6400 with a machine accuracy parameter of 0.7 «x 1074. 
Our purpose was to test SQUANK’s ability to integrate a function 
blurred by random noise, and so the function FUN(X) is the result 
of applying a random perturbation R to some regular function 
f(x), either by adding R to x before computing /, hereafter referred 
to as “‘x-noise’, or by adding R to f after having computed it, 
“‘y-noise’’. R is taken as 


R = C* (2. * RANF(X) — 1.) 


where C is the noise amplitude and RANF is a system function 
generating pseudorandom numbers (0. < RANF(X) < 1.) 

Our test program called SQUANK 100 times in situations 
involving all combinations of noise amplitude C = 10-?, 10-4, 10-6, 
10-8, 10-, required tolerance «; = 10-4, 10-®, 10-8, 10719, 1072, 
both noise types and the two functions fi(x) = ki exp(x) and 
So(x) = ke(1 + 104 x?) integrated on [0, 1]. The constants k, 
and k. were chosen to normalize unblurred integrals to unity so 
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that errors. and tolerances may be seen as absolute or relative. 

A rough calculation shows that y-noise causes in both integrals 
a deviation D that shouldn’t exceed C. For x-noise, with f(x + R) ~ 
f(x) + Rf'(x), D shouldn’t exceed respectively C and k2C (meaning 
that the second function is oversensitive to x-noise by a factor 
ky ce 200/z). 

The test program was run five times, yielding different results 
because the random perturbations were irreproducible. The fol- 
lowing quantities were kept and averaged over the five runs. 
| eg | actual error (specifically, e2 is the difference SQUANK(---) 

— 1.0). 

«; error estimate (specifically, «,; is the value of parameter RUM 
as returned by SQUANK). 
N number of function evaluations. 


A sample of these averaged results is given in Table I. 


Table I 

i Si(x), x-noise fi(x), y-noise 
© °. | €2 ee. N | e | 63 N 
10-8 1072 6.3 10° 5.5 10-9 11278 | 8.4 10719 5.9 10-9 8416 
10-8 1074 7.9 1079 1.0 1074 9 | 9.2 1079 1.0 10-4 9 
10-2 1072 | 7.6 10-4 4.6 1073 6986 | 8.9 10-4 5.3 10-3 8747 
107 10-4 | 1.6 107 1.5 1073 106 | 1.3 10-3 1.6 1073 59 
C é S2(x), x-naise S2(x), y-noise 

| €2 | 6 N | 2 | 6 N 
10-8 107-2 2.9 1078 4.41077 35956) 1.8 1079 §.5 10-9 63254 
10-8 107-4 5.8 1075 1.0 10-4 77 | 5.8 1076 1.0 10-4 77 
10-2 10712 6.3 10-2 3.5 1071 9127 | 1.3 1073 5.2 1073 8496 
10-2 {074 6.9 1072 4.2 1071 3896 | 7.2 10-4 2.5 10-3 205 


In 487 of the 500 calls it was found that SQUANK’s accuracy 
estimate of its own result was reliable, i.e. that e, > | e2|. For the 
remaining 13 calls, the ratio | e2 |/e; ranged from 1 to 20 (in the 
worst cases, 6; ~ «1 < C, just as if SQUANK had failed to notice 
the presence of the noise). 

In 473 of the 500 calls it was found that SQUANK’s estimation 
was as good as could be reasonably expected, i.e., that 
6 < max (D, «.) = e. For the remaining 27 calls (all of them for 
fr) the ratio e/e, never exceeded 1.15 (note that the test was made on 
e,, not on the actual error | «2 |). 
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ALGORITHM 380 
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KEY WORDS AND PHRASES: 
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rectangular matrix, transpose 


DESCRIPTION: 

The matrix (n X m) is assumed to be stored, column by 
column, in the one-dimensional array A, of length m X n. 
Then the position Jl of the element a; is A(Ji) where 
Jl =n X (j—1) +17. This element must be moved to the position 
J2 of the element a;; which is given by J2 = m X (:—1) + j and 
these two locations are related by the expression 


(J2—1) = m X (J1-1) — (mXn—1) X [J1—1)/n] 


where [e] indicates integer part of e. 

It is more convenient to work in terms of index I[1, taking values 
from 0 to K = m X n — 1, which gives the expression I2 = m X 
Ii — K xX [I1/n] and the value in A(I1+1) must be moved to 
A(I2+1). By repeating this formula we find that the transposi- 
tion consists of a series of ‘‘loops’’, Il > I2 > I8 > --- >I. 

We also note that this process is symmetric. For example, if I 
is the smallest value in a loop then K — I is the largest value of a 
loop, although both these values may in fact belong to the same 
loop. 

This is a special case of a more general result, which may be 
stated as follows: 

Theorem. If 1-7 12> 13---—>I1 ts a loop, then (K—I1) > 
(K—12) —+ (K—I3) --+ — (K—I1) ts also a loop. 

Comments on Theorem. This may be two representations of 
the same loop or it may describe a “symmetric” pair of loops. 
For a 2 X 8 matrix the process generates the following loops, 
where K = 2X 8—1 = 15. 


(a) I 1- 25> 4> 8-> 1 
K —I 4-5 7-11-> 13- 14 

(b) I 56-10—> 5 
K —I 10—> 5— 10 

(c) I 3—- 6-12-45 9— 383 
K—-I 12- 9-> 3- 6—> 12 


Case (a) is an example of a ‘“‘symmetric”’ pair of loops. Cases (b) 
and (c) are both examples of a duplicated loop. An idealized 
picture of the circuits for this example is given in Figure 1, where 
the closed curves only indicate the range of the circuits, since the 
actual directed paths will be intertwined in a more complex 
manner. 

Proof of Theorem. It is sufficient to show that 


(K—I2) = m X (K~—I1) — K X [K—I1)/n] --- (1) 


where I2 is generated from I1 by the expression given in the first 
paragraph. 
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Now 
(K-12) = K-—mxX 11+ K &X [I1/n]. 
Let 
[T1/n] = Li, [(K—I11)/n] = Lr. 
Hence it is required to prove that 


K-—-mxXti+KxX mX (K-ll)-KX kl, 


or 
K X Le = K X (m—-1—Ih), 
or ‘ 
é K x La =K x Ls ’ 
where 


L3=m—1—-—Ih. 


Note LZ; , Le and ZL; are integer. 
Let 
| (i/n) -In=a, ((K-—Il)/n) -L2 =e. 


From the definition of { ] in paragraph one, it is obvious that 


«: and e2 satisfy the inequalities 


O0O<a<1l-—I/n, 


Ose<S1i—I1/n. 


Now 


- KX (K-I1)/n) = K X& ((mXn—-1-—II)/n) 
K X (m—1/n—II1/n) 
K X (m—1-—I1/n+ (1—1/n)) 


K X (s—-at(1—I1/n)). 


Hou ou 


Also 

K & (K-—II1)/n) = K X (Lo+e2), 

K X (Ls—at+(1—-1/n)) = K X (Le+e2), 
or 

Ls —™ €1 + (1—1/n) = Ly + €2, 
or 

Lis com Le = €& + €— (1—1/n). 
Therefore 


0+0-— (-I/n) ¢< Ls -—I2<52X (l—-1/n) — (-I/n), 


or 


[|Ze-Le2| <1 - x 
n 


Since Lz and Ls aré integer and differ by less than unity, L2 must 
equal Z;, hence K X L2 = K X Zs , which implies that (1) is true. 

Method. Each matrix will contain two or more “single ele- 
ments,”’ that is, loops consisting of only one point. The condition 


Fig. 1 
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for this is 
Il=mxXI-—-K XX [I/n]. 


Writing I = a X n+ 0 and inserting K = m X n — 1, this condi- 
tion becomes 


a(n—1) = b(m—1). 


We shall always have the two pairs of integers (0, 0) and (m—1, 
n—1) giving I = 0 andI = K as single elements. 

The method used in the subroutine is as follows. First the 
number of ‘single elements’? in the array is calculated and 
NCOUNT is set equal to this value; then, starting with variables 
I = 1 and MAX = K + 1, we search through the array, moving 
the elements in each loop once until all the elements have been 
moved. The variable NCOUNT is used to record how many ele- 
merts have been moved and the process is terminated when 
NCOUNT = man. For each value of I, the loop generated by I is 
examined, and if it contains any values less than 1 or greater 
than MAX, then we know that this loop has already been moved 
and so go. on to examine the next value of I. If, however, I is the 
smallest value, the elements in this loop are moved round and at 
the same time a test is made to see if K — I also belongs to this 
loop. Each time a loop is completed NCOUNT is tested against 
mn. If K — I has been included in the loop for which I was smallest 
value, MAX is set equal to K — I and we return to statement 
number 20 to examine the next value of I. If K — I does not belong 
to the same loop as I, then the elements in the loop generated by 
K — I are also moved before returning to label 20. 

The process is further speeded up by use of an array MOVE, 
dimension IWRK. Initially all elements of MOVE are set equal 
to 0 and whenever element I is moved, MOVE(I) is set equal to 2. 
Hence, so long as I < IWRK, it is possible to detect whether or 
not the loop generated by I has been moved without calculating 
values around the loop. The value of IWRK to give the shortest 
possible time depends only on m and n, but so far it has not been 
possible to give a theoretical expression for this value. In 93 
percent of the cases examined, the value of IWRK = | 4(m+n)| 
was large enough for the transposition to be completed before I 
exceeded IWRK. Since this condition gives the minimum execu- 


TABLE I 
Size MXN Alg. 302 WERE XC@IC™) Tae Nie bis Susie 
(M-++N)/2 (T1+T2) (T1+-T3) 
MXN T1 (sec.) T2 (sec.) T3(sec.) 
7 X 60 0.56 0.29 0.640 0.37 0.426 
7 xX 70 0.96 0.32 1.012 0.37 0.897 
7 xX 80 0.68** 0.57 0.180 0.63 0.078 
7 xX 90 1.24 0.43 0.978 0.71 0.547 
7 X 100 1.49 0.46 1.057 0.56 0.911 
8 X 60 0.97 0.31 1.045 0.25 1.192 
8 X 70 0.83 0.61 0.300 0.75 0.105 
8 X 80 1.08** 0.78 0.319 1.02 0.061 
8 X 90 1.50 0.46 1.065 0.37 1.216 
8 X 100 1.60 0.71 0.766 1.05 0.419 
9 X 60 0.91 0.47 0.649 0.63 0.365 
9 X 70 1.11 0.57 0.641 0.82 0.298 
9 X 80 1.53 0.46 1.082 0.37 1.223 
9 X 90 1,92 0.51 1.156 0.62 1,028 
9 X 100 1.88 0.63 0.997 1.10 0.528 
45 X 50 4,89 2.09 0.804 2.89 0.515 
45 < 60 6.10 1.59 1.1738 1.38 1.261 
46 X 50 4.57 2.69 0.519 3.76 0.195 
46 X 60 5.99 2.88 0.701 4.16 0.361 
47 X 50 4.92 3.22 0.416 3.96 0.216 
47 X 60 6.59 1.66 1.195 1.45 1.279 
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TABLE II 


Percentage of results lying 
within this range 


Range of values of 
(T1 — T2)/4(T1 + T2) 


—0.5 to 0.0 6.4% 
0.0 to 0.5 29.5% 
0.5 to 1.2 64.1% 


tion time, we suggest the value | 4(m-+n)| for the length of the 
array MOVE. 

This routine has been compared with a Fortran version of 
Algorithm 302 [2]. In the cases where the transpose is effected in 
a few loops each containing a large number of elements, our 
routine is very efficient, in many cases halving the time needed by 
Algorithm 302. It is less efficient for cases with a large number of 
loops, but in the cases where Algorithm 302 is faster, the dif- 
ference in time is small. 

Our method for the rectangular arrays is similar to that at- 
tributed to J. G. Gower in the paper by P. F. Windley [1] but using 
our concept of ‘‘symmetry”’ greatly improves the efficiency of the 
process. The case of a square array is detected and treated sepa- 
rately, exchanging pairs a,; and a;; instead of testing for loops. 

Resulis. The execution times Tl, for the ForTRAN version of 
Algorithm 302, and T2, for our routine TRANS with IWRK = 
| #(m-+n)|, are given in Table I for a selection of matrices. The 
column T3 gives execution times for a version of TRANS from 
which all references to the array MOVE have been deleted. On the 
basis of more than 150 tests of this type, in which the relative 
difference between T1 and T2 was determined, only 6.4 percent 
gave a result favorable to Algorithm 302. A summary of the re- 
sults of these tests is given in Table IT. 

The stars by the values of T1 indicate the condition T4 < 
Tl < T5 where T4 and T5 are execution times for TRANS with 
IWRK = | $(mXn)| and IWRK = I respectively. In these cases, 
the length of IWRK determines whether Algorithm 302 or TRANS 
is the quicker. 

All the execution times refer to the ICL KDF9 computer. 


Acknowledgments. The authors wish to thank Dr. 8S. H. 
Hollingdale, director of the Computer Centre, for his support and 
encouragement. 
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ALGORITHM: 


A_1S°A ONE-DIMENSIONAL ARRAY OF ENGIN HRGREN, WHICH 
CONTAINS THE MXN MATRIX TO BE TRANSPOSED (STORED 
COLUMWISE) «MOVE IS A ONE-DIMENSIONAL ARRAY OF LENGTH IWRK 
USED TO STORE IMFORMATION TO SPEED UP THE PROCESSe THE 
VALUE IWRK#®(M4+N)/2 IS RECOMMENDEDe IOK INOICATES THE 
SUCCESS OR FAILURE OF THE ROUTINE. 
NORMAL RETURN IOK #0 
ERRORS 1OK= -1 »MN NOT EQUAL TO M#N. 

IOK= -2 sIWRK NEGATIVE OR ZERC. 

TOKeGTeO» (SHOULD NEVER OCCUR) eIN THIS CASE 
WE SET IOK EQUAL TO THE FINAL VALUE OF I WHEN THE SEARCH 
IS COMPLETED BUT SOME LOOPS HAVE NOT BEEN MOVED. 

DIMENSION A(MN) sMOVECIWRK) 


NAN ANANANNKRNNARABRADN 


CHECK ARGUMENTS AND INITIALISE 


IF (MeL Te2eOReNelLTo2)GO0 TO 60 
IF (MNeNEoM#N) GO TO 92 
IF CIWRKeLT21IGO TO 93 
IF(MeEQeN) GO TO 70 
NCOUNT=2 
M2=M-2 
DO 10 Tele TWRK 

10 MOVE(I)#*0 
IF (M2eLTe1)GO TO 12 


¢c 
C COUNT NUMBER»sNCOUNT S0F SINGLE POINTS. 
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DO 11 I[A=19M2 
IB@IA*(N-1)/ (M1) 
IF CLA®(N-1) NEC IB¥(M=1}9GO TO 11 
NCOUNT2NCOUNT 41 
T=LA#NntI1B 
IF (I .GTeIWRKIGO TO 11 
MOVE(1)=1 
1l CONTINUE 


c 
C SET INTITAL VALUES FOR SEARCHe 
12 K=MN~-1 

KMI=K~-1 
MAX2MN 
1#1 

AT LEAST ONE LOOP MUST BE RE-ARRANGEDe 
GO TO 30 


SEARCH FOR LOOPS TO REARRANGE. 


nanan ANN 


20 MAKeK-I 
Te[+1 
KMIsK-I 
IF d(T eGTeMAX) GO TO 90 
IFCLeGTeIWRKIGO TO 21 
IF(MOVE(T) eLTel)GO TO 30 
GO TO 20 

él TFC TeEGeM*I~K¥(T/N)) GO TO 20 
11s] 

22 T2=M¥IL-K*(1T1/N) 
ieigreatt eORe 12eGEeMAX) GO TO 23 


GO TO 22 
23 IF(T2eNE21}GO TO 20 


Cc 
C REARRANGE ELEMENTS OF A LOOP. 
C 


30 1151 
31 BsA(11+1) 
32 T2=M¥1I—-K*¥(T1/N) 
IF (IT 1LeLEeIWRK)MOVE(11)=2 
33 NCOUNT=NCOUNT41 
IF CIT 2cEGsTsOReT2eGEeKMI) GO TC 35 
34 ACI1+1)5A(1241) 
Tl=12 
GO TO 32 
35 TF OMAXeEQeKMI.OReT205Qe1) GO TO 4 
MAX=KMI 
GO TO 34 


Cc 
C TEST FOR SYMMETRIC PAIR OF LOOPS. 
C : 


41 ACTi+1)=8 
IF (NCOUNT eGEeMN} GO TO 60 
TF C12 eEQ eMAXeOReMAXsEQeKMI} GO TO 20 
MAX=KMI 
11=MAX 
GO TO 31 


NORMAL RETURNe 


ana 


60 1OK=0 
RETURN 


IF MATRIX IS SQUAREsEXCHANGE ELEMENTS A(IeJ) AND 


nan 


7U NI=N-1 
DO 71 IT=lsNl 
JL=I+1 
DO 71 JEJleN 
T1l=I+(J-1)*N 
T2sd+( 1-1) #M 
beA(11) 
ACI1)#A(12) 
A(T2)=8 
71 CONTINUE 
GO TO 60 


c 
C ERROR RETURNS» 
Cc 


90 I1O0K=1 

91 RETURN 

92 10K=-1 
GO TO 91 

93 IOK=~2 
GO TO 91 


END 


1 


Aldslde 
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Laflin and Brebner compared the execution times of their 
transposition algorithm (Algorithm 380) and Algorithm 302 [1] 
when run on an ICL KDF 9 computer. This comparison showed 
Algorithm 380 to be faster than Algorithm 302, in most cases. In 
order to generalize this comparison, the same matrix transposi- 
tions were run on the CU CDC 6400 computer. Table I shows the 


TABLE I. On tHe ICL KDF-—-9 


Range of values of Percentage of results lying 


(T1 — T2)/§(T1 + T2)* within this range 
—0.5 to 0.0 6.4% 
0.0 to 0.5 29.5% 
0.5 to 1.2 64.1% 


TABLE II. On tHe CDC 6400 


Range of values of Percentage of results lying 
(Ti — T2)/$(T1 + T2)* within this range 
—0.7 to 0.0 15.4% 
0.0 to 0.5 28.9% 
0.5 to 1.3 55.7% 


*T1 refers to execution time for Algorithm 380. T2 refers to execution for Algorithm 
302. 


results from the KDF 9 computer and Table II, the results from 
the 6400. It should be noted that Algorithm 380 did not enjoy as 
great an advantage on the 6400 as on the KDF 9. 
REFERENCES: 
1. Booturoyp, J. Algorithm 302, Transpose vector stored array. 
Comm. ACM 10 (May 1967), 292-293. 
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Certification of Algorithm 380 [F1] 

In-Situ Transposition of a Rectangular Matrix [Susan 
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324-326] 


I.D.G. Macleod [Recd. 25 Aug. 1970] 

Department of Engineering Physics, Research School 
of Physical Sciences, The Australian National 
University, Canberra, Australia, 2600 


Key Words and Phrases: rectangular matrix, transpose 
CR Categories: 5.14 


Algorithm 380 (i.e. subroutine TRANS) has been extensively 
tested using FORTRAN IV (level G) on the A.N.U’s IBM System 
360 model 50; the test matrices were correctly transposed in every 
case. It should be pointed out that the FORTRAN convention of 
column-major storage of the input matrix is assumed in TRANS. 
Implementations which assume row-major matrix storage will 
have to be appropriately modified. 

Some unnecessary computation can be avoided by changing: 


21 IF(LEQ.M*I-K*(U/N)) GO TO 20 
li=I 

22 2=M*ll —K*(1/N) 
IF(12.LE.1.OR.12.GE.MAX) GO TO 23 
In=1[2 
GO TO 22 

23 IF(2.NE.l) GO TO 20 

to: 

21 12 =M*l —K*(I/N) 
IF(12.LE.1.OR.I2.GE.MAX) GO TO 20 

22 12 =M*I2—K*(1I2/N) 


IF (12.GT.LLAND.2.LT.MAX) GO TO 22 
IF(I2.NE.1) GO TO 20 


As an extension of the timing tests reported by Laflin and 
Brebner, and Lachenmaier [1], four versions of TRANS were 
timed against TRANSPOSE [2] and PERMUTE [3], using FOR- 
TRAN IV G for all routines. As in the case of TRANS, the method 
employed in PERMUTE is similar to that attributed by Windley 
[4] to J.G. Gower, but PERMUTE is intended for general permu- 
tations and hence does not take advantage of the symmetry pres- 
ent in in-situ transpositions. Execution times on the A.N.U’s 
IBM System 360 model 50 for the test set of 21 matrices given by 
Laflin and Brebner are summarized in Table I; further tests on 
this machine have confirmed the relative efficiencies indicated. 


Table I 


‘Roiiting Execution 
time (sec) 

Original version of TRANS 

(i) IWRK = (M+ N)/2 9 

(ii) No MOVE 12. 

TRANS modified as 

recommended above 


(i) IWRK = (M+ N)/2 8.2 
(ii) No MOVE 10.8 
TRANS POSE 18.9 
PERMUTE 13.7 
References 
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Comput. J. 2 (Apr. 1959), 47-48. 
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ALGORITHM 381 

RANDOM VECTORS UNIFORM IN 

SOLID ANGLE [G5] 

Ropert E. Knop* (Reed 20 Nov. 1969 and 20 Jan. 1970) 

Department of Physics, Rutgers University, New Bruns- 
wick, NJ 08903 


* This work was supported in part by the National Science 
Foundation. 


KEY WORDS AND PHRASES: random number, random vector, 
random number generator, probability distribution, frequency 
distribution, simulation, Monte Carlo 

CR CATEGORIES: 5.5 


procedure unisph (X, Y, Z); 
real X, Y, Z; 

comment This procedure generates the components of random 
unit vectors distributed uniformly in solid angle. Let Z be the 
polar axis, @ the polar angle, and ¢ the azimuthal angle. The 
arguments returned may then be written as: 

X = sin(@) X cos(¢) Y, = sin(@) X sin(¢), Z = cos(@) 

In this algorithm, R11 represents a procedure which returns 
random numbers which are distributed uniformly over the in- 
terval (—1, 1) [1]. The algorithm operates by the method of re- 
jection [2]. The variables X and Y are first sampled from the 
uniform distribution over the interval (—1, 1). After rejecting 
points outside of the unit disk, we may transform variables 
from X, Y to, S by use of the formulas X = sgrt(S) X cos(¢) 
and Y = sgrt(S) X sin(). It can be demonstrated that S is a 
random variable uniformly distributed over the interval (0, 1). 
The distribution of the cosine of the polar angle must be uniform 
over the interval (—1, 1). Thus Z is determined from S by the 
formula Z = 2 X S — 1 [8]. Finally, the X and Y components of 
the vector are normalized using the constraint that the vector 
be of unit length [3, 4]. 

A modification of this algorithm could be used to generate 
vectors which were azimuthally uniform but have a specified 
nonuniform distribution in the cosine of the polar angle. This 
would be achieved by replacing the statement Z := 2X S — 1 
with Z := F(S), where F is a procedure to calculate the inverse 
distribution function of Z. 

The author wishes to express his gratitude to B. Kehoe for 
comments concerning this algorithm, and to R. Nelson for doing 
much of the programming involved in testing it. 

REFERENCES: 

1. Van Getper, A. Some new results in pseudo-random num- 
ber generation. J. ACM 14 (Oct. 1967), 785-792. 

2. Von Neumann, J. Various techniques used in connection 
with random digits. Nat. Bur. of Standards Appl. Math. 
Ser. 12, 1959, p. 36. 

3. Knutu, Donato E. The Art of Computer Programming, 
Volume 2, Seminumerical Algorithms. Addison-Wesley, 
Reading, Mass., 1968, p. 34. 

4. Knop, R. Remark on algorithm 334. Comm. ACM 12 (May 
1969), 281.; 

begin 

real X, Y, Z, S; 

comment Rejection method yields two independent random 

variables, the azimuthal angle ¢, and the square of the radius 

S.; 
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A: 
X := Ril; Y := Ril; 
S:=X fT2+Y fT 2; 
if S > 1 then goto A; 
comment Z must be uniform over the interval (—1, 1). It can 
be demonstrated that S is uniform over the interval (0, 1).; 
Z2:=2XS—1; 
comment Given Z, X and Y are normalized by the constraint 
that the vector be of unit length.; 
S := sqrt. ((l—Z f 2)/S8); 
X:=XXS; Y:=YXS; 
end unisph 


Remark on Algorithm 381 [G5] 
Random Vectors Uniform in Solid Angle [Robert E. 
Knop, Comm. ACM 13 (May 1970), 326] 


Giinther F. Schrack [Recd. 1 Aug. 1970, 7 June 1971, 
and 4 Oct. 1971] 

The University of British Columbia, Departments of 
Electrical Engineering and Computer Science, 
Vancouver 8, B.C., Canada 


Key Words and Phrases: random vector generator, points 
uniform on sphere, spherically symmetric probability distribution 
CR Categories: 5.5 


Syntax corrections: The type declaration of the procedure body 
should be 


real S; 

and not 

real X, Y, Z, S; 

The sequential operator if in the conditional statement should be 
boldface and not in italic. The semicolon following the last assign- 
ment statement should be deleted. Also, in reference [3], p. 34 


should be replaced by paragraph 3.4. 
The following three cases are considered in this remark. 


Case 1: the original algorithm, Algorithm 381. 
Case 2: the modification of case 1 obtained by replacing the third 
last arithmetic assignment statement by 
:= 2X sqrt(1—S); 

Case 3: an alternative modification of case 1 obtained by replacing 
the assignment statement for Z by 
Z:= Ril; 
possible because Z is uniformly distributed in [—1,1). 

The three cases were translated into Fortran IV and tested on a 
/360-67 running under the Michigan Terminal System. The gen- 


erated vectors were all normalized. Two statistical tests were con- 
ducted in order to investigate some characteristics of these versions. 
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For these tests, R11 was replaced by 2«FRAND — 1, where FRAND 
is the fast random number generator in [4] with the multiplier re- 
placed with 78125005. Each of the following two tests were repeated 
six times, initializing the random number generator once only with 
0.461000. The sample size used for all tests was 1000. 

(i) Chi-square test for goodness of fit for each variable. The number 
of categories used was 20. For case | the null hypothesis Hy that 
each variable X, Y, and Z is uniformly distributed was rejected at 
the 1 percent significance level for variable X once out of the six 
tests; for variable Y, Ho was rejected once at the 5 percent signifi- 
cance level for too good a fit; and was not rejected for variable Z. 
For case 3, no rejection of Ay occurred. 

(ii) Linear correlation coefficient between pairs of the variables. As 
the correlation coefficient p of the population has the theoretical 
value zero, two-tailed tests of the null hyposesis Hy: = O were con- 
ducted. For case 1, all sample correlation coefficients were suffi- 
ciently small as not to reject Hy at the 5 percent level of significance. 
For case 3,. Hy was rejected at the 1 percent significance level but 
not rejected at the 5 percent level for one out of the 18 sample 
correlation coefficients. 

Case 2 saves One division compared to case 1 but otherwise does 
not change the behavior of the algorithm as tested above. Case 3 
was slightly slower (less than 7 percent) than case 1 in execution 
time. 

Finally, a comparison in execution time of case 1 with three 
other methods published previously [1], 2, 3] was carried out. Al- 
gorithm 381 showed a considerable advantage in speed, the three 
algorithms in [1, 2, and 3] were between 30 and 100 percent slower. 


References 

1. Cook, J.M. Rational formulae for the production of a 
spherically symmetric probability distribution. Math. Tables Other 
Aids Comp. [1 (1957), 81-82. 

2. Hicks, J.S., and Wheeling, R.F. An efficient method for 
generating uniformly distributed points on the surface of an 
n-dimensional sphere. Comm. ACM 2 (Apr. 1959), 17-19. 

3. Muller, M.E. A note on a method for generating points 
uniformly on n-dimensional spheres. Comm. ACM 2 (Apr. 1959), 
19-20. 

4. Seraphin, D.S. A fast random number generator for IBM 360. 
Comm. ACM 12 (Dec. 1969), 695. 
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ALGORITHM 382 

COMBINATIONS OF M OUT OF N OBJECTS [G6] 
Puituie J. CHAsE (Recd. 18 Mar. 1969 and 31 Oct. 1969) 
Department of Defense, Fort Meade, MD 20755 


KEY WORDS AND PHRASES: permutations and combina- 
tions, permutations 
CR CATEGORIES: 5.39 


procedure TWIDDLE (z, y, 2, done, p); 
Boolean done; integer array 7; 

comment TWIDDLE can be used (1) in generating all combi- 
nations of m out of n objects, or (2) in generating all n-length 
sequences containing m 1’s and (n—m) 0’s. 

In the case (1), suppose the n objects are given by an array 
a[l:n], and let us successively store combinations in another 
array, say, c[1:m]. For the first combination, c[1] through c[m] 
are equated, respectively, to al[n—m-+1] through a(n]. 
TWIDDLE (a, y, 2, done, p) is called. If done = true, then all 
combinations have been processed and we therefore stop. If 
not, a new combination is made available by setting c[z] equal 
to a[z]. TWIDDLE is called, and we continue on this loop until 
done = true. 

In the case (2), let the sequences of m 1’s and (n — m) 0’s be 
stored successively in an integer array, say, b[1:n]. The first 
sequence is obtained by setting b[1] through b[n—m] equal to 0, 
and b[n—m-+1] through b[n] equal to 1. TWIDDLE (2, y, z, 
done, p) is called. If done = true, then all required sequences 
have been processed, and we therefore stop. If not, a new se- 
quence is made available by setting b[z] equal to 1, and b[y] 
equal to 0. TWIDDLE is again called, and we continue on this 
loop until done = true. 

m and n are used only in the initialization of the auxiliary 
integer array p[0:n+1], which is done in the main program as 
follows. (It is assumed that 0 < m < n and 1 < n.) p[0] is set 
equal to n + 1, and p[n-+1] is set equal to --2. p[1] through 
p[n—m] are set equal to 0. p[n—m-+1] through p[n] are set 
equal, respectively, to 1 through m. If m = 0, then set p[I] 
equal to 1. done is set equal to false. 

The algorithm has several features which deserve mention. 
When used in generating combinations: (a) at each stage, only 
one combination number, namely c[z], is changed, (0) 
TWIDDLE is order preserving in the sense that at each stage 
c{1] through c[m] will equal, respectively, some a[%] through 
al[t.,] where t% through 1, are strictly increasing. When used in 
generating fixed-density 0-1 sequences: (c) at each stage, it is 
only necessary to change two numbers of the sequence, b[z] 
and b[y], and these are changed in a specific manner. 

The algorithm underlying this procedure was discovered by 
Leo W. Lathroum in 1965. Another algorithm which accom- 
plishes combinations by transpositions was discovered by 
Donald E. Knuth in 1964. The author has knowledge of the work 
of Lathroum and Knuth from private communications. He will 
include further detail in a mathematical paper, which will in- 
clude justification of this procedure, to be published elsewhere; 

begin integer i,j,k; j := 0; 


integer 2, ¥, 2; 


Ti: 
j:=j+1; if p[j] < 0 then go to Zi; 
if p[j—1] = 0 then 
begin 


for 1 := j — 1 step —1 until 2 do p{t] := —1; pj] = 0; 
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p(l} :=a2:=2:=1; y:= 7; goto L4 
end; 
if j > 1 then p{j—1] := 0; 
L2: 
j:=j+1; if p[j] > 0 then go to L2; 
t:=k:=jg-1; 
D3: 
t:=24+1; if pit] = 0 then 
begin p[t] := —1; go to £3 end; 
if p[z] = —1 then 
begin 
pit] := 2:= pik]; w:= 4; y :=k; 
plk] := —1; goto L4 
end; 
if 1 = p[0] then begin done ,:= true; go to 14 end; 
2:= plj]:= pit); pit] := 0; t:= 9; y= 14; 


IA: 
end of TWIDDLE 


REMARK ON ALGORITHM 382 [G6] 

COMBINATIONS OF M OUT OF N OBJECTS 
[Phillip J. Chase, Comm. ACM 13 (June 1970), 368] 

Puruuip J. CHase (Recd. 18 Mar. 1969 and 31 Oct. 
1969) 

Department of Defense, Fort Meade, MD 20755 


KEY WORDS AND PHRASES: permutations and combina- 
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CR CATEGORIES: 5.39 


The following driver program illustrates the use of Algorithm 
382. 
begin integer m, n, 1, 2, y, 2,9, 7; Boolean done; 
integer array a, b, c[1:30], [0:31]; 
procedure TWIDDLE (zg, y, z, done, p); 
comment Body of TWIDDLE is to be inserted here; 
comment TWIDDLE is here used to generate: (1) all combi- 
nations c[1:m] of a[{1:n]. Here we take a[z] equal to 7, each 7. 
(2) all sequences b[1:n] consisting of m 1’s and (n—m) 0’s. 
The user must supply m and n such that 0 < m < nand1 <n. 
(Our declarations here require n < 30.); 
ininteger (2,m); ininteger (2, n); 
for 7 := n step —1 until 1 do a[z] := 7; 
comment We initialize the parameters » and done of 
TWIDDLE as follows; 
r:=n—™; 


for 7 := r step —1 until 1 do pit] := 0; 
for 7 := m step —1 until 1 do p[r-++7] := 7; 
pl0] := n+1; pln+1] := —2; done := false; 


if m = 0 then p[1) := 1; 

comment We initialize c[1:m]; 

for 7 := m step —1 until 1 do cli] := a[r+7]; 

comment Next we initialize b[1:n]; 

for 7 := m step —1 until 1 do b[r+1] := 1; 

for 7 := r step —1 until 1 do [2] := 0; 

comment Now we generate and output our successive com- 
binations and sequences; 

q := 0; 
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L: 
q:=qt1; 
outinteger (1, 9); 
for 1 := m — 1 step —1 until 0 do outinteger (1, c[m—7]); 
for 7 := n — 1 step —1 until 0 do outinieger (1, b{n—1)); 
TWIDDLE (cz, y, z, done, p); 
if — done then 
begin 
c[z] := a[z]; blz] := 1; bly] :=0; goto L 
end 
end of driver program 
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ALGORITHM 383 

PERMUTATIONS OF A SET WITH 
REPETITIONS [G6] 

Paiuup J. Caase (Recd. 4 Aug. 1969 and 13 Feb. 1970) 

Department of Defense, Fort Meade, MD 20755 


KEY WORDS AND PHRASES: permutations and combina- 
tions, permutations 
CR CATEGORIES: 5.39 


procedure EXTENDED TWIDDLE (zg, y, k, u, done, p); 
value k, u; integer x, y,k, u; Boolean done; integer array 
DP; 
comment EXTENDED TWIDDLE is a generalization both of 
TWIDDLE [2], which is used in generating combinations by 
transpositions, and of the Trotter-Johnson adjacent-transpo- 
sition permutation algorithms [5, 3]. 

In the main program, to successively store all distinct permu- 
tations of C[Z] numbers equal to N[Z] (7=1 to J) in an array A, 
take, as the first permutation, that obtained by dividing 
A[1:C[1]+---+C[J]] into J intervals and setting the C[I] 
numbers of interval J equal to N{Z] (J=1 to J). (We assume 
that J > 2 and that each C[J] > 1. For distinct permutations, 
we need N[I’]+N[I"] whenever I’ + I’. For somewhat better 
efficiency, it is desirable, but not necessary, that the sequence 
C[I] be non-increasing. ) 

EXTENDED TWIDDLE (a, y, k, u, done, p) is called. If 
done = true, then all permutations have been processed and 
we therefore stop. If not, a new permutation is made available 
by transposing A[z] and Aly], EXTENDED TWIDDLE is 
called, and we continue on this loop until done = true. 

EXTENDED TWIDDLE is initialized in the main program. 
k is equated to J, u is equated to C[1] + --- + Cl] + 1, done 
is equated to false, and p/[0] and plu] are equated to J + 1. 
p{l:u—1] is initialized by setting the members of the Jth in- 
terval, of length C[Z], equal toJ — 7+17U0=1to/); 

That the procedure proceeds by transpositions (not neces- 
sarily adjacent, this being impossible in general) will introduce 
a special economy in some cases. If this feature is of no value 
in a particular application, then the algorithm of Bratley [1] 
or of Sagg [4] might be appropriate. For J = 2, TWIDDLE [2], 
which also has the transposition feature, will be more efficient 
than EXTENDED TWIDDLE. If each C{[Z] = 1, then Trotter’s 
algorithm [5] for generating permutations by transpositions, 
is appropriate. 


REFERENCES: 
1. Bratury, P. Algorithm 306, Permutations with repetitions. 
Comm. ACM 10 (July 1967), 450-451. 
. Coase, P. J. Algorithm 382, Combinations of M out of N 
objects. Conum. ACM 18 (June 1970), 368. 
3. Jounson, S. M. Generation of permutations by adjacent 
transpositions. Math. Comp. 17 (1963), 282-285. 
4. Saaa, T. W. Algorithm 242, Permutations of a set with repe- 
titions. Comm. ACM 7 (Oct. 1964), 585. 
5. Trorrer, H. F. Algorithm 115, PERM. Comm. ACM 6 (Aug. 
1962), 434-435. ; 
begin integer s, 7, j, 0; 
j:=6b:=s8:=0; 
D1: 
j:=jg+1; if abs (p[j]) = k then 
begin if p[j] < 0 then s := j; go to L1 end; 


[ww] 
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if p[j—1] = & then 
begin 
for 7 := j — s — 1 step —1 until 2 do p(s + 7] := —k; 
if s > b then p[s] := k; 
plst+1] := plj]; pi] := k; c= s +1; 
end; 
if s > b then p[s] := k; 
D2: 
j:=j+1; if abs (p[j]) < k then go to L2; 
if j = u then 
begin 
if k = 2 then begin done := true; 
j:=b:=8; k:=k—1; gotolLl 
end; 
t:=b:=j-1; 
L3: 
t4:=7t+1; if pit] = k then 
begin p[{t] := —k; go to L3 end; 
if p[t] = —k then 
begin 
pl] := pid); 
end; 
if 7 = u then 


y:=j; goto lA 


go to 14 end; 


go to L4 


ll 
S. 
Si 


p[b] := —k; w2:=b; y: 


go to L4 end; 
go to Ll 


if k = 2 then begin done := true; 
3 jg:=b:= 8; k:=k—-1; 
w:= J; y=; plj]:= pit]; ple] = &; 
LA: 

end EXTENDED TWIDDLE 
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The following driver program illustrates the use of Algorithm 
383. 
begin integer z, y, k, u, J, Q, I, L; Boolean done; 
integer array p[0:31], A, C, N[1:30]; 
procedure EXTENDED TWIDDLE (cz, y, k, u, done, p); 
comment Body of EXTENDED TWIDDLE is to be inserted 
here; 
comment Program uses EXTENDED TWIDDLE in generat- 
ing all permutations of C[Z] numbers equal to N[Z] (7=1 to J). 
They are successively stored in A and output. The user must 
supply: 1. J (indexing above requires J<30); 2. C[Z] =1 to 
J), each > 1 (indexing above requires C[1]+----+C[J]<30); 
3. N[I] (i=1 to J), distinct numbers (declarations above 
requires integer type); 
ininteger (2, J); 
for I := 1 step 1 until J do 
begin ininteger (2, C[I]); ininteger (2, N[I]) end; 
comment The array A is initialized; 
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L:=1; 

for J := 1 step 1 until J do 

for Q := C[I] step —1 until 1 do 

begin A[L] := N[J]; LD := L+1end; 

comment EXTENDED TWIDDLE is initialized; 
L:=1; 

for J := 1 step 1 until J do 

for Q := C[I] step —1 until 1 do 

begin p[L] := J—I+1; L:=L+1end; 

pl0] := p[L] := J +1; 

done := false; 

k:=J; w:=D; 

comment Permutations are successively generated and 


output; 
Q:=0; L:=u-—-1; 
Dl: 
Q:=Q+1; 


outinteger (1, Q); 
for J := u — 2 step —1 intil 0 do outinteger (1, A[L—I]); 
EXTENDED TWIDDLE (a, y, k, u, done, p); 
I:= Alz]; Alz] := Aly]; Aly] := J; 
if — done then go to Ll 
end of driver program 
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The following algorithm by G. W. Stewart relates to the paper by the same 
author in the Numerical Mathematics department of this issue on pages 865- 
867, This concurrent publication in Communications follows a policy an- 
nounced by the Editora of the two departments in the March 1967 isaue. 


ALGORITHM 384 

EIGENVALUES AND EIGENVECTORS OF A REAL 
SYMMETRIC MATRIX [F2] 

G. W. Stewart (Recd. 7 Nov. 1969) 

Department of Computer Sciences, The University of Texas 
at Austin, *Austin, TX 78712 


* Work on this algorithm was supported in part by the National 
Science Foundation under grant GP-8442 and by the US Army 
Research Office (Durham) under grant DA-ARO(D)-31-124- 
G1050 at the University of Texas at Austin. 


KEY WORDS AND PHRASES: real symmetric matrix, eigen- 
values, eigenvectors, QR algorithm 
CR CATEGORIES: 5.14 

DESCRIPTION: 

SYMQR finds the eigenvalues and, at the users option, the 
eigenvectors of a real symetric matrix. If the matrix is not ini- 
tially tridiagonal, it is reduced to tridiagonal form by House- 
holder’s method [2, p. 290]. The eigenvalues of the tridiagonal 
matrix are calculated by a variant of the QR algorithm with origin 
shifts [1]. Eigenvectors are calculated by accumulating the prod- 
ucts of the transformations used in the Householder transforma- 
tions and the QR steps, a procedure which guarantees a nearly 
orthonormal set of approximate eigenvectors. 

At each QR step the eigenvalues of the 2 X 2 submatrix in the 
lower right-hand corner are computed, and the one nearest the 
last diagonal element is distinguished. When these numbers settle 
down they are used as origin shifts. 

The user may choose between absolute and relative convergence 
criteria. The former accepts the last diagonal element as an ap- 
proximate eigenvalue when the last off-diagonal element is a small 
multiple (EPS) of the infinity norm of the matrix. The latter re- 
quires that the last off-diagonal be small compared to the last two 
diagonal elements. To avoid an excessive number of QR steps, an 
important consideration when eigenvectors are computed, the 
following guidelines should be followed. The convergence tolerance 
should not be smaller than the data warrants [2, p. 102]. The rela- 
tive convergence criterion should be used only when there are 
eigenvalues, small compared to the elements of the matrix, that 
are nonetheless determined to high relative accuracy. Finally, 
when there is a wide disparity in the sizes of the elements of the 
matrix, the matrix should be arranged so that the smaller elements 
appear in the lower right hand corner. 

The program will work with matrices whose elements very 
nearly underflow or overflow the range of a floating-point word. 
Some accuracy may be gained by accumulating inner products. 
The places where this should be done are signaled by the appear- 
ance of the variables SUM and SUM1. 

REFERENCES: 

1, Stewart, G. W. Incorporating origin shifts into the sym- 
metric QR algorithm for symmetric tridiagonal matrices. 
Comm. ACM 18 (June 1970), 365-367. 

2. WILKINSON, J. H. The Algebraic Eigenvalue Problem. Claren- 
don Press, Oxford, 1965. 
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SYMOR (AsDesF sKO sNoNAvEPS sABSCNV eVECaTROOFAIL) 


EXPLANATION OF THE PARAMETERS IN THE CALLING SEQUENCE > 


A 


Ko 


NA 


EPS 


ABSCNV 


VEC 


TRD 


FAIL 


REAL 
1A(NA91) 9D(1 
2C oSeoTEMP oP» 

INTEGER 
TN oNM1 9NM2 oN 

LOGICAL 
LABSCNV»sVECe 

TITTER = 50 

NM1 = N-1 

NM2 = N~-2 

NINF = 0, 

ASSIGN 500 


A DOUBLE DIMENSIONED ARRAYs IF THE MATRIX IS NOT 
INITIALLY TRIDIAGONAL» IT IS CONTAINED IN THE LOWER 
TRIANGLE OF Ae IF FIGENVECTORS ARE NOT REQUESTED 

THE LOWER TRIANGLF OF A IS DESTROYED WHILE THE 

ELEMENTS ABOVE THE DIAGONAL ARE LEFT UNDISTURBEDe 

IF EIGENVECTORS ARE REQUESTED» THEY ARE RETURNED IN THE 
COLUMNS OF Ae 


A SINGLY SUBSCRIPTED ARRAYe IF THE MATRIX IS 
INITIALLY TRIDIAGONAL» D CONTAINS ITS DIAGONAL 
ELEMENTSe ON RETURN D CONTAINS THE EIGENVALUES OF 
THE MATRIXe 


A SINGLY SUBSCRIPTED ARRAYe IF THE MATRIX IS 
INITIALLY TRIOIAGONAL»s E CONTAINS ITS OFF-DIAGONAL 
ELEMENTSe UPON RETURN E(1) CONTAINS THE NUMBER OF 
ITERATIONS REQUIRED TO COMPUTE THE APPROXIMATE 
EIGENVALUE DCT). 


A REAL VARIABLE CONTAINING AN INITIAL ORIGIN SHIFT TO 
BE USED UNTIL THE COMPUTED SHIFTS SETTLE DOWNe 


AN INTEGER VARIABLE CONTAINING THE ORDER OF THE 
MATRI Xe 


AN INTEGER VARIABLE CONTAINING THE FIRST DIMENSION 
OF THE ARRAY Ao 


A REAL VARIABLE CONTAINING A CONVERGENCE TOLERANCE @ 


A LOGICAL VARIABLE CONTAINING THE VALUE eTRUEs IF 
THE ARSOLUTE CONVERGENCE CRITERION IS TO BE USED 
OR THE VALUE eFALSEe IF THE RELATIVE CRITERION 

IS TO BE USED. 


A LOGICAL VARIABLE CONTAINING THE VALUE «TRUEe IF 
EIGENVECTORS ARE TO BE COMPUTED AND RETURNED IN 
THE ARRAY A AND OTHERWISE CONTAINING THE VALUE 
oFALSEoe 


A LOGICAL VARIABLF CONTAINING THF VALUE eTRUE® 
IF THE MATRIX IS TRIOIAGONAL AND LOCATED IN THE ARRAYS 
D AND £ AND OTHERWISE CONTAINING THE VALUE eFALSEee 


AN INTEGER VARIABLE CONTAINING AN ERROR SIGNAL eo 
ON RETURN THE EIGENVALUES IN D(FAIL+1) se00e2D(N) 


AND THEIR CORRESPONDING EIGENVECTORS MAY BE PRESUMED 
ACCURATE e 


¥9E (1) oK09D1 929K v9EPS2S2 sCON oNINF eo TEST 9 CB fC eC De 
PP oQ 9QQ eNORM oR gs TI TTER» SUMeSUM1 pMAX 
AoFAILolsIloJob ol loLl oLL1 sNL sNUsNUM2 oSINCOS eRETURN 


TROD »SHFT 


TO SINCOS 


SIGNAL ERROR IF N IS NOT POSITINVE®s 


IF (NeGT*O) 
FAIL = -1 
RETURN 


SPECIAL TRE 


IF (NeGTel) 


GO TO 1 


ATMENT FOR A MATRIX OF ORDER ONEeo 


GO TO 5 


IFC eNOTe TRO) DC1) = Alls1) 


IF(VEC) ACL 
FAIL = O 
RETURN 


IF THE MATR 


IF(TRD) GO 
IF (NeEQe2) 


ol) = le 


IX IS TRIDIAGONALs SKIP THE REDUCTION’ 


TO 100 
GO TO 80 


REDUCE THE MATRIX: TO TRIDIAGONAL FORM BY HOUSEHOLDERS METHODe 


DO 70 L#1l sNM2 


L1 = L+l 
DtL)} = Allo 
MAX = Oo 
po 10 I=Lle 
MAX = AMAX1 


u) 


N 
(MAX sABS (ACT oL))) 


IF(MAXeNE*O.) GO TO 13 


FIL) = 0, 
A(LeolL} = le 
Go TO 70 
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13 


17 


20 


30 
40 
50 


60 
70 
ao 


100 


AAN ANDA 


110 
120 


AANA 


130 


135 
140 


150 


140 
170 


NaAann 


180 


190 


NAN 


200 


NANnN 


210 


220 
230 


2000 


240 


NaN 


SUM = O6« 
nO 17 T=lLleN 
A(T eb) = Atl eoLY/MAX 


SUM = SUM + A(ToL} #2 

§2 = SUM 

$2 = SQRT(S2) 

TFCACL LoL) el Te Of) S2 = -S2 
€(L) = -S2"MAX 


A(LlotL) = A(Lleol) + §2 
AtlLelL) © S2*A(LI,L} 

SUML = 0. 

00 50 T=LI16N 

SUM = O6« 

DO 20 JzL1lgl 

SUM = SUM + A(T sJ)#A(J ob) 
IFC I*EQeN) GO TO 40 

1) = I+ 

PO 30 JtIleN 

SUM = SUM + Al(JsLIFA( Ig I) 
ECT). = SUMZA(L oL) 

SUM1 = SUM] + ACT ,LIFEC(T) 
CON = e5#SUMI/AIL ol) 

DO 60 T=LleN 

E(r) = ECL) = CON#ACT »L) 
DO 60 J#Liel! 

A(T sdb = ACT oJ) - ACToL)#E( I) = ACJoL)MECT) 
CONTINUE 

DCNM)) ©& A( NM] eNM1) 

DIN) = A(NoN) 

FUNM1) = ACNoNM1) 


IF EIGENVECTORS ARE REQUIRED» INITIALIZE As 
IF (eNOTeVEC) GO TO 180 


IF THE MATRIX WAS TRIDIAGONALs, SET A EQUAL TO THE IDENTITY MATRIXe 
TFCeNOTeCTRD eANDe Ne NEo2) GO TO 130 

DO 120 Ie1sN 

DO 110 JzleN 

AlleJ) = O« 

AC(leI) = le 

GO TO 180 


IF THE MATRIX WAS NOT TRIDIAGONALs MULTIPLY OUT THE 
TRANSFORMATIONS OBTAINED IN THE HOUSEHOLDER REDUCTIONe 


A(NoN) = le 

A(NMILONML) = le 

A(NM]9N} = Oo 

A(N»sNM1L) © O06 

DO 170 L=1,NM2 

LL = NM2-L41 

tll = LL+1 

DO 140 T*LLioN 

SUM = 0.4 

BO 135 JzLLieN 

SUM = SUM + A(JeLLI#Al Js 1) 
A(tLisl) = SUM/ATLL sLL? 

DO 150 TeLLieN 

00 150 J=LL1oN 

A(lsJ) = AtEod) — ACT sLLIFACLE SJ) 
DO 160 I=LL1eN 

A(IoLL) = O06 

A(LLel) = 0. 


A(LL+tl) = 1. 


IF AN ABSOLUTE CONVERGENCE CRITERION 1S REQUESTED 
(ABSCNV#.TRUE*)» COMPUTE THE INFINITY NORM OF THE MATRIXe 


IF (eNOTe*ABSCNV} GO TO 200 

NINF = AMAX1(A8S(D(1))4+ABS(E(1)) sABS( O(N) )+ABSCE(NML) )) 
TF(NsEQ@e2) GO TO 200 

DO 190 1=29NM1 

NINF = AMAXL(NINF sABS(D(1))+ABS(F (1) }4ABS(E(I=“1))) 


START THE QR ITERATION. 


NU = ON 

NUM] = N-1 

SHFT = eFALSEs 
Kl = KO 

TEST = NINF#EPS 
EIN) = Of 


CHECK FOR CONVERGENCE AND LOCATE THE SUBMATRIX IN WHICH THE 
QR STEP 15 TO BE PERFORMED. 


DO 220 NNL=} »NUM) 

NL = NUMI-NAL41 

TF CeNOTeABSCNV) TEST = EPS#AMIN1(ABS(D(NL)) sABS(IDINL41) )) 
IFCABS(E(NL)) eL&. TEST) GO TO 230 

CONTINUE 

GO TO 240 

ECNL) © Oo 

NL * NL+] 

IF(NL «NEs NU} GON TO 240 

IF(NUML eEQe 1) RETURN 

TF(E( 200) NE ee) PRINT 2000s(D( 1) sEC1) ol@1 NU) 


“‘FORMATCIHOLOEL2¢64/(1H 10612064)) 


NU ® NUM] 

NUM] = NU-1 

GO TO 210 

E(NU) ® E(NU)*FLOAT (NUMI-NL ) 
TF (le oEQe led GO TO 250 

IF( Oe sEQe led GO TO 250 
FAIL = NU 

RETURN 


CALCULATE THE SHIFTe 
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250 CB = (DCNUMLI-DINUD)/26 
MAX = AMAX1(ABS(CB) eABSTE(NUM1))) 
CB = CB/MAX 
CC = (E(NUM1) /MAX) ##2 
CM = SORT(CB##2 + CC) 
IFt(CB eNEe Oc) CO = SIGN(CDsCEl) 
K2 = D(NU} = MAX#CC/(CB+CD} 
IF(SHFT) GO TO 270 
IFCABS(K2-K1) el Te eS*ABS(K2)! GO TO 260 
Kl = K2 
K = KO 
GO TO 300 

260 SHFT = eTRUEs 

270 K = K2 


PERFORM ONE QR STEP WITH SHIFT K ON ROWS AND COLUMNS 
NL THROUGH NU © 


NANNY 


300 IFIE( 200) eNEeOe sANDe KelEolof-14#ABS(D(NL))) K200 
P= DINL) - K 
Q@ = E(NL) 
ASSIGN 310 TO RETURN 
GO TO SINCOS*(500) 
310 DO 380 I=NL»NUM} 


IF REQUIRED», ROTATE THE EIGENVECTORS.» 


Nan 


IF(eNOTeVEC) GO TO 330 

DO 320 J=loN 

TEMP = C#A(JoT) + S*ACJoI +1) 

AlJoT41) = -S#A(IoT) + CHACJoT4+1) 
320 AlJeol) = TEMP 


PERFORM THE SIMILARITY TRANSFORMATION AND CALCULATE THE NEXT 
ROTATION. 


ANNAN 


430 Dt) = C#O(I) + S*¥ECT) 
TEMP = C#¥E(T) + S#DO( 141) 
OCI+1) = -S#E(1) + C#OCT+1) 
F(T) = ~S#K 
Ott) = C#D( 1) + S*TEMP 
IF¢E 2EQ, NUM1) GO TO 380 
TFCABS(S) eGTs ABS(C)) GO TO 350 
R= S/C 
D¢I+1l) = -S#ECT) + C#D(T+1) 
P= O(I+1) = K 
Q = C#F(T41) 
ASSIGN 340 TO RETURN 
GO TO SINCOSs (500) 

340 Etl) = R#NORM 
F(I+1} = @ 
69 TO 380 

350 P = CHE(I) + S¥D( 141) 
Q = S¥ECT+4+1) 
Dt I+1) = C#P/S + K 
Etl+1) = C#E(T4+1) 
ASSIGN 360 TO RETURN 
GO TO SINCOSs(500) 

360 E(Il) = NORM 

380 CONTINUE 
TEMP = C#E(NUM1) + S*D(NU) 
D(NU) = —S#E(NUM1) + C#®DUNU) 
E(NUM1) = TEMP 
GON TO 210 


INTERNAL PROCEDURE TO CALCULATE THE ROTATION CORRESPONDING TC 
THE VECTOR(PsQ)e 


ANNAN 


500 PP = ABS(P) 
aq = aBS(N) 
IF(QQ eGTe PP) GN TO 510 
NORM = PPRSORT( 1. + (QQ/PP)##2) 
GO TO 520 
$10 IF(QQ eEQ@e O06) GN TO 530 
NORM = QQ#*SQORT(1. + (PP/QQ)##2) 
520 C = P/NORM 
S = Q/NORM 
GO TO RETURNs(31093409360)} 
530 C = le 
S = 0-6 
NORM = O, 
GO TO RETURN» (31953409360) 
END 


REMARK ON ALGORITHM 384 [F2] 

EIGENVALUES AND EIGENVECTORS OF A REAL 
SYMMETRIC MATRIX [G. W. Stewart, Comm. ACM 
6 (June 1970), 369-371] 

G. W. STEWART 

Department of Computer Sciences, The University of 
Texas at Austin, Austin, TX 78712 

KEY WORDS AND PHRASES: real symmetric matrix, eigen- 


values, eigenvectors, QR algorithm 
CR CATEGORIES: 5.14 
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The following changes should be made in the subroutine 
SYMQR. Change the statement: 
REAL 
1A(NA,1),D(1),E(1),K0,D1,D2,... 
to: 
REAL 
1A(NA,1),D (1),E(),KO0,K1,K2,... 
After statement number 230 delete the statements: 
IF (E(200).NE.0.) PRINT 2000,(D(I),E(I),I=1,NU) 
2000 FORMAT (1H010E12.4/(1H 10E12.4)) 
Replace the statements: 
240 E(NU) = E(NU)+FLOAT(NUMI-—NL) 
IF(1. .EQ. 1.) GO TO 250 
IFO. .EQ. 1.) GO TO 250 
by: 
240 E(NU) = E(NU)+1. 
IF(E(NU) .LE. TITTER) GO TO 250 
Replace the statements: 


300 IF (E(200).NE.0. .AND. K.LE.1.E—14*ABS(D(NL)))K=0. 


P = D(NL) —- K 
by: 
300 P = D(NL) — K 
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ALGORITHM 385 

EXPONENTIAL INTEGRAL £;(zx) [813] 

KATHLEEN A. PactorEK* [Recd. 16 May 1969 and 11 
March 1970] 

Argonne National Laboratory, Argonne, IL 60439 


* Work performed under the auspices of the US Atomic Energy 
Commission. 


KEY WORDS AND PHRASES: exponential integral, special 
functions, rational Chebyshev approximation 
CRICATEGORIES: 5.12 


DESCRIPTION: 
The classical exponential integral is defined by 


© ot * ert 

Bia) = [ Fat = - —di, «>0 

| ot ae 

where the integral is to be interpreted as the Cauchy principal 
value. Except for the sign, it represents the natural extension of 
the function 


Ex(z) = / “ dt = — E,(—2), 


to the negative real axis. 
The rational approximations and corresponding intervals used 
in this routine are: 


Ein(z) = © [2 ae: Rita) |, n<—4 
0 
= —e*Rim(—1/z), —-4<¢2<-1 
= In(—2z) — Rin(—2), -l<2z2<0 
= In(x/m) + (& — m)Rin(t), O< et <6 


” 
Hl 


= Rin(1/2), 
x 


2 E ag Rint) | 
xv xz 


where the Ai, (¢) are rational functions of degree / in the numerator 
and m in the denominator, and 


Xo = .372507410781366634461991866580 


is the zero of H;(x). See [2, 3] for the derivation of these approxi- 
mations. 

In several of the ranges, it was necessary to express the rational 
functions either as J-fractions or as ratios of finite sums of Cheby- 
shev polynomials, since the original forms were found to be poorly 
conditioned, i.e. subject to cancellation errors (subtraction of 
nearly equal quantities), large roundoff errors, etc. The approxi- 
mations chosen for this routine have the following maximum 
relative errors. 


Range Rao or 
z less than —4 1.32D-19 
(—4, —1) 6.33D-20 
(—1, 0) 1.12D-21 
(0, 6) 1.24D-18 
(6, 12) 2.35D-18 
(12, 24) 6.0D-20 
x greater than 24 7.85D-19 
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Different approximations would naturally be required for use on 
computers with different word lengths. See [2, 3]. 

Test results. This routine was tested on an IBM System 360 
model 75, where truncation is approximately 7.0D-18, usual for 
long normalization form. However, since this is a base 16 machine, 
truncation may be 1.1D-16, maximum for short normalization. 
The testing procedure is described in [1]. The maximum relative 
errors (MRE) and root mean square relative errors (RMS) follow. 
(Note—Argonne National Laboratory versions of DEXP and 
DLOG, rather than the IBM subroutine library routines, were 
used in these tests.) 


Range MRE RMS 
(—150,-4) 4.44D-16 1.52D-16 
(4.1) 6.02D-16 2.50D-16 
(—1, 0) 4.41D-16 1.08D-16 
(0, 6) 6.681D-16 2.71D-16 
(6, 12) 7.45D-16 3.58D-16 
(12, 24) 8.18D-16 2.56D-16 
(24, 100) 3.881)-16 1.36D-16 


On the IBM System 360 model 75 the average time per call, 
excluding the Jump from the calling program, was 245 micro- 
seconds. Using small perturbations of the constant coefficients in 
the numerators of the rational functions, in order to compensate 
for the biased arithmetic on the IBM System 360, it is possible 
to reduce the MRE by an average of 25 percent and the RMS by 
an average of 45 percent. 

Machine dependent features. Since E,(0) = —, an argument of 
zero results in a function value which is the smallest negative 
floating point number on the IBM System 360. Both the Argonne 
version of DEXP and that of the IBM System 360 Subroutine 
Library treat an argument greater than 174.673 as an error and 
return the largest possible floating point number. Since DEXP (X) 
is used for X greater than 24, this exponential integral routine 
returns the largest possible floating point number on the IBM 
System 360 whenever the argument is greater than 174.673, elimi- 
nating the call to the DEXP routine. In order to maintain good 
relative accuracy in the vicinity of zo, the quantity (~ — 20) 
should be computed to higher than machine precision to preserve 
the low order bits of x»). This can be readily accomplished by 
breaking 2» into two parts, x: and 22 , such that, to the precision 
desired, to = 21 + 2x2 and the floating point exponent on x2 is much 
less than that of 2; . See [2]. Examining the hexadecimal represen- 
tation 29 = .5F5CA54AD2D7F0F264C3 (base 16), we see that for 
the IBM System 360 we might, and in fact this routine does, use 
a1 = .SF5CA54AD2 (base 16) and a. = .0000000000D7F0F264C3 
(base 16) or, 21 = 409576229586./2**40 (base 10), in a form which 
will avoid decimal to hexadecimal conversion errors and 22. = 
.7671772501993940D-12 (base 10). Then, (7 — 20) is computed as 
(x — x0) = (x — a) — x2. Additional precautions will have to be 
taken to compute In(x/xzo) for x near xo . We use a low order ra- 
tional approximation to In(z%/x0o) = log (1 + y), for | y | < i, 
where y = (2 — Xo)/20 . However, a few terms in the Taylor series 
for In (1 + y) will usually suffice. 


REFERENCES: 


1. Cuark, N.A., Copy, W. J., Hinustrom, K. E., anp THIELEKER, 
E. A. Performance statistics of the FORTRAN IV(H) 
library for the IBM System/360. Argonne National Labora- 
tory Rep. ANL-7321, May 1967. 

2. Copy, W. J., anp THacHEerR, Henry C., Jn. Chebyshev ap- 
proximations for the exponential integral E(x). Math. 
Comp. 23 (Apr. 1969), 289-303. 

3. Copy, W. J., anp THACHER, Henry C., Jr. Rational Cheby- 


COLLECTED ALGORITHMS (cont.) 


shev approximations for the exponential integral E,(z). 
Math. Comp. 22 (July 1968), 641-649. 


4. Rice, J. R. On the conditioning of polynomial and rational 


forms. Numer. Math. 7 (1965), 426-435. 


ALGORITHM: 


aaaanan 


aaaa 


FUNCTION DEI(X1) 


AN EXPONENTIAL INTEGRAL ROUTINE 

FOR X GREATER THAN Oy THE EXPONENTIAL INTEGRAL» Ely IS DEFINED BY 
EL(X)=INTEGRAL(CEXP(T)/T DT) » FROM T=-EINFINITY TO T=X 

WHERE THE INTEGRAL IS TO BE INTERPRETED AS THE CAUCHY PRINCIPAL 
VALUE. FOR X LESS THAN O» EI(X)=-F1l(-X)» WHERE 
EL(Z)=SINTEGRAL(EXP(-T)/T OT) FROM T=Z TO T=INFINITY. 


DOUBLE PRECISION DEE yXl9XoXO0eXMXOeVeR eDENMe FRAC pWeArBeCeoDe Ee 
XF oy P09 P1_P29P39P4,00901 102903 2049 PX OXy To SUMP » SUMO 

DIMENSION P1(9)9Q1(9) P2(9) 40218) ¢P3(20) 50319) pPX(10) oOX(10)% 
XP4(10) 99419) 9P0(6) 0016) 

DIMENSION A(6)9B(6)9C(8) 4D(8) E(8) oF (8) 

DATA PO/1.0092623069937666899751D0s 1470277059606809295D0,4 
X5010499279623219400D-1Le 4+890892537892791540=25 
X3 6 654622241323684290—4/ 

DATA Q0/1.00%267306993766689975100, 2473478695106925836D0>% 
X1.21765962960151532D0, 2428817933990526412D-—1; 
X1231114151194977706D-2/ 

DATA P1/5e9956994689237001L009, -2.50389994886351362D8, 
X7205921609590056747D8, -3436899564201591901D6+ 
X8.9868329164375831306, 74637147790184657443D04, 2.8544688181364701 
X5049 4612626667 248911939D2, 1410639547241639580D01/ 

DATA 01/2.55926497607616350D9_5 -—2479673351122984591D9, 

XB 60282778294695650708» ~1444980714393023883D08, 1.2771583080107998 
XB4D7_— 9 ~1649575457202559218N06, 825377100018074909704, -3.0252368 
X2238227410039 5612578125D1/ 

DATA P2/94989576665165517040-15 §-73116705744508018D0, 
X4018102422562856622D0»5 5.488658240753281111D0-, -1.941329675144307 
XO02Dl_ 7089472209294457221005 2632730233839039141D1, ~—3.67783113 
X478311458Dly_y —-2 246940983448 361265D0/ 

DATA 02/141462525324901619100, —1499149600231235164D2, 

X3 641365212524375539D25 54623165568734558614D1,_, 341727948925436932 
XBD2, -8.38767084189640707D0_ 9465405217429280303D02, 24639830073 
X18024593D0/ 


DATA P3/9.99993310616056874)D-1,. 71.84508623239127867D0% 
X22652575818452799B82D1 5 2 «49548773040205944D1, 

X— 30 32361257934396228D1y —9013483569999874255D—1, 
X-2 610574079954804045D1 » -1-0006419139892848301, 
X-1.860092121 4#2643758D1y -1.264772117246346314D0/ 

DATA Q3/1.40015338520453427000,- -1.0935561953910912401, 
X1.99100447081774247D2,y 1.19283242396860101D3,- 
X4042941317833 79284001 2653881931563070803D2 + 
X5299493232566740736D15 6 640380040535241555D1¢ 
X9079240359921729030D1/ 

DATA P4/1.200000000000000486D0, -3.00000000320981266D0+% 
X-5.00006640413131002D0,% —7 60681097789502935900, 

X= 145285662 3636929637015 ~7.6314770162025363100, 
X-2 ©79798528624305389D1 » -1.8194966492986B8906D1, 
X—-262312767077763241002, 1.75338801265465972D2/ 

DATA Q4/1299999999999048104D0>% —2 -99999894040324960D0, 

X-7699243595776339741D0,% ~1.2018776354715474301,% 


X720483184718042467601>¢ 1217179220502086455D2, 
X1.37790390235747999D2, 3.97277109100414518D0>% 
X3697845977167414721D4/ 

DATA A/~52772156649015328630-1, 70541643136630166200—1, 
X1-2298492329273732340-1y 2 -40681355683977413D-2, 
X1.2320843092096093710-3,» 6.577393997532645010-5/ 

DATA B/1-00094.25899193811589822D-1,y 709779471841022822D-2¢% 
X8.302084760987716770—3,% 4286427138393016416D-4, 
X1.-30655195822848878D-5/ 

DATA C/8.67745954838443744D-8, 946999955193013903020-1,% 
X1.18483105554945844D1 5 4.55930644253389823D1_ 
X629927945129100302301s 4.625202034768840779DL> 8.83671808803843939D 
XOv 46013776649406647200-1/ : 

DATA D/1.009122848193537915665001_, 5264433569561803199D1, 
X1.0664518376991388302, 8697311097125289802D1ls 3414971849170440750D 
Xlq 36 79559003762122243D0, 9-08804569188869219D0-2/ 

DATA E/-9499999999999973414D—-1y —3444061995006684895D1, 
X—402753267120198853902, ~2239601943247490540D3,4 
X-6.16885210055476351D39 —-6057609698748021179D3,% 
X-2¢10607737142633289035 —1448990849972948169D1/ 

DATA F/10093 -64061995006459804D1 94294345070209903645D2, 
X361902723748954330403y 14603370753085840977D04,y 
X21 26324145355778350304, 1611497752871096620D4,- 
X2¢3781389910216022103/ 

DATA X0/ «37250741078 1366634D0/ 

X=X1 

1 TF(X.LE20-000) GO TO 100 

IF(X.GE.12-90) GO TO 60 

TF(X.GE26-D9) GO TO 40 


X IN (096) 


T=X+X 
T=T/32000-2.000 
PX(10)=020D0 

QX (10) =0.000 
PX(9)=P1(9) 
OX(9)=Q1(9) 


THE RATIONAL FUNCTION IS EXPRESSED AS A RATIO OF FINITE SUMS OF 
SHIFTED CHEBYSHEV POLYNOMIALS AND IS EVALUATED BY NOTING THAT 
T*(X)=T(2X-1) AND USING THE CLENSHAW-RICE ALGORITHM FOUND IN 
REFERENCE (4). 


DO 10 L=2e8 
T210-L 
PX(L)=T#PX(T+#1)—-PX(1+2)+PL(1) 
10 OX( 1) =T#*QX( 141) -QX(1+2)401 (1) 
R=(o5DO*XT#PX(2)—PX(3)+P1(1))/ (-500*T*OX(2)-OX(3)+Q1(1)) 


(X=X0}=(X—X1) X29 WHERE X1=4095762295860/2*%*40 AND 
X2=—676717725019939400-12.- 


XMXO= (X=4095 76229586.D00/1099511627776eD0) ~e 76717725019939400—12 
IF(DABS(XMXO) .LT» 03700) GO TO 15 
DET =DLOG(X/X0)+XMXO#*R 
RETURN 
15 Y=XMX0/X0 
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A RATIONAL APPROXIMATION TO LOG(X/X0)=LOG(1+Y), WHERE Y=(X-X0)/X0 
AND DABS(Y) IS LESS THAN ely THAT IS FOR DABS(X=X0O) LESS THAN .037 
SUMP=((((PO0(6)*Y+P0(5) )*¥+P0(4))¥*Y¥+P0(3))*Y+P0(2))*¥+P0(1) 
SUMQ=( (( (Q0 (6) ¥¥+Q0(5)) #¥+00 (4) ) #¥4+Q013)) *Y¥+Q0(2)) *¥+00(1) 
DEI=( SUMP/ ( SUMQ*X0) +R) *XMXO 
RETURN 
C X IN (6912) 
40 DENM=P2(9)+X 
FRAC=Q2(8) /DENM 
C THE RATIONAL FUNCTION IS EXPRESSED AS A J-FRACTION. 
DO 25 J=2,8 
1=9-J 
DENM=P2(1+1)+X+FRAC 
25 FRAC=Q2(1)/DENM 
DEI=DEXP(X)*((P2¢1L)+FRAC)I/X) 
RETURN 
60 IF(XeGEe24-00) GO TO 80 
C X IN (12,24) 
DENM=P3(10)+X 
FRAC=Q3(9)/DENM 
C THE RATIONAL FUNCTION IS EXPRESSED AS A J-FRACTION. 
DO 26 J=2,9 
1=10-J 
OENM=P3 (¢1+1)+X+FRAC 
26 FRAC=Q3(1)/DENM 
DEI =DEXP(X)*((P3(1)+FRAC)I/X) 
RETURN 
( X GREATER THAN 24 
80 IFUX»LE*%174-67300) GO TO 90 
( X IS GREATER THAN 174.673 AND DEI IS SET TO INFINITY ON IBM S/360 
DEL=7.2075 
RETURN 
90 Y=1.0D0/X 
DENM=P4(10)+X 
FRAC=04(9)/0ENM 
(© THE RATIONAL FUNCTION IS EXPRESSED AS A J-FRACTION. 
NO 28 J=2%9 
I=10-J ; 
DENM=P4(1+1)+X+FRAC 
28 FRAC=Q4(1)/DENM 
DEI=DEXP(X)*(Y+Y*YS(P4(1)4+FRAC)) 
RETURN 
100 IF(X.NE.0.D0) GN TN 101 
C X =O AND DEI IS SET TO -INFINITY ON IBM S$/360 
DET=-7.2075 
PRINT 500 
500 FORMAT(S57HODET CALLED WITH A ZERO ARGUMENT, RESULT SET TO -INFINIT 
XY) 
RETURN 
101 Y=-X 
110 W=1-0D0/Y 
IF(Y.G6T.4.000) GO TO 300 
IF(Y.GT.21.000) GO TO 200 
CX IN (-1,0) 


an 


DEIT=DLOGEY) —C CC ( (ACE) FYFA (5) FYFA (4) )FYFA(3)) FYFA(2) FYFACL IDS 
X (C0008 06) *¥4B 05) *¥+B8 04) )*Y4B03) )*Y4+B(2))*¥4B01)) 
RETURN 


200 DEI==DEXP(-Y)*( (00 0001008) XWHC (7) DEWEC (6) WHC (5) EWHC (4) ) HW4CO 03D) 
CX IN (49-1) 
XeWHC (2) PRWEC (LIS OOOO 
XD0(3))*W+D(2))*Wt+DC1L))) 
RETURN 
Cc X LESS THAN -4 
300 DEI =-DEXP(-Y) *(WE(1 SOD0FWH (C100 0EL 
XE(4)) *WHE (3) FHFE C2) FWHECLIIZ OOO 
XaW+E (4) * WHE (3) DEWHE (2) WEF OLD DD) 
RETURN 
END 


((0(8)*W+D(7)) *W4D (6) )FW+D(5) )FW+D 04) EWE 


8) *W+HE (7) )*WHE (6) WHE (5) EWS 
CCF (8B) WHF (7) ) WHE (6) ) FWHF (57) 


CERTIFICATION OF ALGORITHM 385 [S13] 
EXPONENTIAL INTEGRAL £7 (x) (Kathleen A. 
Paciorek, Comm. ACM 18 (July 1970) 444-445) 

Epwarp W. Ne* (Recd. 2 Jan. 1970) 
Jet Propulsion Laboratory, California Institute of 
Technology, Pasadena, CA 91103 


* This paper presents the results of one phase of research car- 
ried out at the Jet Propulsion Laboratory, California Institute 
of Technology, under Contract NAS7-100, sponsored by the 
National Aeronautics and Space Administration. 


COLLECTED ALGORITHMS (cont.) 


KEY WORDS AND PHRASES: exponential integral, special 
functions, rational Chebyshev approximation 
CR CATEGORIES: 5.12 


General discussion. This algorithm computes, for x > 0, 


t . 
Ei(z) = £7 < dt and —E,(z) = Ei(—z). It is a straightfor- 


0 f 

ward implementation of approximations produced by Cody and 
Thacher, the references as given in the algorithm. It fills a gap left 
by previously published algorithms, e.g. Clenshaw et al. [1] and 
IBMSSP [2], in that it computes Hi(x) for all values of real x 
within computer restrictions and that it is done with comparably 
high precision. Moreover, it is based on more efficient approxima- 
tions than those used in the algorithms mentioned above. How- 
ever, it is inferior in one aspect to Clenshaw et al. in that the type 
of approximations used makes it difficult for implementing an 
algorithm of variable precision, a feature included in Clenshaw 
et al. 

The documentation and design of this algorithm are very good 
with clear reference to the method used, the amount and result of 
testing, the machine dependent features, etc. A minor defect is 
that the data are not identified by comments, probably because 
they can be recognized readily in the main body of the code. 

Testing. This algorithm was compiled and executed without 
any modification on a UNIVAC 1108 computer. It was tested 
against a reference subprogram QEIEI which computes Hi(z) in 
extended precision using a package of subroutines in 70-bit (about 
21 decimal) arithmetic, written by Dr. C. L. Lawson and associates 
at the Jet Propulsion Laboratory. The subprogram QEIEI, 
written by the present author, computes Ei(x) from truncated 
Chebyshev series for negative z [3], and from Taylor and asymp- 
totic expansions for positive x [4, eqs. (2.1), (2.2), (2.3)]. QEIEI 
itself has been tested for some overlap ranges of values of x where 
more than one computational method is applicable and is be- 
lieved to be correct to at least 19 significant decimal digits (except 
when z is very close to the zero of Ei, x = 0.3725 where relative 
accuracy is poor). 

For the seven intervals of z as indicated in this algorithm, tests 
were made of the algorithm against QE1EI which was considered 
as producing the ‘‘correct’’ function values. Each interval was 
partitioned into 1000 subintervals of equal length and in each 
subinterval one test value of x was selected using a uniform 
pseudorandom number generator. The results of the tests are as 
follows: 


Interval of x oe RES Reals 
{[—150, —4] 2.2D-16 5.1D-17 
[=4,-=1] 7.5D-17 1.2D-17 
[~1, 0] 8.7D-18 1.4D-18 
(0, 0.5] 1.8D-16« 3.2D-17« 
(0.5, 6] 5.5D-17 1.0D-17 
(6, 12} 1.6D-17 3.0D-18 
(12, 24] 2.9D-17 7.1D-18 
[24, 100] 8.9D-17 1.9D-17 


The errors marked by * are adjusted to exclude the subinterval 
(0.37245, 0.37255] in which QE1EI does not have sufficient relative 
accuracy to give meaningful comparison with this algorithm, 
which is coded in such a way as to retain good relative accuracy 
near Xo & 0.3725 --- . In fact, 2o is given in the data as two con- 
stants with a total accuracy of 79 bits, so that on the computer 
with an N-bit mantissa, this algorithm produces good relative 
accuracy for |e _ xo > 2N-79, However, the additional relative 
accuracy thus obtained is based on the assumption that 2 is 
exactly zero in the (N + 1)th through 79th bits—an assumption 
not too realistic in most applications. 

We observe that the errors found are smaller than those ob- 
tained by the author of the algorithm. This is due to smaller 
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truncation error for long precision on the UNIVAC computer 

(~1.D-18). 

In the range | x — ao | < 0.37, the author supplied an approxi- 
mation for log(z/zc). Such approximation is in the form of a 5-5 
rational function (i.e. a fifth degree polynomial divided by another 
fifth degree polynomial). It should be noted that there exists in 
the literature a 2-2 rational approximation suitable for the same 
purpose. (See [5, p. 111, Index 2720].) 

The two error exits occur for Ei(0) = —, and Hi(z > 
174.673) sy «©. These were tested and return -+7.2D75 which are 
approximately the smallest negative or largest positive floating 
numbers for the IBM 360. No timing test was performed owing to 
the apparent lack of reliability of time accounting on the UNIVAC 
1108 EXEC-8 system used here. 

I am indebted to C. L. Lawson and W. J. Cody for helpful dis- 
cussions. 

REFERENCES: 
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REMARK ON ALGORITHM 385 [813] 

EXPONENTIAL INTEGRAL Ei(x) [Kathleen A. 
Paciorek, Comm. ACM 18 (July 1970), 446-447] 

K. A. Repisx (Reed. 3 Aug. 1970) 

Department of Applied Mathematics, 
McMaster University, Ontario, Canada 


KEYWORDS AND PHRASES: ANSI Fortran standard 
CR CATEGORIES: 4.0, 4.22 


(a) This algorithm does not conform to the standard in that the 
DATA statements contain array names. Section 7.2.2 of ANSI 
Fortran standard [Comm. ACM 7 (Oct. 1964), 590-625] (1) states 
that the list(s) of a data statement contain ‘“‘names of variables 
and array elements.’’ It is therefore necessary to list the elements 
singly. (A more readable layout can be obtained in one of the 
following ways: 

DATA 

1 A(1)/~-5.77215664901532863 D —1/, A (2)/7.54164313663016620D —1/, 


2 A(3)/  1.29849232927373234D—1/, A (4)/2.40881355683977413 D —2/, 
3 A(5)/ 1.82084309209600371D—3/, A (6)/6.57739399753264501 D —5/ 


Hamilton, 


or 


DATA A(1) ; A(2) 
1 /—5.77215664901532863D—1 , 1.54164313663016620D —1/, 
2 A(@3) , A(4) 
3 / 1,29849232927373234D—1 , 2.40681355683977413D—2/, 
4 A() , A(6) 
5 / 1.32084309209609371D—3 , §.57739399753264501 D --5/ 


The latter example might well be broken into three separate 
data statements.) 

(b) In the discussion of Machine dependent features it is noted, 
in particular, that references are made to the largest positive real 
number and (in effect) its natural logarithm. These references are 
buried in the code, at the statement numbered 80, 2 lines later, 
and 2 lines after the statement numbered 100. I feel that these 
should, at least, be defined by DATA statements at the head of 
the program. In fact, perhaps the time is now ripe for standard 
names and definitions of these and other environmental entities. 


COLLECTED ALGORITHMS (cont.) 


Remark on Algorithm 385 [S13] 

Exponential Integral Ei(x) 

[Kathleen Paciorek, Comm. ACM’ 1 3 (July 1970), 
446-447] 


Michael J. Frisch [Recd. 27 Jan. 1971] 
University Computer Center, University of Minnesota, 
Minneapolis, MN 55455 


Key Words and Phrases: ANSI Fortran standard 
CR Categories: 4.0, 4.22 


The following items were found during compilation of the 
algorithms written in Fortran published to date in Communica- 
tions. The MNF compiler written at the University of Minnesota 
for CDC 6000 Series machines by Lawrence A. Liddiard and E. 
James Mundstock was used to check the validity of the algorithms. 


Algorithm 385 does not conform to the standard in that the 
function name DEI appears in a type statement (Section 8.3.1). 
It should not appear there, and the function statement should be 
DOUBLE PRECISION FUNCTION DEI (X1). The third state- 
ment (PRINT 500) after the statement numbered 100 is not among 
the statements allowed in standard Fortran. A comment card sepa- 
rates the initial line from the continuation line in the statement 
numbered 200 contrary to Section 3.2.1. 
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ALGORITHM 386 

GREATEST COMMON DIVISOR OF n INTEGERS 
AND MULTIPLIERS* [A1] 

Gorpon H. Brapiey (Reed. 14 Oct. 1969, 28 Nov. 1969, 
and 26 Feb. 1970) 

Administrative Sciences Department, Yale University, 
New Haven, CT 06520 


* This research supported in part by funds from the Yale Com- 
puter Center. 


KEY WORDS AND PHRASES: greatest common divisor, 
Euclidean algorithm, number theory, diophantine equations 
CRCATEGORIES: 3.15, 5.10 


DESCRIPTION: 
The algorithm calculates the greatest common divisor, IGCD, 
of n integers A(z). Multipliers Z(z) are constructed so that 


IGCD = A(1) X Z(1) +++: + Am) X Zn). 


Details of the method and comparisons to other algorithms are 
given in [I]. 

The algorithm is a new version of the Euclidean algorithm for 
n integers. The algorithm first calculates gcd(A(1), A(2)), then 
ged(gced(A(1), A(2)), A(3)), ete. The n — 1 calculations of the 
greatest common divisor of two integers is accomplished by means 
of a modified version of the Blankinship algorithm which is de- 
scribed in [1]. The n — 1 sets of multipliers are then used to cal- 
culate the multipliers for the A(z). 

If the m — 1 applications of the gcd algorithm for two integers 
requires a total of k iterations, then the algorithm requires 
2(n — 1) + 2k multiplications, k + n — 1 divisions, and 2k addi- 
tions. The number of arithmetic operations is less than indicated 
in [1] due to a modification noted below. In [1] the following bound 
on k is given. 

THEOREM. k is never greater than n — 2 plus five times the number 
of digits in A(1). 

CorouuaRy. k is less than n — 1 plus the logarithm of A(1) to the 
base 1.6. 

This bound can be achieved. The bound on k can be reduced by 
having A(L) be the smallest number (in absolute value) among the 
A(t). 

Tf at.some step of the algorithm the gcd becomes one, then the’ 
gcd calculations are terminated. There is a reduction in the num- 
ber of arithmetic operations in this case. 

If all input integers are zero, then output is zero gcd and all 
multipliers zero. 

The multipliers constructed by the algorithm are, in general, 
not small numbers. A minimal set of multipliers described in {1] 
can be constructed by a slight modification of the FORTRAN 
program. 


REFERENCES: 
1. Brapiey, G. H. Algorithm and bound for the greatest com- 
mon divisor of n integers. Comm. ACM 13 (July 1970), 433-436. 
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ALGORITHM: 


aAANNANgD 


SUBROUTINE GCON 

* (NyAvZ,1GCD) 
N NUMBER OF INTEGERS 
ACI) INPUT 
INPUT IS DESTROYED. 
OUTPUT ARRAY OF N MULTIPLIERS 
OUTPUT, GREATEST COMMON DIVISOR OF THE ACI) INTEGERS 


Z(1) 
1GCO 


DIMENSION A(50),Z(50) 
INTEGER AgZoClyC2e¥leV¥29Q 
FIND FIRST NON-ZERO INTEGER 
DO 1M = 19N 
TF(A(M).NE.O) GO TO 3 
1 Z(M) = 0 


C ALL ZERO INPUT RESULTS IN ZERO GCD AND ALL ZERO MULTIPLIERS 


Cc 


I1GCD = 0 
RETURN 
IF LAST NUMBER IS THE ONLY NON-ZERO NUMBER, EXIT IMMEDIATELY. 
3 IF(M.NE.N) GO TO 4 


~ IGCD = A(M) 
Z(M) = 1 
RETURN 

4 MPL = M+ 1 


MP2 = M + 2 


C CHECK THE SIGN OF A(M) 


ao 


Oa 


oO 


Cc 


TSIGN = 0 
IF(A(M).GE.0) GO-TO 5 
ISIGN = 1 

A(M) = -A(M) 


CALCULATE -GCD VIA N-1 APPLICATIONS OF THE GCD ALGORITHM FOR TWO 


INTEGERS. SAVE THE MULTIPLIERS. 
5 Cl = A(M) 
DO 30 I = MPlyN 
IF(A(I)-NE.0) GO TO 7 
A(T) = 1 
Z{I) = 0 
GO TO 25 
7Yle1 
Y2 = 0 
C2 = IABS(A(TI)) 
10 Q = C2/Cl 
C2 = C2 - O*C1L . 


TESTING BEFORE COMPUTING Y2 ANDO BEFORE COMPUTING Y1 BELOW SAVES 
ADDITIONS AND N — 1 MULTIPLICATIONS~ 
IF(C2.E0.0) GU TO 20 


Y2 = Y2 - O*Y1 
QO = C1/C2 
C1 = Cl — Q*C2 
IF(C1.E9.0) GO TO 15 
Yl = YL - Q*Y2 
GO TO 10 

15 Cl = C2 
Yl = Y2 

20 Z(1) = (Cl — YL¥A(M))/AUT) 
A(t) = YL 
A(M) = 


TERMINATE GCD CALCULATIONS IF GCD EQUALS ONE. 


25 IF(CLeEQs1) GO TO 60 
30 CONTINUE 
40 IGCD = A(M) 
CALCULATE ‘MULTIPLIERS 
00 50 J = 


Z(K)"S Z(K)*A(KK) 
50 A(K) =A(K)*A(KK) 
Z(M) = A(MPL) 
IF(ESIGN.EQ.0) GO TO 100 
Z(M) = ~Z(M) 
100 RETURN 
GCD FOUND, SET REMAINDER OF THE MULTIPLIERS EQUAL TO ZERO. 
60 IPL = 1 +1 
DO 65 J = IP1,N 
65 Z(J) = 0 
GO TO 40 
END 


0 


ARRAY OF N INTEGERS, ACI) IS USED AS WORKING STORAGE,» 


N- 1 


COLLECTED ALGORITHMS (cont.) 


Certification of Algorithm 386 [A1] 


Greatest Common Divisor of n Integers and Multipliers 
[Gordon H. Bradley, Comm. ACM 13 (July 1970), 447] 


Larry C. Ragland and Donald I. Goad [Recd. 18 
June 1971, 22 August 1972, and 6 November 1972] 
Department of Computer Sciences, The University of 
Texas at Austin, Austin, TX 78712 


Key Words and Phrases: proof of algorithms, greatest common 
divisor, Euclidean algorithm, inductive assertion method 
CR Categories: 3.15, 4.42, 5.10, 5.24 


Subroutine GCDN, Algorithm 386 as described in [1, 2], com- 
putes the greatest common divisor, JGCD, of n integers A(1),..., 
A(n) by using the Euclidean algorithm to compute first gcd(A(1), 
A(2)), then gcd(gcd(A(1), A(2)), A(3)), etc. It also computes in- 
teger multipliers Z(1),..., Z(a) such that IGCD = >?) A(i)Z(i). 

A formal proof that a modified version of GCDN performs 
these two tasks has been constructed and is available from the 
authors. The proof employs a slight variation of one of the inductive 
assertion method techniques described in [3, 4]. Eight points in the 
program were tagged with assertions and the verification conditions 
for the 20 resulting paths were constructed automatically and 
proved manually. The initial assertion used in the proof is 


1 < No < dimension (A) = dimension (Z) 


and the final assertion is 


IGCD = ged(A,(1),..., Ao(No)) and 
No 
IGCD = dD A(i)Z(i). 


A variable with a zero subscript denotes the value of that variable 
at the time the subroutine is entered, and a variable without the zero 
subscript denotes the value of the variable when the subroutine 
terminates. A proof of termination is not included, but termination 
can be deduced from the bounds Bradley describes for the algo- 
rithm in [2]. 

Three modifications of the program were necessitated by errors 
in the original algorithm. 
(a) The two statements following statement 3 
IGCD = A(M) 
Z(M) = 1 
should be replaced by 
IGCD = IABS(A(M)) 
Z(M) = A(M)/IGCD 


so that a positive greatest common divisor will result when all 
elements of array A are zero except the last, and it is negative. 
(b) The second statement after statement 40 


K=I1-J+2 
should be replaced by 
K=I1-J+ MP1. 


The statement replaced is correct only if the first element of array A 
is nonzero, in which case MP1 = 2. 
(c) Statement 60 


60 /P1 = 1+1 
should be replaced by 


60 IF(ILEQ.N)GO TO 40 
IPl =I+1, 
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This is necessary when the greatest common divisor becomes one 
on the last element of array A. If No is strictly less than dimension 
(Z) then this last change may be omitted; however, this leads to the 
possibility of the value of the initial parameter of a DO statement 
being greater than the value of the terminal parameter. This problem 
is discussed below. 

The proof of GCDN assumes that DO statements consist of the 
following four steps. 


Step 1. 
Assign the control variable the value of the initial parameter. 
Step 2. 
Execute the body of the DO statement. 
Step 3. 
If control reaches the terminal statement, execute the terminal 
statement and increment the control variable by the incremen- 
tation parameter. 
Step 4. 
If the value of the control variable is less than or equal to the 
value of the terminal parameter, go back to 2; otherwise the 
DO is satisfied and execution continues out of the statement. 


This interpretation of the DO statement makes it necessary to insert 
the statement 


I=N 


following statement 30. 

For implementations in which DO statements are not handled 
as described above, other program modifications may be necessary. 
For example, according to the Fortran standard [5], at Step 1 the 
value of the initial parameter must be less than or equal to the value 
of the terminal parameter and in Step 4, if the DO is satisfied, the 
control variable becomes undefined. In subroutine GCDN, the only 
DO statement in which the value of the initial parameter may be 
greater than the value of the terminal parameter is DO 50 J = 
MP2,I. The program will give the correct result whether this loop is 
executed once (as in the proof) or is bypassed; however, if a fatal 
error will result, then the statement 


IF(MP2.GT.DGO TO 51 
should be inserted before the statement 
DO 50 J = MP2,1 


and the statement following statement 50 should be labeled 51. In 
many implementations the control variable remains defined at the 
last value used in execution when the DO is satisfied. In this case the 
statement J = N, which was inserted earlier, may be omitted. This 
statement is necessary if the control variable becomes undefined, or 
if the control variable remains defined at its last value used in execu- 
tion plus the incrementation parameter (as in this proof). 
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ALGORITHM 387 

FUNCTION MINIMIZATION AND LINEAR 

SEARCH [4] 

K. Fretpine (Recd. 23 Sept. 1969) 

Computing Centre, University of Essex, Wivenhoe Park, 
Colchester, Essex, England 


KEY WORDS AND PHRASES: function minimization, rela- 
tive minimum, quasi-Newton method 
CR CATEGORIES: 5.15 


[Eprror’s NoTE. According to tests made by the referee this 
algorithm is slower than FLEPOMIN, Algorithm 251, Comm. 
ACM 8 (Mar. 1965), 169-170. However, in two out of six tests 
FLEPOMIN failed and BROMIN did not fail to find a mini- 
mum.—L.D.F.]| 


procedure Bromin (n, iterations, number, macxiters, toliter, toler- 
ance, x, f, g, h, compute f, compute g, converged); 
value n, tierations, toliter, tolerance, maxiters; 
integer n, iterations, number, maxiters; 
real toliter, tolerance, f; 
array x, g, h; Boolean converged; procedure compute f, 

compuie g; 

comment | This procedure minimizes a function using the method 
of Broyden [1]. The parameters are described as follows. n is 
the number of independent variables. iterations is an upper 
limit on the number of iterations allowed. On exit number is the 
actual number of iterations taken. maziters is the maximum 
number of function evaluations allowed on each linear search. 
toliter is the convergence limit for Linmin 2. tolerance is used as 
the convergence limit. A solution is assumed to have been reached 
if g(x)g’ (x) < tolerance. x[1:n] is an estimate of the solution. On 
exit it is the best estimate of the solution found. f is the current 
function value f(x). g[1:n] is the current gradient vector of f(z). 
h{1:n, 1:n] is the inverse Jacobian at the solution if number = n 
and if converged = true on exit. compute f(x, f) is a procedure 
provided by the user to evaluate the function at any point. 
compute g(x, g) is a procedure provided by the user to evaluate 
the gradient vector at any point. converged is a Boolean vari- 
able used as follows: 

On entry converged = true implies that 2, f, g, and h all have 
been assigned values, if converged = false however it is assumed 
that just x has been assigned a value and h will be set to a unit 
diagonal matrix. 

On exit converged = true means that asolution has been found, 
converged = false means that no solution has been found. How- 
ever x is. set to the best point found so far while the function 
value, gradient vector, and estimated inverse Jacobian cor- 
responding to x are inf, g, and h. 

The procedure Linmin 2 (n, maxiters, toliter, x, f, compute f, 
t, p) is used to find a linear minimum on each iteration. 

REFERENCE: 
lt. Broypgen, C. G. The convergence of a class of double-rank 

minimization algorithms. J. Inst. Math. Appl. (to appear); 
begin 
integer 7,7; real norm, t, ythy, pty, temp; 
array p, y, hy [1:n]; 
if “| converged then 
begin 

comment [Initialize g, f, h and converged; 

compute f(x, f); compute g(x, g); 

converged := true; 

for 7 := 1 step 1 until n do 
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begin 
h{i, 7] := 1.0; 
for j := 7+ 1 step 1 until n do 
h[i, 7] := hj, t) := 0.0 
end of loop on 7 to set up h 
end of initial set up 
start of main loop on number; 
for number := 1 step 1 until zierations do 
begin 
for 1 := 1 step 1 until n do 
begin 
comment 
pit] := 0.0; 
for j := 1 step 1 until n do 
pli] := plz] — hf, J] X gly] 
end of loop on 7 to evaluate p; 
Linmin 2 (n, mactters, toliter, x, f, compute f, t, p); 
comment Finds the optimum value of ¢ and the values of x 
and f associated with it; 
for 7 := 1 step 1 until n do 
ylz] := gle; 
comment Use y as a temporary storage location for the old 
gradient before evaluating the new one asy = g new — gold; 
compute g(x, 9); 


Evaluate the search vector p; 


norm := 0.0; 
for 1 := 1 step 1 until n do 
begin 

norm := norm + g[t] T 2; 


ylt] := git] — ylz] 
end of loop to calculate g’g and y; 
ythy := pty := 0; 


for 7 := 1 step 1 until n do 
begin 
hy{t] := 0; 


for j := 1 step 1 until n do 
hyli] := hylé] + Ale, j] X ylil; 
ylhy := ythy + yli] X hy[d]; 
ply := pty + pit] X y[7] 
end of loop to evaluate hy, p’y and y’hy; 
temp := ythy / ply + t; 
for 71 := 1 step 1 until n do 
begin 
h(i, 7] := Afi, 7] + ((p[e] X temp—2.0 X hylt]) X pli])/pty; 
for j := 1+1 step 1 until n do 
Al, j] := hlj, t] := Aft, 7] + Cplt] X temp — hyft]) 
x ply] — hylg] X plt))/pty 
end of loop to update the matrix h; 
if norm < tolerance then go to successful 
end of main loop on number; 


number := iterations; 
converged := false; 
successful: 


end of procedure Bromin; 

procedure Linmin 2 (n, maziters, toliter, x, f, compute f, t, p); 
value n, maxiters, toliter; 
integer 7, maziters; real toliter,f,t{; array z,p; procedure 

compute f; 

comment This procedure carries out a linear search over ¢. It 
considers f(z+p X t) as a function of ¢ alone. f is evaluated for 
three points. It is now assumed that f(t) can be approximated 
by a quadratic. If this quadratic has a minimum, then this is 
taken as a better estimate of the minimum of f(t). If, however, 
the quadratic is concave, a step is taken in the direction of the 
best point so far. If the four points obtained form an increasing 
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or decreasing sequence with respect to ¢ then the largest is re- 
jected. If they do not, then they must bracket a local linear 
minimum and the three points retained are those that most 
closely enclose this minimum. This process is repeated until it 
is felt that a good estimate of ¢ is available (see parameter 
toliter), or until some limit on the number of function evalua- 
tions is violated (see parameter maziters). The parameters are 
described as follows. n is the number of variables. maziters is the 
maximum number of function evaluations allowed in the linear 
search. toliter is the tolerance for minimization, exit if abs((t — 
last t)/t) < toliter. x[1:n] is the array of independent variables. 
f contains the function value f(x). compute f(z,f) is the user 
provided routine to evaluate the function values at any point. 
t contains the best value of the scalar used for the step length. 
pli:n] is the vector which gives the direction of the step. If 
if is the final value of ¢ then the actual step taken is p X Uf. 
This routine is based on the procedure quadmin by Broyden [2]. 


REFERENCE: 
2. Broypen, C. G. A class of methods for solving nonlinear 
simultaneous equations. Math. Comp. 19 (1965), 577-593; 
begin 
integer 7, left, center, right, count; 
real alpha, beta, gamma, last t, ptp; 
array vt, phi [1:3]; 
procedure reject (j); 
value j; integer Jj; 
comment This procedure replaces one of the old values of ¢ 
and then sorts the remaining three in ascending order of ¢ in 
the array vt; 
begin 
procedure inierchange (2, j); 
integer 1, 7; 
comment if vf[i] > vt[j] interchange 7 and 7; 
begin 
integer k; 
if vt(z) > vi[j] then 
begin k := 7; 71:=j; j:=kend 
end of interchange 
start of reject; 
vilj] := t; phil] := f; 
interchange (center, right); 
interchange (left, center); 
interchange (center, right) 
end of reject; 
procedure basic 
comment This procedure evaluates a new value for x and the 
corresponding value of f; 
begin 
for 7 ‘= 1 step 1 until 7 do 
x[t] := a[z] + (¢—last t) X plz]; 
last t:= ¢; compute f(z, f) 
end of basic 
start of Linmin 2 itself; 
comment Initialize phi, vt, left, center and right; 
phi[l] := f; 
left := 1; center := 2; right := 3; 
last t := vt{1] := pip := 0.0; 
for 7 := 1 step 1 until n do 
ptp := ptp + pit] T 2; 
ptp := 1.0/sqrt(pip); 
comment pip is now used to limit the initial step; 
vi[2] := ¢ := if ptp < 1.0 then pip else 1.0; 
basic; 
phi[2] := f; vt[8] := t= t XK 2.0; 
basic; 
phi[3] := f; 
comment. Sets up first three values before entering main loop; 
for count := 3 step 1 until maziiers do 
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begin 
alpha := vt[2] — vt[3]; 
beta := vt[3] — vt{1]; 
gamma := vt{l] — vé[2]; 
alpha := — (phi[1]<alpha+ phi[2]X beta+ phi[3] 
xX gamma)/(alpha X beta X gamma); 
beta := (phi[1] — phi[2])/gamma — alpha X (vi{1] + vé[2]); 
comment If the quadratic through the three points is con- 
vex, t is chosen as the minimum of it. If it is concave, how- 
ever, ¢ is chosen as a step in the direction of steepest descent; 
t:= if alpha > 0.0 then — beta/(2.0 X alpha) 
else if phi [right] > phi [left] 
_ then 3.0 X vt [left] — 2.0 X vt [center] 
else 3.0 X vt [right] — 2.0 X vt [center]; 
if abs((t — last t)/t) < toliter then 
begin 
t:= last t; go to extt 
end of exit where minimum has been found; 
basic; 
if ¢ > vt [right] 
VV (t > ot [center] A\ f < phi [center]}) 
V (é > vt [left] A t < vt [center] A f> phi [center]) 
then reject (left) else reject (right); 
comment Choose which point to reject; 
end of main loop which used count as an index; 


exit: 
end of Linmin 2 
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ALGORITHM 388 

RADEMACHER FUNCTION [822] 

H. Htsner 

Forschungsinstitut des FTZ der Deutschen Bundespost, 
Darmstadt 

H. Kremer, K. O. Linn, anp W. Scuwerine (Recd. 
16 Jan. 1970) | 

Institut fiir Allgemeine Nachrichtentechnik der Tech- 
nischen Hochschule Darmstadt, Germany 


KEY WORDS AND PHRASES: Rademacher function 
CR CATEGORIES: 5.12, 5.13 


integer procedure radfun(k, x); 
value k, zx; integer k; real z; 

comment. The procedure radfun computes the Rademacher 
function r;(x) as defined in [1, 2, 3]. This definition is used in re- 
cent papers and differs from the original definition [4] by an op- 
posite sign. The Rademacher functions 7;,(z%) form an incom- 
plete set of orthogonal, normalized, periodic square wave 
functions with period equal to one. They assume only the values 
+1 and —1. The Rademacher function rz(z) may be defined 
either by the formula 

ry(x) = sgn[stn(Qr2kx)] (1) 

or by the following algorithm: 


. : ™m m+1 
Let x be in the interval a sa2< gen = 0, +1, --- 
then 
_ {+1 for m even 
na) \-1 for m odd. (2) 


The index k must be a nonnegative integer and the argument x 
can be any real number in the range —» <2 < o. 

Equations (1) and (2) show that r;(z) is piecewise constant and 
has 2+! jump discontinuities in the interval 0 < x < 1. The 
procedure radfun uses eq. (2) for computation. 


REFERENCES: 
1. Pauey, R. BE. A remarkable series of orthogonal functions. 
Proc. London Math. Soc. Ser. 2, 34 (1932), 241-279. 
2. Fing, N. J. On the Walsh functions. Trans. Amer. Math. 
Soc. 66 (1949), 372-414. 
3. MorGENTHALER, G. W. On Walsh-Fourier series. Trans. 
Amer. Math. Soc. 84 (1957), 472-507. 
4, Rapemacnser, H. EHinige Satze von allgemeinen Orthogonal- 
funktionen. Math. Ann. 87 (1922), 112-138; 
begin 
integer 7; 
%:= 2% — entier(x); 
r := entier(xX2T (k+1)); 
radfun := if r/2 = r + 2then 1 else — 1 
end 
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ALGORITHM 389 

BINARY ORDERED WALSH FUNCTIONS [S22] 

H. Htsner 

Forschungsinstitut des FTZ der Deutschen Bundespost, 
Darmstadt 

H. Kremer, K. O. Linn, anp W. Scuwerine (Recd. 
16 Jan. 1970) 

Institut fiir Allgemeine Nachrichtentechnik der Tech- 
nischen Hochschule Darmstadt, Germany 


KEY WORDS AND PHRASES: Walsh functions, binary ordered 
Walsh functions 
CR CATEGORIES: 5.12, 5.13 


integer procedure binwal (k, x); 
value k,z; integer k; real xz; 

comment The procedure binwal computes the binary ordered 
Walsh function w;(z) as defined in [1, 2, 3, 4]. These functions 
form a complete set of orthogonal, normalized rectangular func- 
tions which are periodic with period equal to one. They assume 
only the values +1 and —1. Using the Rademacher functions 
rz (x) [5], the function wz(x) may be defined in the following way: 

Write & as a binary number 


= > ait, a; € ©, 1), 
then 
wi(%) = IT [rs (x) ]**. 


The functions are defined for k a nonnegative integer in the range 
—-s7 Srl. 
In binwal the procedure radfun [5] is used. 
REFERENCES: 
1. Pavey, R. EH. A remarkable series of orthogonal functions. 
Proc. London Math. Soc. Ser. 2, 34 (1932), 241-279. 
2. Finer, N. J. On the Walsh functions. Trans. Amer. Math. 
Soc. 65 (1949), 372-414. 
3. MorGentHaLer, G. W. On Walsh-Fourier series. Trans. 
Amer. Math. Soc. 84 (1957), 472-507. 

4. Iiammonp, J. L., ann Jounson, R. 8S. A review of orthog- 
onal square-wave functions and their applications to 
linear networks. J. Franklin Inst. 278 (1962), 211-225. 

. Htsner, H., Kremer, H., Linn, K. O., snp Scowerine, W. 
Algorithm 388, Rademacher function. Comm ACM 13 
(Aug. 1970), 510; 


ou 


begin 
integer 1, 1, m, ww; 
L:= ky m:= ww := 1; 
t:= —1; 
for 1 := i+ 1 while m < 1 do 
begin 


if k/(m +m) #k + (m-+™m) then 
begin ww := ww X radfun(t, 2); k:= k-— mend; 


m:=m—-m 
end; 
binwal := ww 


end 
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ALGORITHM 390 

SEQUENCY ORDERED WALSH FUNCTIONS [S22] 

H. Hiner 

Forschungsinstitut des FTZ der Deutschen Bundespost, 
Darmstadt 

H. Kremer, K. O. Linn, anp W. ScHwerine (Recd. 
16 Jan. 1970) 

Institut fiir Allgemeine Nachrichtentechnik der Tech- 
nischen Hochschule Darmstadt, Germany 


KEY WORDS AND PHRASES: Walsh functions, sequency 
ordered Walsh functions 
CR CATEGORIES: 5.12, 5.13 


integer procedure seqwal(k, x); 
value k,z; imtegerk; real 2; 
comment The procedure seqwal computes the sequency ordered 

Walsh function wal;(x) as defined in (1, 2]. These functions form 

a complete set of orthogonal, normalized, periodic rectangular 

functions with period equal to one. They are closely related to 

the binary ordered Walsh functions w;(x) [3]. The set of wal; (x) 

consists of the same functions as the set of wz(z) but in another 

scheme of ordering. The set of w:(x) is ordered with regard to 

the binary decomposition of the index k, whereas the set wal; (x) 

is ordered according to the number of jump discontinuities in 

the open basic interval 0 < x < 1 in the sense that wal;(x) has 
exactly k jumps. The relation between wal;(x) and wz(x) is 
given by walk (x) = wa(x) withn = k ® (k + 2), where@ means 
the addition modulo 2 (binary addition without carry). The 
functions are defined for & a nonnegative integer in the range 
0 <2 < «. In seqwal the procedure binwal [3] is used. 
REFERENCES: 

1. WausH, J. L. A closed set of normal orthogonal functions. 
Amer. J. Math., Vol. 48 (1923), 5-24. 

2. Harmurn, H. F. Transmission of Information by Orthogonal 
Functions. Springer-Verlag, New York, 1969. 

3. Hiner, H., Kremer, H., Linn, K. O., ann ScHwERING, W. 
Algorithm 389, Binary ordered Walsh functions. Comm. 
ACM 18 (Aug. 1970), 511; 

begin 

integer 1, k2, 1, m, m2, n, v1, v2; 

k2:=k+2; l:=k; m:=1; n:=0; 

t:= 0; 

for 1 := 7+ 1 while m < ldo 

begin 

vl := 02 := 0; m2:=m+™m; 

if k/m2 # k + m2 then 

begin k := k — m; v1 := lend; 
if k2/m2 ~ k2 + m2 then 

begin k2 := k2 — m; v2 := 1 end; 
if vl + vthenn:=n+m; 


m:=m—+—e-m 
end; 
seqwal := binwal(n, x) 


end 
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ALGORITHM 391 | 

UNITARY SYMMETRIC POLYNOMIAIS [Z] 

Joun McKay (Recd. 9 Mar. 1970) 

Department of Mathematics, California Institute of 
Technology, Pasadena, CA 91109 


KEY WORDS AND PHRASES: symmetric polynomials, uni- 
tary symmetric polynomials 
CR CATEGORIES: 5.11, 5.30, 5.5 


procedure unitary (a, x, n); 
array @,2; integer 7; 
comment With; in 2z[7], 7 
symmetric polynomials a, = DS tei, 

a(r], 7 = 1, 2, -+: , n on exit. 

It is suggested that this algorithm replace Algorithm 156 
which is an O(2*) procedure for computing ) 7-1 (—1)™a,. 

It is optimal in storage and requires n(n—1) additions and 
3n(n — 1) multiplications. It has uses in the theory of symmetric 
functions since the unitary symmetric polynomials form a basis 
for the symmetric polynomials. These polynomials arise, too, 
in probability theory. In numerical analysis it may be of in- 
terest to compute the coefficients (—1)*a, of the monic. poly- 
nomial with roots 21 , #2, °°* , Zn Which is best done by altering 
the two lines 


1,2, +--+ ,n, on entry, the unitary 
- 2;, will be found in 


alk] := afk] + ¢ X alk —- 1); 
afl] := afi] + ¢ 
to 
alk] := alk] — t X alk — 1]; 
a(l] := afl] — ¢; 
begin 


integer 1,k; real ¢; 

for 1 := 1 step 1 until n do 

begin 
alt] := 0; t= 2[z]; 
for k := i step — 1 until 2 do 

alk] := alk] + ¢ X alk—J]; 

a{l] := afl] + ¢ 

end 

end unitary 


Remark on Algorithm 391 [Z] 

Unitary Symmetric Polynomials [John McKay, Comm. 
ACM 13 (Aug. 1970, 512] 

Giinther F. Schrack (Recd. 9 Nov. 1970 and 11 Jan. 
1971) 

Departments of Electrical Engineering and Computer 
Science, University of British Columbia, Vancouver 8, 
B.C., Canada 


Key Words and Phrases: symmetric polynomials, elementary 
symmetric polynomials, unitary symmetric polynomials, polynomial 
synthesis, reverse Horner scheme, reverse synthetic division, 
binomial coefficients 

CR Categories: 5.11, 5.30, 5.5 


To avoid using semicolons in the body of the comment, re- 
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place: 

the two lines ...; begin 

by 

the plus signs to minus throughout; begin 


Algorithm 391 has been tested on the IBM 360-67 with the 
OS Algol F compiler running under the Michigan Terminal Sys- 
tem. A number of sets of real x;, i = 1, 2,...,” with m = 1, 2, 
..., 10 with various positive, negative, and zero elements were 
used, drawn from a collection of test polynomials. Both versions 
of the algorithm produced correct results. 

Remarks. 
1. The modified version produces the coefficients of the monic 
polynomial 


f(Z) = agz” + az 4 ++) + aaaz + an, 


i.e. as the leading coefficient aj = 1 is implied it must be supplied 
by the calling program. Alternatively, the insertion of 


a[O] := 1; 


immediately preceding the first for statement will supply that 


coefficient. In this case line 3 of the comment should be replaced by 
a{r], r= 0,1,..., 7 0n exit. 


2. unitary may be used for the generation of complex elementary 
symmetric functions or complex polynomials from complex x; , 
provided all real parameters in the procedure are declared complex. 
3. The number of additions is 42(n+1), i.e. for n > 3 it is less 
than the number claimed. 

4. Consider a polynomial with real zeros x; only, and consider a 
deflation of that polynomial by Horner’s scheme (i.e. synthetic 
division) by the linear factor (x— x:). Again using Horner’s scheme, 
deflate the quotient by (x—-x2). Repeat this procedure for all 
zeros and call the resulting table the repeated Horner scheme. 
unitary is in effect the repeated Horner scheme carried out in re- 
verse order. Since the Horner scheme is optimal in the number of 
operations [1], so is unitary. 

5. The second version of unitary with the modification suggested 
in Remark 1 may be used to calculate all binomial coefficients 


(nm), m = 0, 1,..., ” by setting x; = 1, i = 1, 2,..., nm. The 
algorithm then represents an in situ generation of Pascal’s triangle 
with a; = (*), k = 1, 2,..., n, i= 0, 1,..., &. Because all 


x; are unity, this can be programmed using additions as the only 
arithmetic operations. Then the accuracy of the binomial coeffi- 
cients is limited only by the word length of the computer. 


References 
1. Pan, V.Ya. Methods of computing values of polynomials. 
Russian Math, Surveys 21 (1966), 105-136. 


re 
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ALGORITHM 392 

SYSTEMS OF HYPERBOLIC P.D.E. [D3] 

Rospert R. SmitH anp Dennis McCauu (Recd. 7 Jan. 
1969 and 17 June 1969) 

US Naval Electronics Laboratory Center, San Diego, 
CA 92152 


KEY WORDS AND PHRASES: hyperbolic p.d.e., character- 
istic, extrapolation, second order p.d.e., quasilinear p.d.e. 
CR CATEGORIES: 5.17 


DESCRIPTION: 
CHARAC solves the initial value problem for the quasilinear 
hyperbolic system of equations 


AU, + AU, + AsV: + AaV, = Ay 


(1) 
BiUz + BU, + BsVz + BaV, = Ha 


in two independent variables X, Y and two unknown functions 
U(X, Y), V(X, Y), where Ai = Ai(X, Y,U,V),-°++, He = Hi(X, 
Y, U,V). Specified data X;, Y;,U;,V; (¢=1,+-: , M) given along 
a noncharacteristic curve I are used to find U and V at character- 
istic grid points in the entire characteristic cone associated with 
the initial curve. Values in the opposite characteristic cone can be 
computed by specifying the initial data points X;, Y:, U;, V;in 
the opposite order (X;, Yi, U1, Vi becomes Xy¥,Y¥u,Uu,Vu, 
ete.). 

If the system (1) is hyperbolic, it can be reduced to a normal 
form containing directional derivatives along two characteristic 
directions. The derivation of this normal form is given in Forsythe 
and Wasow [1, p. 38]. 

For (1) the normal form is 


i= 1,2, (2) 


where (5/5.X); is the directional derivative along the characteristic 
with slope Ci. Let A = A.B; — A3B,, C= AB, - A,B:, B= 
3 (A1By—A,Bi— A3B2+A2B;). Then the coefficients in (2) are given 
by 


Pinan, — 7 = i 
oi(X, YU, V) = SIR ACY 


R(X, Y,U,V) = Ai(Bio;— Bz) — B,(Aiwi—A2), 
Si(X, Y, U, VY) = A3(Bio;— B2) == B;(A.oi— Az), 
T(X,Y,U,V) = H,(Bii:—B2) — H2(Awi—A2). 


The system (1) is called hyperbolic if B? — AC > 0 and if RiS2 — 
RS: ¥ 0. 

The subroutine CH VAR (XYUV, VAR) computes the values 
61,02, Ri, Re, S:, S2, 71, To from A;, B;:, H; evaluated at the 
values X, Y, U, V given in the array XYUV. (The subroutine 
CH COEF giving A;, B;, H; must be provided by the user, see 
Examples.) The computed values are returned in the array VAR 
of length 8. If o;, R;, S;, T; are known to the user, he may provide 
his own routine CH VAR. 

The system (2) is discretized by Massau’s method, which is 
described in Forsythe and Wasow [1]. Given two adjacent points 
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on the initial curve I, the nonparallel characteristics through the 
points intersect at a third point adjacent to the curve r. The values 
X,Y, U, V at the third point are estimated by replacing the dif- 
ferential equations in (2) by simple difference equations. The sub- 
routine CH STEP performs this discretization. By repeating this 
process for each pair of adjacent points on I, datum points are 
computed on a curve I” adjacent to I’ and inside the characteristic 
cone. If the initial curve has characteristic slope somewhere, the 
curve I’ will not be adjacent to r but will cross it. The routine does 
not recognize this, but it is obvious from the output. Successively 
calling CHARAC generates a sequence of adjacent curves until the 
entire characteristic cone is filled in. 

Extrapolation to the limit is applied to this discretization as 
follows: Compute the data on I’ by using only every fourth initial 
datum point on Tr. Then use every other initial datum point to 
estimate the data on an intermediate curve and then on I’. Finally 
use every datum point. Thus three estimates are found with dif- 
ferent step sizes hy , ho/2, and ho/4. One can then extrapolate these 
estimates to h = 0 in an attempt to obtain a better estimate. Nu- 
merical results have indicated that extrapolation does indeed sig- 
nificantly improve the estimates. In fact the method with extrapo- 
lation has an error of 0(h’) while Massau’s method alone has an 
error of o(h). The theoretical considerations of extrapolation are 
given by Bulirsch and Stoer [2], and applications to integration 
and ordinary differential equations are discussed. In general, ex- 
trapolation improves calculated results only if the exact solution 
is sufficiently differentiable. CHARAC can thus be expected to be 
an improvement over Massau’s method only when the coefficients 
of the system (1) and the initial data are sufficiently differentiable. 
Note that the extrapolation requires M = 4X N + 1 for some inte- 
ger N. 

CHARAC is defined by 


SUBROUTINE CHARAC (DATA, M, IFAIL). 


In the parameter list of CHARAC, M is the number of datum 
points on the initial curve. DATA is dimensioned DATA (4, M) 
(where M=4xN-+1 for some N) and the column DATA («,J) con- 
tains the four datum values X;, Y;, U;, V; of the Jth datum point. 
Upon calling CHARAC, the data on an adjacent curve I’ are com- 
puted and restored in DATA and M = 4 X N +1 is replaced by 
M—4=4x (N—-1) + 1. Hence CHARAC can immediately be 
called again with I’ the initial curve. Continuing until M@ = 1 will 
yield the single datum point at the apex of the characteristic cone. 
(See TEST CH used to solve example.) IFAIL igs a flag which is 0 
if the call to CHARAC was successful. If JFAZL = 1 upon return- 
ing, then CH VAR detected that B? -- AC < 0,so0 (1) was not hyper- 
bolic. If JFAIL = 2 upon returning, then CH STEP detected that 
o. = o2 or RS: = R2Si within a relative tolerance of 10-5; this 
tolerance parameter is represented by EPS in CH STEP. This 
indicates that (1) was not hyperbolic or that the characteristics 
are so close to parallel that the method fails. 
The user must provide a routine 


SUBROUTINE CH COEF (COEFF, XYUV) 


which computes the coefficients of the system (1) for the values 
X, Y, U, V given sequentially in the list XYUV of length 4. The 
computed coefficients must be returned in the list COEFF of length 
8 in the order A;, A2,A3, Aa, M1, Bi, B., Bs, Ba, Ae. 

Example (I). Unsteady, one-dimensional Isentropic Flow. 
(See Jeffrey and Taniuti [3, p. 71].) The system of equations for the 
flow velocity u(x, ¢) and the density p(z, ¢) in terms of the space 
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ALGORITHM: 


MAAAMAARAAAAAAMAAAN 


DIMENSION DATA 


SUBROUTINE TEST CH 
(4,81) 


N = 20 
C GENERATE INITIAL DATA. 


100 


9 
2 


00 
00 


*DATA(291)9 
910 FORMAT 


M=4*®N +1 


FM = 4.0 * FLOAT (N) 

00 100 I = 19M 

DATA(1L,yI) = FLOAT (1-1) / FM 
DATA(2,1) = 020 

DATA(391I) = 040 

DATA(491) = 260 * EXP (DATA(191)) 
CONTINUE 

IFAIL = 0 

WRITE (51, 900) 

FORMAT (1H1) 

00 250 I = lM 

WRITE (519910) DATA(1¢I)¢ 


DATA(3y1)_ DATAI4y1) 
(4H X =yE200995X94H Y =e 


*E200995X_94H U =yE2009% 
*5Xy4H V =E2029) 
250 CONTINUE 


300 


920 


GO TO 300 
GO TO 300 
IFAIL) 


IF (M.LE.1) 
IF(IFAIL.«NE.O) 
CALL CHARAC (DATAy My 
WRITE (51+ 900) 

GO TO 200 

CONTINUE 

WRITE (5l1_ 920) My IFAIL 

FORMAT (X93HM =912,8H IFAIL =912) 
RETURN 

END 

SUBROUTINE CH COEF 
DIMENSION COEFF(1LO)> 


(COEFF, XYUV) 
XYUV(4) 


C COMPUTES COEFFICIENTS Aly A2y A3% 
C A4y 
C STORES THEM SEQUENTIALLY IN COEFF. 


Hl, Bly B29 83+ B4y H2 AND 


COEFF(1) = 1.0 — XYUV(3)**2 
COEFF(2) = -XYUV(3) * XYUV(4) 
COEFF(3) = ~XYUV(3) *XYUV(4) 
COEFF(4) = 1.0 — XYUV(4)**2 
COEFF(5) = -4.0 * XYUV(3) * 
*EXP (XYUVO1L)) **2 

COEFF(6) = 0-0 

COEFF(7) = 1.0 

COEFF(8) = -1.0 

COEFF(9) = 0-0 

COEFF(10) = 0.0 

RETURN 

END 


SUBROUTINE CHARAC (DATAy My IFAIL) 

DIMENSION DATA(4, M)sD0(4), DLO4)» 
*02(4)_¢ D3(4), D4(4) 

DIMENSION T1(4)_5 T2(4)_ T34), 
*T4(4)_ T5(4)5 T6064), T7(4) 9 VI(B)+ 
*V2(B), V3(B), V4(08), V5(8)_ V6(8), 
*V7(8), V8(8), V9(8), V1LOB) 

NIMENSTON S1(4), S2(4), $3(4) 

INTEGER FAILs FLAG 

COMMON/CHFAIL/FAILs FLAG 

THIS ROUTINE ADVANCES ONE GRID STEP 
THE SOLUTION OF THE SYSTEM 

AL * U(X) + A2 * UY) 

+ AZ * VIX) + AS * VOY) = HI 

Bl * U(X) + B2 * UY) 

+ BB * V(X) + B& * VIY) = H2 

WHERE U(X) MEANS PARTIAL DFRIV. OF 
U WeReTe Xe FTCoy 

AND Al = Al (Xo Yo Uy V0e ---. 

THE INITIAL DATA IS GIVEN IN THE 
MATRIX DATA, EACH COLUMN OF FOUR 
ELEMENTS CONTAINING A VALUE 

Xo Yo Us Ve 

M IS THE NUMBER OF DATA POINTS 

ON THE INITIAL CURVE. 

FAIL = O 
Ms=M— 4G 
IF (Me.LE2O) RETURN 
00 145 J = 194 
OL(J) = DATA(Je1) 

D2(J) = NATA(J 92) 

O3(J) = DATA(Is3) 

145 D4(J) = DATA(Js4) 

CALL CH VAR (Dly V2) 

IF (FAIL.NE.O) GN TO 250 

CALL CH VAR (D2, V3) 

IF (FAIL-NE«O) GO TO 250 

CALL CH VAR (D35 V4) 

IF (FAIL.NE.O) GO TO 250 

CALL CH VAR (D4,_ V5) 

IF (FAIL.NE.O) GO TO 250 

FLAG = 0 
DY = 0202) = O12) 

DX = N2(l) - N11) 

IF (DY) 148, 149, 150 

148 DY = -NY 
OX = -DX 
GO TO 150 
149 DY = 1.0 
OX = 1.0€&30 
150 DXL = DY / V2(1) 

0X2 = DY / V212) 

IF ((OX1.LT«¢DX).AND.(0X2.GE2DX)}) 
*GO TO 170 
TF ((0X2eLTOX) AND. (0X1 -GE.DX)) 
*GO TO 175 
TF (DX2.GE.0X1) GO TO 175 


170 FLAG = 1 
175 CONTINUE 
CALL CH STEP 
*D3- V4y T1) 
IF (FAIL.NE.O) GO TO 250 
CALL CH VAR (Tly V6) 
IF (FAIL«NE.O) GO TO 250 
CALL CH STEP (D2_ V3y 
T6) 
IF (FAIL.NE-O) GO TO 250 
CALL CH VAR (T6» V1O) 
IF (FAIL.NE.0) GO TO 250 
CALL CH STEP (03, V4_ 
T2) 
IF (FAIL.«NE.O) GO TO 
CALL CH VAR (T2y V7) 
IF (FAIL.«NE.O) GO TO 250 
CALL CH STEP (D2, V3y 
V49 T4) 
IF (FAIL.NE.O) GO TO 250 
CALL CH VAR (T4y V9) 
IF (FAIL.NE.O) GO TO 250 
CALL CH STEP (T4, V9, 
T3) 
IF (FAIL.NE.O) GO TO 
CALL CH VAR (T3, V8) 
IF (FAIL.«NE.O) GO TO 
CALL CH STEP (Dly V2, 
TS) 
IF (FAIL»NE.O) GO TO 250 
CALL CH VAR (TSy V1) 
IF (FAIL.NE.O) GO TO 250 
CALL CH STEP (TS, Vly 


(Dly V2¢ 


250 


250 


250 


*T4_ V9" T4) 

IF (FAIL.NE.O) GO TO 250 
CALL CH VAR (T4+ V1) 

IF (FAIL.NE.O) GO TO 250 


CALL CH STEP (T4, 
T4) 
IF (FAIL.NE.O) GO TO 250 
CALL CH VAR (T4, V9) 
IF (FAIL.NE.O) GO TO 250 
00 100 I = 1yM 
DO 200 J = 1,48 
vi(g) = v2(J) 
vats) = VB(d) 
v3(d) = V4(J) 
vas) = V5SCS) 

200 CONTINUE 
DO 201 J = 
Dot) 
Di(s) 
n2tv) 
D3(S) 


Vis 


tom ou wt 


194 
DLJ) 
02) 
0305) 
D4(J5) 
D405) DATAC I, 1+4) 
CALL CH VAR (D4_ V5} 
IF (FAIL.NE.O) GO TO 


CALL CH STEP (DO, Vly 
*D49 V5_ $1) 
IF (FAIL-NE.O) GN TO 250 
CALL CH STEP (D2+ V3, 
V5— T5) 
IF (FAIL«NE.O) GO TO 
CALL CH VAR (T5y V1) 
IF (FAIL.«NE.O) GO TO 250 
CALL CH STEP (Thy V6, 
$2) 
IF (FAIL.NE.0) GO TO 250 
CALL CH STEP (D3, V4, 
Tl) 
IF (FAIL.NE.O) GO TO 250 
CALL CH VAR (Tls V6) 
IF (FAIL.«NE.O) GO TO 250 


201 


250 


250 


CALL CH STEP (T2y V7y 
*Tl, V6e T7) 
IF (FAIL«NE.O) GO TO 250 
NO 210 J = 14 
T2(J) = TliJ) 
V7(J) = V6(J) 
V7(5S#4) = VO6(J+4) 
T1(J) = T6(J) 
V6l(J) = VLO(J) 
V6(JSt4) = VLOCU+4) 
T6(J) = TH) 
v1l0(J) = V1ltJ) 
V1O(Jt4) = V1I(U+4) 
210 CONTINUE 
CALL CH VAR (T7y Vl) 


IF (FAIL.NE.0) GO TO 250 
CALL CH STEP (T3y V8, 


*T7y Vie T5) 
IF (FAIL«NE-O) GO TO 250 
DO 220 J = 194 
T30J) = TT) 
V8(J) = V1tJ) 
V8(S+4) = VI(S+4) 
220 CONTINUE 
CALL CH VAR (T5» vi} 


IF (FAIL.NE.O) GO TO 
CALL CH STEP (T4y V9, 
*T5, Vly S3) 
IF (FAIL.NE.0) GO TO 250 
DO 230 J = 194 
T4(5) = T5(d) 
VO(J) = VICI) 
V9O(Jt4) = VLC S44) 
230 CONTINUE 
C EXTRAPOLATE 'THE THREE 
C SUCCESSIVE APPROXIMATIONS 
DD 300 J = 194 
300 DATA (Jo1) = 0.3333333333 * 
*(8.0 * $3(J) - 6.0 * S2(y) + S1IJ)) 


250 
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100 CONTINUE 
IFAIL = 0 
RETURN 


C ERROR EXIT. 
250 IFAIL = FAIL 
RETURN 
END 
SUBROUTINE CH VAR (XYUVs VAR ) 
DIMENSION XYUV(4)>+ VAR(8)% T(10) 
INTEGER FAILy FLAG 
COMMON/CHFAIL/FAILy FLAG 
C COMPUTES THE VALUES SIGMA1L »SIGMA2»s 
C Rly R2e Sle S2e The T2 
C (STORED IN THE LIST VAR) 
C FROM THE COEFFICIENT FUNCTIONS 
C AND THE VALUES Xy Yo Us V 
C (IN THE LIST XYUV). 
CALL CH COEF (Ty XYUV) 
C COEFFICIENTS OF SYSTEM ARE STORED 
C IN THE LIST (T)e- 
A = TOL) * T(8) - TO3) * T(6) 
B = 065 * (TEL) * TI9) T(4) * 
*T(6) -— TO3) * T(7) + TC2) * T(8) ) 
C = T(2) * T19) = TI4) * TIT) 
IF (A.NE.0.0) GO TO 150 
LF (BeEQ-0.0) GN TO 500 
VAR(1) = 160E15 
VAR(2) = 0.5 * C / B 
LF (BeGT.0.0) GO TN 400 
VAR(1L) = VAR(2) 
VAR(2) = 1e0EL5 
GON TO 400 
10 D=B*B-A*C 
IF (DeLE-0-0) GO TO 500 
DO = SORT (0D) 
IF (BelLT~-0.0) GO TON 300 
VAR(1) = (B + D) SA 
VAR(2) = C / (A * VAR(L) ) 
GO TO 400 
300 VAR(2) = (B - D) / A 
VAR(L) = C / (A * VAR(2) } 
400 00 100 I = 192 
T(4) = TCL) * VAR( TE) ~ TC2) 
T(9) = T(6) * VAR(E) - TUT) 
VAR(I4+2) = TOLI*TOO) = T(6)*T(4) 
VAR (144) = TU3)*TO9) T(8)*T(4) 
VAR(I46) = TO5)¥*T(9) = T(10)*T(4) 
100 CONTINUE 
RETURN 
C ERROR EXIT. 
500 FAIL = 1 
RETURN 
END 
SUBROUTINE CH STEP 
*(DIy Vile D2, V2+ 03) 
DIMENSION D1(4)y D2(4)y D3C4)¢ 
*V1(8), V2(8) 


INTEGER FAILy FLAG 
COMMON/CHFAIL/FAIL+ FLAG 


C THIS ROUTINE COMPUTES THE VALUES 
C OF X39 Y39 U3s V3 AT THE POINT 
C DETERMINED BY THE INTERSECTION OF 
C THE CHARACTERISTICS THROUGH 
C XLyY¥1 AND X29Y2e 
EPS = 1.0F-5 
IF (FLAGeNE.O) GO TO 150 
100 SIG = V1(2) 
RL = VIC4) 
Sl = vil6) 
Tl = V1(8) 
GO TO 180 
150 SIG = V1(1) 
R1 = V1(3) 
Sl = V15) 
Tl = VIC?) 
180 CONTINUE 
IF (FLAGeNE-O) GO TO 250 
200 RHO = V2(1) 
R2 = V2(3) 
S2 = v2i5) 
T2 = Vv2(7) 
GO TO 280 
250 RHO = V2(2) 
R2 = V2(4) 
S2 = V2(6) 
T2 = v2(8) 
280 CONTINUE 
C COMPUTE X39Y39U39V3 
DEML = SIG ~ RHO 
IF (ABS(DEML) -LT-EPS*ABS(SIG)) 
* GO TO 900 
AA = 01(2) - SIG * D1(1) 
BB = D2(2) -— RHO * D2(1) 
D32 = (SIG * BB — RHO * AA) / DEML 
D31 = (8B - AA) / DEML 


TA = T1 * (D321 — DiC) } 
* + RL * O1L(3) + SL * DLI4) 
TB = T2 * (031 ~ O2t1) ) 

* + R2 * 02(3) + S2 * D2(4) 
TC = RL * S2 

TD = R2.* SL 

DEM2 = TC - TD 


TC = AMAXL (ABS(TC)» ABS(TD) ) 
IF (ABS (DEM2) -LEeEPS*TC) 
* GO TO 900 
p3(3) = (TA*S2 — TB¥S1) / DEM2 
D3(4) = (TB*¥RL — TA¥R2) / DEM2 
03(2) = 032 
03(1) = D31 
RETURN 
900 FAIL = 2 
RETURN 


END 


COLLECTED ALGORITHMS (cont.) 


variable z and time ¢ are 


plz + Upz + pe = 0 
(3) 


puus + pur + a’p, = 0. 


Assume the sound speed a = 1. Let the initial data given along the 
curve t= 0,0 <2<1beu(z,0) = 0 and p(z, 0) = 1+ cz for some 
constant c. 

Setting = y, u = U, and p = V, (8) has the form of (1) with 
Ai = V, A.=0, A, =U, A, =1, MW =0, Bi = UV, B. = V, 
B; = 1, B, = 0, A,=0. 

For c = 1 the problem is well conditioned. Solving this problem 
on a 10-digit machine using the 21 initial datum points X; = (j—1)/ 
20; Y; = 0; U; = 0; V; = 1+ X;; 7 = 1, -:: , 21, the following 
values were computed for the apex of the characteristic cone (by 
calling CHARAC 5 times): 


X = .4107503; Y = .5099940; 

U = —.3465748; V = 1.4142185. 
The correct values for the apex are 

X= .4107581; Y= .5099899; 


= —.3465736; V = 1.4142136. 


The maximum relative error is 1.9 X 10-5. Using 41 initial datum 
points and calling CHARAC 10 times, the computed values for the 
apex were 


X =  .4107572; Y = .5099904; 


U = —.3465737; V = 1.4142142. 


The maximum relative error is 2.2 X 10-*. Thus doubling the num- 
ber of points decreases the error by a factor of about 8, as would be 
expected for a third order method. 

The above problem was also solved for c = 10 using 21 initial 
datum points X; = (j-—1)/20; Y; = 0; U; = 0; V; = 14 10X;; 


j = 1,--- , 21. The computed values at the apex were 
X= 0905; Y = .6190; 
U = 1.2028; V = 3.3100; 
while the correct values are 
XxX = .0936; Y= .6176; 
U = 1.1990; V = 3.3165. 


Using 41 initial datum points the computed values are 
A= .0930; Y = 
U = —1.1996; 


.6178; 
V = 3.3158. 


Doubling the number of points decreases the error by a factor of 
only 5. The high order of the truncation error is partially obscured 
by the rounding error, which is larger for c = 10 than for c = 1. 

Example (II). Steady Two-dimensional Supersonic Flow. 
(See Jeffrey and Taniuti [3, p. 76].) The single second-order equa- 
tion 


(P—2")G22 — 2erGy~ey + (?—¢9,) oy = H (4) 


is hyperbolic if ¢-? + ,? > c?. Set H = —4y, exp (2), so that 
o(X, Y) = 2exp (X) sin (Y) is a solution of (4). Then (4) is hyper- 
bolic force = 1 if X > ln (0.5). 

Letting U = gz, V = ¢,, (4) becomes 


(1—U?)U, — UV(U,y+Vz) + (1—V2)Vy = —4U exp (2X) 
U, — Vz = 0. 


Let the initial data given along Y = 0,0 < X < 1 be U(X, 0) = 0, 
V(X, 0) = 2 exp (X). Then throughout the cone the exact solution 
is U(X, Y) = 2ezp (X) sin (Y), V(X, Y) = 2 exp (X) cos (Y). 
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This problem was solved using 81 datum points on the initial 
curve X; = (j—1)/80; Y; =0; U; =0; Vj =2exp (U3); j=1,°°°, 
81. By calling CHARAC 20 times, the following values were com- 
puted for the apex of the characteristic cone: 


X = 1.6130; Y = 1.1576; 

U = 9.1980; V = 4.0184. 
The correct values for the apex are 

X = 1.6144; Y = 1.1580; 

U = 9.2057; V = 4.0312. 


Using 81 datum points on the initial curve but not applying extrap- 
olation, the computed values were 


X = 1.5889; Y = 1.1418; 
U = 9.0441; V = 3.7319. 


Thus extrapolation significantly improved the results. 

By plotting the characteristic grid points in the X-Y plane, one 
sees that the characteristics become more parallel near the apex. 
Thus the above problem is ill conditioned. If the initial curve is 
chosen as Y = 0, 1 < X < 2, the problem becomes so ill condi- 
tioned that the method fails for 81 datum points on the initial 
curve. 


Example of use. In the following listing TEST CH sets up the 
initial data and makes the necessary calls to CHARAC to solve 
Example (11) for 81 initial datum points. CH COEF computes the 
coefficients A; = 1 — U?, A2= —UV, Az; = — UV, As = 1 — V?, 
H, = —4U exp (2X), B: = 0, B. = 1, Bs = —1, Bs = 0, H2 = 0 
as determined from (5). 
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Remark on Algorithm 392 [D3] 

Systems of Hyperbolic P.D.E. 

{Robert R. Smith and Dennis McCall, Comm. ACM 13 
(Sept. 1970), 567-570] 


Michael J. Frisch [Recd. 27 Jan. 1971] 
University Computer Center, University of Minnesota, 
Minneapolis, MN 55455 


Key Words and Phrases: ANSI Fortran standard 
CR Categories: 4.0, 4.22 


The following items were found during compilation of the 
algorithms written in Fortran published to date in Communica- 
tions. The MNF compiler written at the University of Minnesota 
for CDC 6000 Series machines by Lawrence A. Liddiard and E. 
James Mundstock was used to check the validity of the algorithms. © 


Algorithm 392 does not conform to the standard in subroutine 
CHARAC in which at six statements before the statement num- 
bered 145, the variable dimension M of the array DATA is re- 
defined during execution contrary to Section 7.2.1.1.2. 
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ALGORITHM 393 

SPECIAL SERIES SUMMATION WITH ARBITRARY 
PRECISION [C6] 

S. Kamau ABDALI* (Recd. 23 June 1969 and 9 Mar. 1970) 

University of Wisconsin, Department of Computer 
Sciences, Madison, WI 35706 


* This work was done while the author was at the University of 
Montreal, Montreal, Canada. 


KEY WORDS AND PHRASES: 
summation, approximation 
CR CATEGORIES: 5.12, 5.13 


function evaluation, series 


procedure series (places, terms, base, digit, sgn, numerator, de- 
nominator, num0, denom0); value places, terms, base; integer 
places, terms, base, sgn, num0, denom0; integer array digit; 
integer procedure numerator, denominator; : 

comment Programs for very precise summation of series are 
conventionally written in machine language and employ multi- 
precision routines to perform arithmetic on especially defined 
multiword registers. The present algorithm requires only integer 
arithmetic and can be implemented in any algebraic language. 
It is applicable to series in which the ratios of successive terms 
can be expressed as quotients of given integers or integer func- 
tions of term positions. 

The sum of a given series is computed to a given number of 
places, places, in a specified base for representation, base. The 
number of terms needed, terms, should be calculated outside the 
procedure. Procedures numerator and denominator are to be 
obtained from the fraction ith term/(z—1)-th term, expressed as 
aratio of two integer functions of 7. (That fraction should prefer- 
ably be reduced to its lowest terms.) nwm0 and denom0 are the 
integer numerator and denominator of the Oth term. The out- 
puts of the procedure are the sign of the result, sgn, the integer 
part, digit [0], and the digits of the fractional part, digit [1],--° , 
digit [places]. 

For example, one way to compute sin 0.6 = .6 — .68/3! + .65/5! 
— +--+ correct to 1000 decimal places is to call series with the 
parameter values: terms = 226, num0 = 3, denom0 = 5, (and 
since 7th term/(¢—1)th term = —.62/2i(2i+1)) numerator(i) = 
—9 and denominator(z) = 50i(2i+1). By taking base = 100000 
and places = 200, five decimal digits of the result will be obtained 
per word of the array digit. 

The use of a large base (and, consequently, smaller places) 
results in faster computation, as the number of operations is 
proportional to (placesXterms) for large values of terms and 
places. However, the intermediate products (baseXnum[i]X 
coef|i]) (and coef[i] can almost equal denom[it]) should not ex- 
ceed the largest number representable by an integer variable. 
Also within this limit should be the product of base and the in- 
teger portion of the result; 

begin 
integer i, j, k, 1; integer array num[—1:terms], denom, 
coef|0: terms]; 
comment Express the series by the expression 


no Lt ne 
(at 2(a+ = +2) ---)) (1) 


where n,; and d; are positive and c; are +1. (For short, n, d, c 


NE ee i 
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and ¢in (1) stand for num, denom, coef and terms, respectively) ; 
num[—1] := 1; num[0] := abs(num0}; denom[0] := abs- 
(denom0); coef[0] := sign(num0) X sign(denom0); 
for j := 1 step 1 until terms do 
begin 
k := numerator (j); 
denom[j] := abs(l); 
end; 
comment Calculate digits one at a step by extracting the in- 
teger part of base X (1) and restoring the fractional part in 
form (1); 
for 7 := 1 step | until places do 
begin 
l:=0; 
for j := terms step —1 until 0 do 
begin — 
k := numl[j] X (coef[j]<Xbase+l); 1 := k + denom{[y); 
coef[j] := k — 1X denom{[j];  num{j] := num[j—1] 
end j; 
digit{i] := 1 
end 7; 
comment Some digits may be negative or larger than base in 
absolute value. Process the array digit to obtain true base 
representation; 
L:=0; 
for i := places step —1 until 1 do 
begin 
k := digit{t] + 1; 
if digit{t] < 0 then 


| := denominator(j); num[j] := abs(k); 
coef[j] := coef[j—1] X stgn(k) X sign(l) 


lL:= k + base; digit{i] := k — base X 1; 


begin digii[i] := digit[i{] + base; 1:=1—1end 
end; 
digit(0] := 1; sgn := signi(l); 
if 1 < 0 then 
begin 
digit(0] := —l —1; digit[places] := digit[places] — 1, 


for 7 := 1 step 1 until places do digit[1] := base — 1 — digit{t] 
end 
end series 


Remark on Algorithm 393 
Special Series Summation with Arbitrary Precision [C6] 
[S. Kamal Abdali, Comm. ACM 13 (Sept. 1970), 570] 


Arthur H.J. Sale 
Basser Department of Computer Science, University of 
Sydney, NSW 2006, Australia 


Key Words and Phrases: function evaluation, series summation, 
approximation 
CR Categories: 5.12, 5.13 


Algorithm 393 has been tested on a number of different series, 
including those for e? and sin(x) and the harmonic series, and in 
all cases it gave the expected results. Some remarks should however 
be made concerning this algorithm. 


COLLECTED ALGORITHMS (cont.) 


This algorithm is a slight generalization of a method first 
described in the reference given here in which it was used to pro- 
duce an accurate approximation to the transcendental number e. 
AS noted in that reference the digits computed when expanding the 
e-series are correct as produced, and need no subsequent process- 
ing. This technique is very well suited to this application. 

As the author correctly states some types of series will allow 
negative digits to be computed, or digits which exceed the value of 
the chosen base. The series for si(x) can give rise to the first case, 
for it contains negative as well as positive terms; the second case 
can arise if the remnant series is not always fractional (and will 
always occur if the value of the original series has an integer part). 
To illustrate this the first few terms of the harmonic series may be 
summed: . 


WtWMtEUt 3+ = 1.45000... 


which using a base of 10 produces the digits 14, 4, 10. This means 
that the answer returned by the algorithm is not necessarily correct 
to the number of places requested either in a truncated or rounded 
sense. This is particularly important if it is possible that the 
(i +1)-th term is greater in magnitude than the ith term, for then 
the final remnant series (which is of course the truncation error) 
may have.a large value. 

The author too has not sufficiently emphasized the problem of 
integer overflow. Intermediate results produced can be quite large, 
and for example the evaluation of the above mentioned few terms 
of the harmonic series generated an intermediate value of 100 
(with a base of 10). Reversing the order of the terms gave a worse 
result: a value of 378 was generated, which even exceeds the bound 
given by the author of the algorithm. The implications of this are 
that considerable care must be taken to choose a base that is not 
too large, and that the technique may be restricted in application 
by the size of common computer words. For example to evaluate 
sin(0.999) (given to three decimal places), using 100 terms and a 
base of 10, would appear to require an integer range of about 10" 
by the author’s bound, which is certainly beyond the capacity of a 
32 bit machine. 

To summarize, this technique is fairly specialized; it is not 
suitable for summing series whose values have large integer parts, 
and care must be taken in applying it to an arbitrary series. 


References 
1. Sale, A.H.J. The calculation of e to many significant digits. 
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393-P 2- 


0 


COLLECTED ALGORITHMS FROM CACM 


ALGORITHM 394 

DECISION TABLE TRANSLATION [H] 

Rosert B. Diat (Recd. 31 Oct 1969 and 8 May 1970) 

Alan M. Voorhees and Associates, Inc., McLean, Vir- 
ginia, and Department of Civil Engineering, University 
of Washington, Seattle, WA. 98105 


KEY WORDS AND PHRASES: Decision table, decision table 
translation 
CR CATEGORIES: 4.19 


integer procedure decitable(t, m, n, test, yes, no); value m, n; 

comment This algorithm converts the limited-entry decision 
table stored in the m by n matrix ¢ into a machine processable 
test-and-branch code matrix returned in the column vectors 
test, yes, and ne. The input decision table’s format and terminol- 
ogy generally agree with that introduced in Pollack [1]. The rows 
of ¢ represent the decision table’s conditions, its columns, its 
rules. Each of its entries represents a Y (truth), or an N (falsity), 
or a — (indifference). The output code matrix tabulates a 
decision tree, which can be traced to ascertain efficiently which 
rule any given transaction satisfies. Intended for use by a com- 
puter, this code matrix can readily drive an interpretive routine, 
or it can easily be transformed into code in some specified lan- 
guage. An example of a test-and-branch code matrix appears be- 
low in Figure 2. Figure 1 is the input decision table which gen- 
erates it, and Figure 3 is the decision tree it represents. 


Test-and-Branch 


i rf yes[i) | nolél 
Rule oer ae 

1 4 2 5 

Ri R2 R3~ R4 2] 2 —3 3 

a a a 3 3 4 —4 
Condition | Cl Y N —~— — 4; 1 —1 0 
c2;N — Y N 51 1 6| —2 

C3 Y — — N 6 3 7 0 

C4;/— N Y Y 7 2 0 —1 
Fig. 1. Decision Table Fie. 2. Code Matrix 


Each row of the code matrix in Figure 2 corresponds to a non- 
terminal, decision node in the tree in Figure 3. These row num- 
bers have been posted alongside the nodes in Figure 3. The root 
node corresponds to row 1, and the first condition to be tested 
is C4, indicated by the 4 in test[1]. In general test[z] contains the 

condition (decision table row) number to be tested at node 7. 

yes|t] and no[t] specify subsequent alternative actions selected 

on the basis of the result of testing condition fes¢(z]. yes[7] is an 
integer telling what to do if condition tesi[z] is true. Its interpre- 
tation depends on its relationship to zero: 

1. If yes[i] is positive, then the next thing to do is perform the 
test-and-branch given in row yes[t] of the code matrix. This is 
equivalent to moving down one ply in the decision tree via the 
“true arc’”’ to enter another decision node. 

2. If yes{t] is negative, no more testing is necessary; Rule 
abs(yes|i]) has been satisfied. This is equivalent to encounter- 
ing terminal rule node in the decision tree. In typical appli¢a- 
tions, a procedure would be invoked to perform the actions 
corresponding to Rule abs(yes[z]). 

3. If yes{z] is zero, then testing is complete; no rule can be satis- 
fied. In this case a terminal node is reached which indicates 
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that none of the decision table’s rules is satisfied. The action(s) 

corresponding to the ‘‘Else-rule’’ would be invoked here. 

The interpretation of no[7] is identical to that of yes[z], apply- 
ing to the case where the result of testing Condition test[i] is 


false. 


The algorithm’s technique is due to Pollack, who explains it 
in fine tutorial manner [2]. Another excellent discussion is given 
by Press, who provides additional insights and refinements [3]. 
In brief, the procedure selects a row of the decision table and 
bifurcates the table into two decision subtables from which the 
selected row is excluded. One subtable contains only rules 
(columns) for which the selected row’s condition may be true 
(Y or —). The other subtable contains only rules for which the 
condition may be false (N or —). This splitting is recursively ap- 
plied to each subtable (which is at least one row smaller than the 
parent table) until a ‘‘degenerate’’ subtable results. If the de- 
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generate subtable has no rows or is composed of only dashes, 
then a rule is satisfied and noted. If the degenerate subtable has 
no columns, then the Else-rule is in effect. 

In the Algol below, the author attempts to provide code which 
would allow a flexible and practical implementation. Computa- 
tional efficiency is traded off for storage conservation and ease of 
modification. No local arrays are declared in recursive routines. 
The decision table’s manipulation and subtable ‘‘creation’’ are 
effected by sorting the global row and column index arrays, row 
and col. The algorithm never modifies or reproduces any part of 
the original copy of the input decision table. 

To facilitate user control of the desired attributes of the out- 
put decision tree, the routine which selects the condition row 
on which to split the table is made a separate procedure. To im- 
pose his own criterion for row selection, the user can easily 


- modify or substitute code in the procedure select. His procedure 


generally depends on the kind of code matrix he wants. For ex- 
ample, if storage were a problem, he would want the shortest 


‘code matrix, i.e. a tree with fewest decision nodes. On the other 
‘hand, if execution time of the code matrix were of prime impor- 


tance, he would want to minimize the expected number of exe- 


COLLECTED ALGORITHMS (cont.) 


cuted decision nodes. In general, each of these criteria does not 
yield the same select procedure. The procedure below uses a 
criterion given in [2]. Others may be found in [2 and 3]. 

The syntax of Algol 60 does not allow strings such as “‘Y’’, 
“N’’, and ‘““—”’ to be elements of an array such as the decision 
table matrix ¢. Thus in the code below, the local variables N, D, 
and Y contain the integers —1, 0 and 1 to represent respectively 
the characters ‘“‘N’’, ‘““—’’, and “Y’’. Accordingly, the input 
decision table must also follow these conventions, or the user 
must appropriately modify the three assignment statements 
which establish the value of these local variables. 

The author thanks the referee and the editor for their valuable 
observations. 
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begin 
own integer array row(1:m], col[1:n]; 
own real array cc{l:1i; own integer N, D, Y, line; 
integer 7; 
integer procedure select (i, rows, first, last); 
comment This vrocedure picks a row of the decision (sub)- 
table detined by the row indices row[1], row[2], --+ , row[rows] 
and the eoluran iodices col[first], collfirst+1], --- , col{last]. 
The criterior. is a intimal “dash count’’, with the difference 
between the sumber of Y’s and the N’s to be minimized in 
ease of a tie. \ short code matrix should result [2]; 
value rows, first, last; 
begin 
integer 7, j, amin, delta, deltamin; real dash, dmin; 
dmin := (last—first+1) X (27 rows); imin := 0; 
for j := first step 1 until last do 
begin 
comment Calculate column count; 
cec{col[j]] := 1; for 2 := 1 step 1 until rows do 
if {rowl?}, collal] = D then cc[col[j]] := 2 X cc[col[j]] 
else imin := 1 
end: 
if imin ~ 0 then fori := 1 step 1 until rows do 
begin 
comment Calculate dash count; 
dash :== delta := 0; for j := first step 1 until last do 
if t{row[t], col[j]] = D then dash := dash + cc{col[j}] 
else delia := 
- delta + (if t[row[2], col[j]]=Y then 1 else —1); 
if dash < dmin \V (dash=dmin/\abs (delta) <deltamin) then 
begin 
comment Row 7 has the smallest dash count so far; 
tmin := %;dmin := dash; deltamin := abs (delia) 
end 
end; 
select := imin 
end select; 
procedure left (t, row, first, last, key, lyp, ldp); 
comment This procedure creates the two subtables described 
above with respect to condition row by rearranging the column 
indices coll[first], col[first+-1], -++ , col[last] based on the con- 
tents of dfrow, colffirst]], --° , t[row, col{last]]. Upon return, 
col[first] up to col[lyp—1] contain all the column indices 7 such 
that é[row, j] = Y. col[lyp] up to col[ldp—1] return the indices 
j such that ¢[row, j] = Dj, and col[ldp) up to col{last] have the 
indices such that t[row, j] = N. Thus the two subtables are 
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defined by the indices col[first], +++ , col[idp] and col[lyp], ++: : 
col{last]. left is executed twice for each external reference, 
First it places all the ‘“Y’’ columns at the far left. Second it 
calls itself to push all the ‘‘—-’’ columns to the right of the last 
‘“‘y”? column. The parameter key contains the code for ‘“‘Y”’ 
or ‘‘—’’ to indicate which character is being matched; 
value row, first, last, key; 
begin 
integer 2, j, temp; 
4 := first;j := last; 
fori := 7 while < j doif irow, col[t]] = key then 1 :=7+1 
else 
begin 
for j := j while ¢{row, col[j]] # key Ai <jdoj:=j-—1; 
temp := col[i]; col[i] := col[j]; col[j]:= temp; g:=j7—1 
end; 
lyp := 7; if key + D then left (t, row, 7, last, D, ldp, lyp) 
end left; 
integer procedure split (t, rows, first, last, test, yes, no) 
comment This procedure recursively bifurcates the nonde- 
generate decision subtable defined by the row indices row(1], 
-*+ , row[rows] and the column indices col[first], ++ , col[lasé). 
The global parameter line determines the position of the, 
code matrix into which split enters test-and-branch data. 
The procedure ‘‘creates’’ subtables from which the selected 
condition row is deleted by swapping the selected condition 
row index with the last row index, reducing the rows counter 
by 1, and having procedure left rearrange the column indices. 
If the input table has no rows, then split returns zero indicat- 
ing the Else-rule. If the table is entirely dashes or has no 
columns, then split returns the value —col[first], indicating a 
terminal, rule node. Otherwise split places the next condi- 
tion to be tested as a decision node into ftest[line+1] and 
calls itself for the corresponding subtables; 
value rows, first, last; 
begin 
integer mine, imin, lyp, ldp; 
mine := 0; if first < last then 
begin 
imin := select (t, rows, first, last) ; 
if imin = 0 then 
begin 
mine := —coll[first]; if jirst ¥ last then 
begin 
outstring (1, ‘Following rules are redundant:’); 
for i := first step 1 until last do outinteger (1, coll[z)) 
end; 
end else 
begin 
mine := line := line + 1; test[mine] := row[imin]; 
rowlimin] := row[rows]; 
left(t, test{mine], first, last, Y, lyp, ldp); 
yes[mine] := split (t, rows—1, first, ldp—1, test, yes, no); 
comment Restore column indices rearranged in 
recursion; 
left(t, test[mine], first, ldp—1, Y, lyp, ldp); 
no[mine] := spltt(t, rows-—1, lyp, last, test, yes, no); 
row[imin] := test[mine] 


end 
end; 
split := mine 
end split; 


for 7 := 1 step 1 until m do row[t] := 7; 

for 7 := 1 step 1 until n do, col[t] := 7; 

N:=—-1; D:=0; Y:=1; line:=0; 

4 := splii(t, m, l, n, test, yes, no); 

decitable := line; 

comment The value of decitable is the length of code matrix; 


end decitable 


COLLECTED ALGORITHMS (cont.) 


Remark on Algorithm 394 [H] 
Decision Table Translation [R.B. Dial, Comm. ACM 
13 (Sept. 1970), 570] 


D.R.T. Marshall [Recd. 3 Mar. 1971] 
Data Processing Department, University of Waterloo 
Waterloo, Ontario, Canada 


Key Words and Phrases: decision table, decision table translation 
CR Categories: 4.19 


The first comment of procedure sp/it has the words ‘‘columns’’ 
and “row’’ transposed in sentences four/five. It should read “If 
the input tables has no columns, then sp/it returns zero,.... 
If the table is entirely dashes or has no rows, then split,.... 

The statement in the main procedure invoking the procedure 
split uses a variable “/’’, which is not defined. 

This variable should be initialized to establish the “first” 
column in the array to be processed. This would, of course, nor- 
mally be set to one. 

The writer has programmed and executed the algorithm suc- 
cessfully in PL/I with the above noted changes. 
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COLLECTED ALGORITHMS FROM CACM 


ALGORITHM 395 

STUDENT'S t-DISTRIBUTION [S14] 

G. W. Hit (Reed. 17 Nov. 1969 and 23 Mar. 1970) 

C.S.I.R.0., Division of Mathematical Statistics, Glen 
Osmond, South Australia 


KEY WORDS AND PHRASES: Student’s t-statistic, distribu- 
tion function, approximation, asymptotic expansion 
CR CATEGORIES: 5.12, 5.5 


real procedure student (t, n, normal, error); value t, n; real t, n; 
real procedure normal, error; 

comment student evaluates the two-tail probability P(¢ |) 
that ¢ is exceeded in magnitude for Student’s [1] ¢-distribution 
with n degrees of freedom. The procedure provides results accu- 
rate to 11 decimal places and 8 significant digits for integer val- 
ues of n, with approximate continuation of the function through 
noninteger values of n (over 6 decimal places for n > 4.3). 

The procedure normal (x) returns the area under the standard 
normal frequency curve to the left of x, so that a negative argu- 
ment yields the lower-tail area. The user-supplied procedure, 
error(n), should produce a diagnostic warning and may go to a 
label, terminate, or return a distinctive value (zero or —1.0) as 
a signal of error to the calling program. 

Student’s series expansion of the probability integral is sup- 
plemented by a faster asymptotic approximation for large values 
of m and by a more precise ‘“‘tail’’ series expansion for large 
values of ¢. 

The value of x, defined as the normal deviate at the same 
probability level as ¢, may be approximated by an asymptotic 
normalizing expansion of Cornish-Fisher type [2]. 


x = 2+ (29+8z)/b — (427+33254-2402?-+855z) /10b? 
+ (642114-7882°-+-98012?+-897752°-+-5433752*-+ 1788885z)/210b*— - - - 


where z = (aXIn(1+?2/n))!, a = n — $ and b = 48a? [3]. 
This is well approximated by the first three terms with the third 
term’s divisor replaced by 


10b (6+-0.82z4+100). 


The student probability is double the normal single-tail area, 
corresponding to the deviate x. 

The maximum error in the probability result for all values of ¢ 
is displayed as a function of n in Figure 1, for this approxima- 
tion, for the first few terms of the asymptotic expansion and for 
Fisher’s [4] fifth-order approximation used in Algorithm 321 [5] 
for n > 30. 

For small n and moderate ¢ the result is calculated as P(t |n) = 
1 — A(é|n) using Student’s cosine series for A (é | 7), rearrang- 
ing formulas 26.7.3 and 26.7.4 of the NBS Handbook [6] in 
nested. form 


oe (n—5) 
(n—4)b 


(n—3) )' 
= eR} is t : 


where y = +/(t?/n) and 6 = 1 + #?/n. In the nested form, terms 


A(é|n odd) = 2[ arctan +5 {i ae = P 


SUES) gg ke”) 
A(t|n even) = S970) {3 + z{ (nb {1 
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are treated in reverse order to the summation in Algorithm 321 
and Algorithm 344 [7], reducing the number of operations re- 
quired and reducing build up of roundoff error. Explicit decre- 
menting of the “loop”? parameter ensures that its final value 
remains defined on exit from the loop for use in an odd/even test. 

Execution times for Fortran versions run on a CDC 3200 with 
programmed floating point are displayed in Figure 2, which 
indicates that nesting decreases the time for the cosine series 
method by about 30 percent and that it is appropriate to change 
over to the asymptotic method (using Algorithm 209 [8] for 
normal) when n > 20. Although this approximation would be 
accurate to more than 11 decimal places, the use of Algorithm 
209 limits accuracy to about 9 decimals. This accuracy may be 
sufficient for many applications, in which case student may be 
abbreviated by deleting lines 15 and 27 through 35, removing 


MAXIMUM ERROR IN PROBABILITY VALUE 


48 6 8 10 4 2 
DEGREES OF FREEDOM 1 
Fie. 1. Maximum error of approximations for ‘Student’s”’ 
t-probability: 1, 2, and 3 term expansion, approximation with 
adjusted divisor, and Fisher’s 5th order approximation 
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Fie. 2. Execution times (CDC3200 with programmed floating 


point). Broken lines: ‘‘tail’’ series for selected values of ¢ (upper 
left); asymptotic method using precise normal (right) 


COLLECTED ALGORITHMS (cont.) 
the declaration and assignment of z from line 3, replacing line 
5 by 
if n > entier(n) V n > 20 then 
and replacing line 25 by 
student := if a > 1.0 then 0.0 else 1.0 — a 


The latter avoids spurious negative results due to roundoff 


error when a is near 1 for large values of ¢. The storage required’ 


for this abbreviated version was a little less than for Algorithm 
344 and less than half that for Algorithm 321. 

Applications such as production of tables or function inversion 
to obtain extreme quantiles may require greater precision at 
extreme probability levels than these methods provide. For the 
cosine series and the asymptotic approximation using a high 
precision procedure for normal, such as Algorithm 304 [9], the 
relative error in the result increases in magnitude as the result 
decreases to extremely small values, as illustrated in Figure 3. 


ASYMPTOTIC 
APPROXIMATION =20 


PROBABILITY LEVEL P 
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time for large ¢ beyond a probability level near 10-*. For the 
asymptotic method, using for normal a higher precision pro- 
cedure based on Algorithm 304, the execution times for different 
values of the argument approach those shown at the right of 
Figure 2. Averaged over a range of arguments arising in practice, 
the provision for higher precision more than doubles the time 
required. In the case of Smirnov’s [10] 6D tables of S(t|n) = 
1— 0.5 X P(|n), retabulation to 10D, using the more precise 
procedure for normal, increased the time from about 7 minutes 
to 12 minutes, while introducing the tail series method to tabu- 
late P(t | n) over the same range to 8 significant digits increased 
the time further to about 16 minutes. Use of the asymptotic 
approximation enabled Smirnov’s 6D tables of y(t | 1000/é), 
which is an approximate continuation of S(¢|n) over non- 
integer values of n = 1000/é, to be extended to 10D for = 0(2)30 
in 5 minutes, and permits continuation to é = 200 with over 6D 
accuracy as indicated in Figure 1. 
The preparation of diagrams by Murray C. Childs is gratefully 
acknowledged. 
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expansion of P(t |) in terms of w = 1/sqri(1+??/n), if aS 1a Shida Seon) -clee 


1 ixw 1X3xXw! begin 
P nee a — eoe 
(t{n) = C(n) X w (2 + Bn4+2) 2x Ant) ss \, reala,b,y,z; 2 := 1.0; 


t:= tf 2; y:=t/n; b:=10+4+y; 
if n > entier(n) Vn > 20At<2nV n> 200 then 
‘begin 
comment Asymptotic series for large or noninteger n; 
if y > »:—6 then y := In(b); 
a:=n—05; b:=48.0Xaft 2; y:=axy; 
' : y := (((((—0.4Xy—3.3) Xy—24.0) Xy—85.5)/ 
The cosine series method loses precision mainly in the sub- (0.8Xy T 2+100.0+b)+y+3.0)/b+1.0) Xsqrt(y); 
traction 1 — A(t | 7) as well as from the sqrt procedure and arctan student := 2.0 X normal(—y); 
when n is odd. In the worst case, n = 19, the error is kept below end 
3 decimals by changing to the tail series if t > 2, which ensures 8 else 
significant digits in the result for the 36-bit (about 11 decimal) ifn < 20 A é < 4.0 then 
precision real variables for the processor used. As shown in begin 
Figure 3, change over from the asymptotic method to the tail comment Nested summation of ‘‘cosine’”’ series; 
series when ¢? > n maintains about 8 significant digits in the a@:=y:= sgqrtty); ifn = 1 thena := 0.0; 
result. For a machine of greater precision the use of more terms loop: 
in the asymptotic series may be warranted, and the change over n:=n—2; ifn > 1then 
criteria would need adjustment to balance speeds and precision begin a := (n—1)/(Xn) Xa+y; go to loop end; 
between the three methods. a := ifn = 0 then a/sqrt(b) 
Execution times for the tail series are shown as broken lines else (arctan(y)+a/b) * 0.63661977236; 
in Figure 2 for selected values of ¢: with bounds t¢ > 2 forn < 20, comment 2/7 = 0.6366197723675813430755351 --+ ; 
t? < n for n > 20 and with the limit n < 200 preventing excessive student := z— a 


where C(n) = I'((n-+1)/2)/(/rXI'(n/2)). The series is summed 
till a negligible term occurs and then the factor C(n)Xw* is 
applied using the same repeated loop as the cosine series. Except 
for w near 1 when ¢ is small, the truncation error is small, and 
accumulation of error in the repeated loop is moderate unless n 
is very large. 
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end begin 
else z2:=a; y:=yX G—-1)/0X)j); a@:=a + y/(nty)) 
begin end; 
comment “‘tail’’ series expansion for large ¢-values; ni=n+2; z:=y:= 0.0; a@:= —a; goto loop 
integer j; a := sqrt(b); y:=aXn; j:=0; end 
for j := j + 2 while a # zdo end 


ACM Transactions on Mathematical Software, Vol. 5, No. 2, June 1979, Pages 238-239. 


REMARK ON ALGORITHM 395 


Student’s ¢-distribution [S14] 
[G.W. Hill, Comm. ACM 13, 10 (Oct. 1970), 617-619] 


and 
REMARK ON ALGORITHM 396 


Student’s Quantiles [S14] 
[G.W. Hill, Comm. ACM 13, 10 (Oct. 1970), 619-620] 


Mohamed el Lozy [Recd 9 June 1978] 
Department of Nutrition, Harvard School of Public Health, 665 Huntington 
Ave., Boston, MA 02115 


Both of these algorithms incorporate very accurate mathematical methods, but 
contain a source of loss of precision which is severe for the many processors with 
precision less than or not sufficiently greater than that claimed for the algorithms. 

In Algorithm 395 the use of the asymptotic series involves the evaluation of 
In(1 + t?/n). For small y = ¢?/n and b = 1+ y, In(d) is of the order of magnitude 
of y, so that the statement 


if y > 10° then y := In(d) 


admits a loss of precision of up to 6 decimal digits. This loss will be especially 
marked on a machine with hexadecimal number representation, since the leading 
byte in 1 + y will be hexadecimal 1, or binary 0001, with a loss of a further 3 bits, 
in addition to the loss inherent in the addition. Where the processor’s implemen- 
tation of In(b) for 6 near 1 effectively involves the Taylor series (6b — 1) -- (6 — 
1)?/2 + ..., the replacement statement 


if b # 1 then y := y X (In(b)/(6 — 1)); 


as in IMSL’s subroutine MDTD [1], counteracts the loss of precision in evaluating 
the logarithm as evidenced by column 3 of Table I. However, in the general case 
there are two solutions, the simplest of which is to evaluate Y = DLOG(1.0D0 
+ DBLE(Y)), using the variable Y (single precision) for ¢t?/n, as in the algorithm 
under discussion. An alternative method might be based on the use of single 
precision LOG(1.0 + Y) for “sufficiently large” Y, and a suitable number of terms 
of the Taylor expansion otherwise. In this case the optimal crossover point 
between the two methods of evaluation would be machine dependent and the 
coding would be longer, as exemplified for an analogous case in Algorithm 465 
[2]. 

In Algorithm 396 the expression exp(x”/n) — 1 occurs, and here again substan- 
tial loss of precision can occur for small y, to use the algorithm’s notation. 
Admitting a loss of precision of up to nearly 3 decimal digits, this algorithm shifts 
to a Taylor series expansion of exp(y) — 1 for y < 0.002, but this choice is machine 
dependent and unsuitable for 32-bit machines. Here again I would opt for double 
precision evaluation of that one expression (storing the result in single precision) 
over the alternative Taylor series approach. 


The Remark on Algorithm 396 was supported by the Fund for Research and Teaching, Department 
of Nutrition, Harvard School of Public Health. 
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end begin 
else z2:=@; y:=y X G-1)/OXj); a:=a+ y/(nt)j) 
begin end ; 
comment ‘‘tail’”’ series expansion for large ¢-values; ni=n+2; z2:=y:= 0.0; a:= —a; goto loop 
integer j; @ := sqrt(b); y:=aXn; j:=0; end 
for j := 7+ 2 whilea ¥ zdo end 
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REMARK ON ALGORITHM 395 


Student’s f-distribution [S14] 
[G.W. Hill, Comm. ACM 13, 10 (Oct. 1970), 617-619] 


and 
REMARK ON ALGORITHM 396 


Student’s Quantiles [S14] 
[G.W. Hill, Comm. ACM 13, 10 (Oct. 1970), 619-620] 


Mohamed el Lozy [Recd 9 June 1978] 
Department of Nutrition, Harvard School of Public Health, 665 Huaritineton 
Ave., Boston, MA 02115 


Both of these algorithms incorporate very accurate mathematical methods, but 
contain a source of loss of precision which is severe for the many processors with 
precision less than or not sufficiently greater than that claimed for the algorithms. 

In Algorithm 395 the use of the asymptotic series involves the evaluation of 
In(1 + ¢?/n). For small y = t?/n and b = 1 + y, In(bd) is of the order of magnitude 
of y, so that the statement 


if y > 10°° then y := In(d) 


admits a loss of precision of up to 6 decimal digits. This loss will be especially 
marked on a machine with hexadecimal number representation, since the leading 
byte in 1 + y will be hexadecimal 1, or binary 0001, with a loss of a further 3 bits, 
in addition to the loss inherent in the addition. Where the processor’s implemen- 
tation of In(b) for 6 near 1: effectively involves the Taylor series (b — 1) — (6 — 
1)?/2 + ..., the replacement statement 


if b ~ 1 then y := y X (In(b)/(b — 1)); 


as in IMSL’s subroutine MDTD [1], counteracts the loss of precision in evaluating 
the logarithm as evidenced by column 3 of Table I. However, in the general case 
there are two solutions, the simplest of which is to evaluate Y = DLOG(1.0D0 
+ DBLE(Y)), using the variable Y (single precision) for ¢?/n, as in the algorithm 
under discussion. An alternative method might be based on the use of single 
precision LOG(1.0 + Y) for “sufficiently large” Y, and a suitable number of terms 
of the Taylor expansion otherwise. In this case the optimal crossover point 
between the two methods of evaluation would be machine dependent and the 
coding would be longer, as exemplified for an vauniogous case in Algorithm 465 
[2]. 

In Algorithm 396 the expression exp(x?/n) — 1 occurs, aid here again substan- 
tial loss of precision can occur for small y, to use the algorithm’s notation. 
Admitting a loss of precision of up to nearly 3 decimal digits, this algorithm shifts 
to a Taylor series expansion of exp(y) — 1 for y < 0.002, but this choice is machine 
dependent and unsuitable for 32-bit machines. Here again I would opt for double 
precision evaluation of that one expression (storing the result in single precision) 
over the alternative. Taylor series approach. 


The Remark on Algorithm 396 was supported by the Fund for Research and Teaching, Department 
of Nutrition, Harvard School of Public Héalth. 
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Table I. Relative Errors in the Calculation of In(1 + ¢?/n) and exp(x’/n) — 1 by the 
Methods of Algorithms 395 and 396, for x = ¢ = 2 and Various Values of n 


In(1 + t?/n) exp(x?/n) — 1 

n PDP IBM IMSL/IBM PDP IBM 
20 0.245E—6 0.654E—6 0.0 0.538E—6 0.242E—5 
40 0.313E—6 0.500E—5 0.0 0.708E—6 0.567E—5 
80 0.137E—5 0.149E—4 0.299K—6 0.203E—5 0.118E—4 
160 0.754E—6 0.151E—4 0.0 0.177E—5 0.311E—4 
320 0:817E—5 0.150E—4 0.0 0.281E—5 0.349E—4 
640 0.688E—5 0.909E—4 0.0 0.187E—4 0.178E—4 
1280 0.201E—4 0.244E—3 0.298E—6 0.153E—4 0.282E-—3 
2560 0.224E—5 0.244E-—3 0.149E—6 0.372E—6 0.447E—6 
5120 0.700E—4 0.244E-—3 0.0 0.745E—7 0.298E—6 


10240 0.104E-—3 0.164E—2 0.0 0.745E—7 0.0 


Table I shows the relative errors of single precision evaluation of these two 
expressions for ¢ (or x) equal to 2 and for various values of n, using the first two 
terms of the Taylor series for the exponential for y < 0.002 as in the algorithm, as 
well as the IMSL “fix.” The computations were done on an IBM 370/168 running 
under OS/MVT and on a PDP 11/70 running under UNIX. Though both 
machines have a mantissa of 24 bits, the results on the PDP are far better than 
those on the 370, presumably due to the hexadecimal normalization of the latter 
machine. 
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REMARK ON ALGORITHM 395 


Student’s ¢-Distribution [$14] 
[G. W. Hill, Commun. ACM 13, 10 (Oct. 1970), 617-618. ] 


G. W. Hill [Received 6 December 1978; revised 7 July 1979; accepted 6 August 
1979] 
Division of Mineral Chemistry, CSIRO, Port Melbourne, Australia 3207. 


The precision loss noted in [1], in the evaluation of In(1 + ¢?/n) for Algorithm 
395, exceeds the margin of precision of the 36-bit processor over the eight 
significant decimal digits target mentioned in the algorithm. A suitable correction 
for this case is the replacement (recall that y = ¢?/n and b = 1 + t?/n) of line 8 of 
the procedure body by 


if y > 0.01 then y := /n(b) 
else y := ((—y X 0.75 + 1.0) X ¥/3.0 —0.5) X yx y+ y; 


However, when extended precision is required [2, 4], a number of details of the 
algorithm must be changed. A more generally applicable replacement of line 8 
imitates a technique in Algorithm 465 [3]. 


z:=t:= y; if y > cmax then y := In(bd) 


else 
for a := 2.0, a + 1.0 while y ¥ b do 
begin z:= —zxt;b=y;y:=2z/a+y end; 


For small y (<cmax say) the precision lost in evaluating In(1 + y) corresponds 
to a relative error about €/y, where e denotes the relative magnitude of processor 
roundoff. The alternative summation of the logarithmic series until the Rth term 
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is negligible, (y"/(R + 1) < ¢), accumulates roundoff error resulting in an average 
relative error of about eVR. The maximum of these relative errors is minimized, 
as in Algorithm 465, by choosing cmax = R~'””, where R is determined for a p-bit 
precision processor by an approximate criterion for neglecting the Rth term; 
cmax®/R = € = 2-”, or equivalently, R/2 + 1 = 2”. For p = 36 the solutions 
R = 16 and cmax = 0.25 imply an approximate relative error about 4e in the 
result. For precision as extended as p = 96, cmax = 0.168 holds this precision loss 
to about one decimal digit. 

For each combination of actual parameter values, Algorithm 395 applies criteria 
to select whether to use Student’s cosine series, the asymptotic normal approxi- 
mation, or the “tail” series, in order to achieve 8S (significant decimal digits) 
without excessive loss of speed for the 10.8S processor used. For an extended 
precision version the criteria must be changed to balance precision against speed 
characteristics of the processor used. In the case of double precision to about 29S 
of a CDC 6000-7000 series processor; a target precision of 25S allows for precision 
loss up to four decimal digits, such as occurs in the subtraction of almost equal 
quantities, P(t/n) = 1 — A(t/n), to obtain small tail probabilities using Student’s 
cosine series for A(t/n). The effect of this and other causes of precision loss is 
illustrated in Figure 3 of Algorithm 395. 

Greater precision is achieved in the case of extreme probability levels and large 
n values by the use of the asymptotic normal approximation. To improve precision 
for larger n, it is efficient to extend the normal approximation up to the sixth 
term of the series [2] in terms of z = [(n — 4)In(1 + t?/n)]’” and b = 48(n — 4)’. 


X= zt (2° + 3z)/b — (427 + 332° + 2402° + 855z)/100? 
+ (64z") + 788z° + 98012’ + 897752° + 5433752° + 1788885z) /210b° 
— (11522 + 18896z"° + 3294962" + 4698585z° + 5202792027 
+ 4243031102*° + 23498748002" + 74128304252) /4200b* 
+ (122882"° + 2517762" + 5645776z"® + 1087885202" 
+ 17382754172"! + 224992216352° + 2291922240302’ 
+ 17546111144102z° + 93095490584252* + 28756631378475z) /46200b° — --. 
To achieve at least 25S for n > 100, the sixth term’s divisor is replaced by 
46200b4 (b + 0.435952* + 22° + 537), 


which accounts for a substantial portion of the omitted next terms, in a fashion 
similar to the effect displayed in Figure 1 of Algorithm 395, which also illustrates 
“diminishing returns” in precision gain from additional terms of the series. 
However, the consequent increase in computing time is moderated by the fact 
that two-thirds of the arithmetic operations arise in evaluating the fifth and sixth 
terms, for which single-precision arithmetic and representation of coefficients 
prove sufficient. 

For large enough values of z‘/b = [In(1 + ¢?/n) ]*/48, the asymptotic approxi- 
mation becomes poor or even divergent, so that for such large values of 1 + t?/n 
the tail series in powers of w? = 1/(1 + t?/n) is used and converges rapidly with 
little accumulation of rounding error. The factor P'((n + 1)/2)/ (Va xX I'(n/2)) x 
w” may be evaluated using the same repeated loop as for the cosine series, or by 
using Algorithm 465 to evaluate the frequency function f(¢|n) as a factor for the 
equivalent tail series expansion, 


vnfl 1xw°? 1x3xw' 
ccd (eels geen sca Meg sys ert ace Seo 
JVwl|n 2(n+2) 2x 4(n +4) 


This can improve speed for large n and, since Algorithm 465 is valid for 
noninteger n, permits continuation of the probability integral over noninteger 
values of n down to n = 1 with considerable precision for t? > n; that is w? < 4. 


P(t\n) = 2f(t|n) x 
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In neither form does the series converge well for w near 1; and for small ¢ or large 
n the time required for evaluation, the accumulated roundoff error, and the 
truncation error can increase to unacceptable levels. a 

Where the domains of validity of the three methods overlap, correspondence 
between results of two methods can be used as a basis for determining the error 
level of the third. Where two methods achieve precision exceeding the target, 
counts of instructions or timing tests may be used to select the faster. Increase or 
decrease of the target precision is found to have a marked effect on computing 
time so that some compromise trade-off of precision against speed is required 
according to the particular processor used and the intended application. Reason- 
able speed of execution with precision at least to 23S, but generally 25S or more, 
is achieved for the CDC 6000-7000 series processor by replacing line 5 by (recall 
that ¢ represents t”, the square of the actual parameter value) 


ifn > entier(n) Vn >1000 Vn=100A¢t<0.1x*n-—5 then 


to select evaluation by the six-term asymptotic approximation. Replacement of 
line 15 by 


ifn <100 4 t< 16 then 


selects the cosine series method for smaller values of n and t < 4; the else clause 
evaluates the double-precision tail expansion to obtain a sufficiently precise result 
for smaller probability levels. For continuation extension as outlined in the 
preceding paragraph, the replacements of lines 5 and 15 are 


ifn > 1000 Vn =100 A t<0.1xn—-—5 then 
if mn = entier(n) An<100 4 ¢< 16 then 


Some margin of precision loss from the full precision level of the processor is 
unavoidable due to accumulated roundoff error and is traded off further to 
achieve an acceptable speed of execution. With this reservation the methods of ai 
Algorithm 395 can be extended to provide higher precision results, as evidenced 
by their use in evaluating quantiles to 20D [2]. 


REFERENCES 

1. Ex. Lozy, M. Remark on Algorithm 395. Student’s ¢ distribution. ACM Trans. Math. Softw. 5, 
2 (June 1979), 238-239. 

2. HILL, G.W. Reference Table: “Student’s” {-distribution quantiles to 20D. Tech. Paper No. 35, 
Div. Math. Statist., CSIRO, Australia, 1972, 24pp. 

3. Hitt, G.W. Algorithm 465. Student’s ¢ frequency. Commun. ACM 16, 11 (Nov. 1973), 690. 

4. Linc, R.F. A study of the accuracy of some approximations to ¢, x? and F tail probabilities. -/. 
Amer. Statist. Assoc. 73, 362 (1978), 274-283. 


9 SRR SR Ria a aa ee WN aa em NCI RAEI IO a A A A A RN, ERRAND NG URI SUING IRIE 


COLLECTED ALGORITHMS (cont.) . 395-P 4-0 


Table I. Relative Errors in the Calculation of In(1 + ¢?/n) and exp(x?/n) — 1 by the 
Methods of Algorithms 395 and 396, for x = ¢ = 2 and Various Values of n 


In(1 + ¢?/n) exp(x?/n) — 1 

n PDP IBM IMSL/IBM PDP IBM 
20 0.245E—-6 0.654E—6 0.0 0.538E—6 0.242E—5 
40 0.313E—6 0.500E—5 0.0 0.708E—6 0.567E—5 
80 0.137E-—5 0.149E—4 0.299E—6 0.203E—5 0.118E—4 
160 0.754E—6 0.151E—4 0.0 0.177E—5 0.311E—4 
320 0.817E—5 0.150E—4 0.0 0.281E—5 0.349E—4 
640 0.688E—5 0.909E—4 0.0 0.187E—4 0.178E—4 
1280 0.201E—4 0.244E-3 0.298E-—6 0.153E—4 0.282E-—3 
2560 0.224E—5 0.244E-3 0.149E—6 0.372E—6 0.447E—6 
5120 0.700E—4 0.244E-3 0.0 0.745E—7 0.298E—6 


10240 0.104E—3 0.164E-2 , 0.0 0.745E—7 0.0 


Table I shows the relative errors of single precision evaluation of these two 
expressions for ¢ (or x) equal to 2 and for various values of n, using the first two 
terms of the Taylor series for the exponential for yy < 0.002 as in the algorithm, as 
well as the IMSL “fix.” The computations were done on an IBM 370/168 running 
under OS/MVT and on a PDP 11/70 running under UNIX. Though both 
machines have a mantissa of 24 bits, the results on the PDP are far better than 
those on the 370, presumably due to the hexadecimal normalization of the latter 
machine. as 
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ALGORITHM 396 

STUDENT’S t-QUANTILES [S14] 

G. W. Hit (Recd. 6 Jan. 1970 and 18 May 1970) 

C.S.I.R.0., Division of Mathematical Statistics, Glen 
Osmond, South Australia 


KEY WORDS AND PHRASES: Student’s t-statistic, quantile, 
asymptotic approximation 
CR CATEGORIES: 5.12, 5.5 


real procedure ¢ quantile (P, n, normdev, error); 
value P, 7; real P, n; real procedure normdevy, error; 
comment This algorithm evaluates the positive quantile at the 
(two-tail) probability level P, for Student’s ¢-distribution with 
n degrees of freedom. The quantile function is an inverse of the 
two-tail 


Tn+4) = du 
VJ (an) (an) Je (-bu2/n) ae) 


which is approximated in Algorithm 395 [1] by series whose in- 
verses are used in this algorithm for ¢ quantiles. Test calculations 
to 36-bit precision indicate that the result is correct to at least 
6 significant digits, even for the analytic continuation through 
noninteger values of n > 5. 

The procedure normdev(p) is assumed to return a negative 
normal deviate at the lower tail probability level p, e.g. —2.32 
for p = 0.01. The user-supplied procedure for error(n) should 
give a diagnostic warning that the value of P or n is invalid and 
may go to a label, terminate, or return a distinctive value as an 
error signal to the calling program. 

For n = 1 and n = 2 the exact result of integration is readily 
inverted to yield ¢ = cot(PX7/2) and t? = 2/(P(2Q—P))-2, 
respectively. For larger n an asymptotic inverse expansion 
about normal deviates is applicable, while for smaller values of 
P a second series expansion is used to achieve suffigient preci- 
sion. Both approximations have been adjusted to enhance pre- 
cision for n as low as 3. 

Both methods involve an expansion of the factor 


d/n = 44 Wa (34n)/Thgn + ¥) 
in terms of a = 1/(n—}4) and b=48/a? 
d/n = V/ (amr/2) (1—3/6+94.5/b2—9058.5/b3-+---) [2]. 


A three term approximation uses 6(b+c) instead of b? as a 
divisor, where the coefficients in 


¢ = 96.36 — 16a — 98a? + 20700a?/b, 


P(i|n) = 2 


have been fitted to ensure 8 significant digits in d for n as low 
as 3. 
The inverse asymptotic expansion of Cornish-Fisher type re- 


lates a function y(t) = V[(n—})In(Q1+é/n)] to the normal 
deviate x at the corresponding probability level, P/2: 


y=x — (x2+3x)/b + (4x74+-63x5+360x3+945x) /10b? 
— (64x"+1628x°-+ 19881 x7 + 145719x5+4- 6945752 


+1902285)/210b? + --+ [2], 


whence ¢ = ~/[nX(exp(aXy*)—1)]. For a three term approxi- 
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mation the third term’s divisor is replaced by 
10b X (b-+c—2x—7x?—5x?+0.05XdXx‘), 


whose coefficients have been fitted to reduce the error for small 
n and for larger n and x. For n < 5, c is increased by 0.3(n—4.5) 
(x+0.6) to further reduce error in an interval of P not well 
covered by the following approximation. 

For small P, where ¢?/n is large, the iniegrand may be ex- 
panded in terms of w? = 1/(1+-¢?/n) and integrated term by term 
to yield 


nu (1 wt 1X 3ut 
i paper Fax 44 7 of 


which may be inverted to express ¢?/n in terms of y = (PXd)" 


Deg OE oie 3. 
Sraresit 1+ dh | 3(n-+2)(n-+-6) 


n(n+3)(2n?+9n—2)y* } 


8(n-+2)"(n-+4)%n+8) * 


Since the ratio of successive terms is nearly n X y/(n+6) for 
small n, replacement of the term in y* by y/[3(n+2){(n+6)/ 
(nXy)—1.0}] provides an approximate allowance for subsequent 
terms in the series, which is empirically improved by replacing 
the —1.0 by —0.822 — 0.089 X d. 

As n and P increase, the errors for the asymptotic approxima- 
tion decrease, whereas errors for the second series increase, so 
that for each value of n the error curves intersect at a value of 
P above which the asymptotic approximation is better and be- 
low which the second series should be used. By adjusting the 
two approximations the error level at these intersections has 
been balanced at about the seventh significant digit for n > 3 
and P > 10-*4. The value of y at these points is about a + 0.05 
and this fact provides a convenient criterion for selecting which 
approximation to use: the asymptotic series if y exceeds a + 
0.05, otherwise the second series. 

Although better approximations could be obtained by use of 
more terms in each series, greater precision can be achieved by 
using the result of this algorithm as a starting value for iterative 
inversion of P(t |), whose value and derivative can be com- 
puted with considerable precision using recurrence relations as 
in Algorithm 395. 

A comparison of results from this algorithm against values 
obtained by inverting the function provided by Algorithm 395 
indicates a precision of over 6 significant digits for 10-4 < 
P <0.9,n > 1. At the conventional tabulation points in 0.001 < 
P < 0.9 results for n = 1, = 2, and n > 10 checked to 8 signifi- 
cant digits. 

Previously published tables [8, 4, 5] provide 3 or 4 decimal 
place check values, some of which are found to be slightly in 
error. Thus for n = 2, P = 0.001, ¢ is given as 31.598 by Fisher 
and Yates and by Federighi, 31.5991 by Smirnov, and 31.5990546 
by this procedure, while for n = 1, P = 0.001 the value 636.6096 
given by Smirnov conflicts with Fisher and Yates, Federighi 
(636.619) and this procedure (636.61925). Other errors in the last 
few digits in Smirnov’s table for low values of n and P include 
10.2129 for n = 3, P = 0.002, which should be 10.2145, and 4.7812 
for n = 9, P = 0.001, which should be 4.7809. 

t quantile may be used to obtain percentiles at values of P and 
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n not provided in existing tables or for extending their accuracy. 
Such tables are customarily used for assessing the significance of 
a sample value for ¢, but for automatic computation the proba- 
bility level is more effectively determined as P(¢|n) using a 
direct procedure such as Algorithm 395. 

Pseudorandom t-values may be generated for sampling appli- 
cations by using uniformly distributed pseudorandom numbers 
for P, and in this case normdev may be a real procedure return- 
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begin 


real half pi; half pi := 1.5707963268; 
if n = 1 then 
begin P := P X half pi; t quantile := cos(P)/sin(P) end 
else 
begin 
real a, b, c, d, 2, y; 
b := 48.0/a T 2; 


ing pseudorandom normal deviates which are independent of P. c := ((20700Xa/b—98) Xa—16) X a + 96.36; 
d := ((94.5/(b-+c)—3.0)/b+1.0) X sgrt(aXhalf pi) X n; 
z2:=dX P; y:=2 fT (2.0/n); 
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ify > 0.05 + a then 
begin 


x := normdev(PX0.5); y:= 2 fT 2; 


y := 
1.0) XK 2; 
y:=aXyf 2; 


end 


(n+2.0)X3.0)+0.5/(n+4.0)) Xy—1.0) X 
(n+1.0)/(n+2.0) + 1.0/y; 
t quantile := sqgri(nXy) 
end 
end Student’s ¢-quantile 


ifn<1VP>10V P < 0.0 then t quantile := error(n) 
else if n = 2 then ¢ quantile := sqrt (2.0/(PX (2.0—P))—2.0) 
else 
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Student’s ¢-distribution [S14] 
[G.W. Hill, Comm. ACM 13, 10 (Oct. 1970), 617-619] 


and 
REMARK ON ALGORITHM 396 


Student’s Quantiles [S14] 
[G.W. Hill, Comm. ACM 13, 10 (Oct. 1970), 619-620] 


Mohamed el Lozy [Recd 9 June 1978] 
Department of Nutrition, Harvard School of Public Health, 665 Huntington 
Ave., Boston, MA 02115 


Both of these algorithms incorporate very accurate mathematical methods, but 
contain a source of loss of precision which is severe for the many processors with 
precision less than or not sufficiently greater than that claimed for the algorithms. 

In Algorithm 395 the use of the asymptotic series involves the evaluation of 
In(1 + ¢?/n). For small y = ¢?/n and b = 1 + y, In(d) is of the order of magnitude 
of y, so that the statement 


if y > 10° then y := In(d) 


admits a loss of precision of up to 6 decimal digits. This loss will be especially 
marked on a machine with hexadecimal number representation, since the leading 
byte in 1 + y will be hexadecimal 1, or binary 0001, with a loss of a further 3 bits, 
in addition to the loss inherent in the addition. Where the processor’s implemen- 
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comment Asymptotic inverse expansion about normal; 
ifn < 5thenc :=c + 0.3 X (n—4.5) X (4+0.6); 

c:= ((0.05XdxXxz—5.0)X2—-7.0)Xx—-2.0) Xz +b+ ¢; 
(((((0.4Xy+6.3) Xy+36.0) Xy+94.5)/ec—y—3.0)/b+ 
y := if y>0.002 then exp(y) — 1.0 elsee05Xy f 2+y 


else y := ((1.0/(((n+6.0)/(nXy) —0.089 X d—0.822) X 
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Table I. Relative Errors in the Calculation of In(1 + ¢?/n) and exp(x”/n) — 1 by the 
Methods of Algorithms 395 and 396, for x = ¢ = 2 and Various Values of n 


In(1 + ¢7/n) exp(x?/n) — 1 


n PDP IBM IMSL/IBM PDP IBM 


20 0.245E—6 0.654E—6 0.0 0.538E—6 0.242E—5 
40 0.313E—6 0.500E—5 0.0 0.708E—6 0.567E—5 
80 0.137E—5 0.149E—4 0.299E—6 0.203E—5 0.118E—4 
160 0.754E—6 0.151E—4 0.0 0.177E—5 0.311E—4 
320 0.817E—5 0.150E—4 0.0 0.281E—5 0.349E—4 
640 0.688E—5 0.909E—4 0.0 0.187E—4 0.178E—4 
1280 0.201E—4 0.244E—3 0.298E—6 0.153E—4 0.282E—3 
2560 0.224K—5 0.244E—-3 0.149E—6 0.372E—6 0.447E—6 
5120 0.700E—4 0.244E—3 0.0 0.745E—7 0.298E—6 


10240 0.104E—3 0.164E—2 0.0 0.745E—7 0.0 


tation of In(b) for 6 near 1 effectively involves the Taylor series (b — 1) — (b — 
1)?/2 + ..., the replacement statement 


if b ~ 1 then y := y x (In(b)/(b — 1)); 


as in IMSL’s subroutine MDTD [1], counteracts the loss of precision in evaluating 
the logarithm as evidenced by column 3 of Table I. However, in the general case 
there are two solutions, the simplest of which is to evaluate Y = DLOG(1.0D0 
+ DBLE(Y)), using the variable Y (single precision) for ¢?/n, as in the algorithm 
under discussion. An alternative method might be based on the use of single 
precision LOG(1.0 + Y) for “sufficiently large’ Y, and a suitable number of terms 
of the Taylor expansion otherwise. In this case the optimal crossover point 
between the two methods of evaluation would be machine dependent and the 
coding would be longer, as exemplified for an analogous case in Algorithm 465 
[2]. 

In Algorithm 396 the expression exp(x?/n) — 1 occurs, and here again substan- 
tial loss of precision can occur for small y, to use the algorithm’s notation. 
Admitting a loss of precision of up to nearly 3 decimal digits, this algorithm shifts 
to a Taylor series expansion of exp(y) — 1 for y < 0.002, but this choice is machine 
dependent and unsuitable for 32-bit machines. Here again I would opt for double 
precision evaluation of that one expression (storing the result in single precision) 
over the alternative Taylor series approach. 

Table I shows the relative errors of single precision evaluation of these two 
expressions for ¢ (or x) equal to 2 and for various values of n, using the first two 
terms of the Taylor series for the exponential for y < 0.002 as in the algorithm, as 
well as the IMSL “fix.” The computations were done on an IBM 370/168 running 
under OS/MVT and on a PDP 11/70 running under UNIX. Though both 
machines have a mantissa of 24 bits, the results on the PDP are far better than 
those on the 370, presumably due to the hexadecimal normalization of the latter 
machine. 
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Table I. Relative Errors in the Calculation of In(1 + ¢?/n) and exp(x?/n) — 1 by the 
Methods of Algorithms 395 and 396, for x = ¢ = 2 and Various Values of n 


In(1 + t?/n) exp(x?/n) — 1 

n PDP IBM IMSL/IBM PDP IBM 
20 0.245K—6 0.654E—6 0.0 0.538E—6 0.242E—5 
40 0.313E—6 0.500E—5 0.0 0.708E—6 0.567E—5 
80 0.137E—5 0.149E—4 0.299E—-6 0.2083E—5 0.118E—4 
160 0.754E—-6 0.151E—4 0.0 0.177E—-5 0.311E—4 
320 0.817E—5 0.150E—4 0.0 0.281E—5 0.349E—4 
640 0.688E—5 0.909K—4 0.0 0.187E—4 0.178E—4 
1280 0.201K—4 0.244E—3 0.298E—6 0.153E—4 0.282K—3 
2560 0.224E—5 0.244E—3 0.149E-6 0.372E—6 0.447E—6 
5120 0.700E—4 0.244E—3 0.0 0.745E—-7 0.298E—6 


10240 0.104K—3 0.164E—2 0.0 0.745E—7 0.0 


tation of In(b) for b near 1 effectively involves the Taylor series (b — 1) — (6 — 
1)?/2 + ..., the replacement statement 


if b # 1 then y := y X (In(d)/(b — 1)); 


as in IMSL’s subroutine MDTD [1], counteracts the loss of precision in evaluating 
the logarithm as evidenced by column 3 of Table I. However, in the general case 
there are two solutions, the simplest of which is to evaluate Y = DLOG(1.0D0 
+ DBLE(Y)), using the variable Y (single precision) for ¢?/n, as in the algorithm 
under discussion. An alternative method might be based on the use of single 
precision LOG(1.0 + Y) for “sufficiently large” Y, and a suitable number of terms 
of the Taylor expansion otherwise. In this case the optimal crossover point 
between the two methods of evaluation would be machine dependent and the 
coding would be longer, as exemplified for an analogous case in Algorithm 465 
[2]. ; 
In Algorithm 396 the expression exp(x”/n) — 1 occurs, and here again substan- 
tial loss of precision can occur for small y, to use the algorithm’s notation. 
Admitting a loss of precision of up to nearly 3 decimal digits, this algorithm shifts 
-to a Taylor series expansion of exp(y) — 1 for y < 0.002, but this choice is machine 
dependent and unsuitable for 32-bit machines. Here again I would opt for double 
precision evaluation of that one expression (storing the result in single precision) 
over the alternative Taylor series approach. 

Table I shows the relative errors of single precision evaluation of these two 
expressions for ¢ (or x) equal to 2 and for various values of n, using the first two 
terms of the Taylor series for the exponential for y < 0.002 as in the algorithm, as 
well as the IMSL “fix.” The computations were done on an IBM 370/168 running 
under OS/MVT and on a PDP 11/70 running under UNIX. Though both 
machines have a mantissa of 24 bits, the results on the PDP are far better than 
those on the 370, presumably due to the hexadecimal normalization of the latter 
machine. 
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REMARK ON ALGORITHM 396 


Student’s ¢-Quantiles [S14] 
[G. W. Hill, Commun. ACM 13, 10 (Oct. 1970), 619-620.] 


G. W. Hill [Received 6 December 1978; revised 7 July 1979; accepted 6 August 
1979] 
Division of Mineral Chemistry, CSIRO, Port Melbourne, Australia 3207. 


The precision in excess of six decimal digits, claimed for quantiles evaluated using 
Algorithm 396 on a 36-bit precision processor, cannot be achieved for a processor 
precision of six hexadecimal digits. As noted in [1], the statement 


y := if y > 0.002 then exp(y) — 1.0 else 0.5 xX yf 2+y 


should be replaced by its implied extension 


y := if y > 0.1 then exp(y) — 1.0 
else ((y + 4.0) X y + 12.0) X y X y/24.0+y 


The relative error of this truncated Taylor series is less than that recorded for 
exp(y > 0.1) — 1 in el Lozy’s tests [1] on an IBM 370/168. 

For extended precision quantiles an initial approximation by Algorithm 396, 
for example, fo := t quantile (P, n, normdev, error), may be used as argument in 
an extended precision version of Algorithm 395 [2] to evaluate the two-tail 
probability integral P(t |). The difference of this result from the target proba- 
bility level may be divided by twice the frequency f(to|n), evaluated using 
Algorithm 465 [3], to obtain the first-order correction for fo, 

_ 3(P(to|n) — P) 
F(to | n) 
Rather than iterative inversion t,.: = t, + z(t, |), as suggested in the commentary ah 


of Algorithm 396, it is more efficient to avoid repeated evaluation of the proba- 
bility integral and frequency function by using the Taylor series expansion [5] 


2 2 , 3 
tmoter te +e, wee, 
where 
—d (n + 1)to 
amare f(t|n)] beroerir ae 
oy (n + 1)(n — £6) 
ato (n + £3)? 


> 


v= 


rt ry r Co = 1 
Cc = +— lie ; = 1. 
: dlo 


The relative error of the series, truncated to order z°, is approximately W°tie**'/ 
(s + 1), where € = z/to is the relative error of the initial approximation. Using 
Algorithm 396, for which |«| < 107°, the first few terms of the series provide 
considerable precision in the result. 

For processor “double precision” of 14 hexadecimal (16-17 decimal) digits, such 
as that of the IBM 360/370 series, the first three terms are sufficient: 


t:=(n+1)Xh&xz2xzx +2+ by; 


o Xitn 
provided that both the precision of P(é|m) and the sum of precisions of ft and 

{(to|n) at least equal a level appropriate for 14 hexadecimal precision, such as 14 an 
decimals to allow for precision loss in evaluating P(to|n). For 96-bit double - 
precision of the CDC 6000 series processor, allowing two or three decimal digit 
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COLLECTED ALGORITHMS (cont.) 


precision loss in P(t)|n), the series to z°® is sufficient for precision in excess of 
25 decimal digits, except for extreme probability levels beyond 10°” and large 
n (>50), for which the term in z* ensures 25-26 decimals. 


It is faster to use single-precision rather than double-precision operations in 


evaluating higher order terms of the Taylor series, such as the first term in the 
statement displayed above and the terms in z° and 2‘ in the fifth-order case. This 
approach has been validated by a FORTRAN implementation to double precision 
for the CDC 6400 and 7600 for tabulation of Student’s ¢-quantiles rounded off to 
20D [4]. 


REFERENCES 


1. 


2. 
3. 
4. 


Ex Lozy, M. Remark on Algorithm 395. Student’s ¢ distribution. ACM Trans. Math. Softw. 5, 
2 (June 1979), 238-239. 

Hitu, G.W. Algorithm 395. Student’s ¢ distribution. Commun. ACM 13, 10 (Oct. 1970), 617-619. 
Hit., G.W. Algorithm 465. Student’s ¢ frequency. Commun. ACM 16, 11 (Nov. 1973), 690. 
Hii, G.W. Reference Table: “Student’s” ¢-distribution quantiles to 20D. Tech. Paper No. 35, 
Div. Math. Statist., CSIRO, Australia, 1972, 24pp. 


. HILL, G.W., AND Davis, A.W. Generalized asymptotic expansions of Cornish-Fisher type. Ann. 


Math. Statist. 39 (1968), 1264-1273. 


396-P 5- 


0 


COLLECTED ALGORITHMS FROM CACM 


ALGORITHM 397 

AN INTEGER PROGRAMMING PROBLEM [H] 

S. K. Cuana AnD A. Giuu (Recd. 16 Feb. 1970 and 
11 May 1970) 

Electronics Research Laboratory and Department of 
Electrical Engineering and Computer Sciences, 
University of California,* Berkeley, CA 94720 


* Research sponsored by the Air Force Office of Scientific Re- 
search Office of Aerospace Research, United States Air Force, 
AFOSR Grant AF-AFOSR-639-67 and the National Science 
Foundation, Grant GK2277. 


KEY WORDS AND PHRASES: integer programming, change- 
making problem 
CR CATEGORIES: 5.41 


procedure MINDIST(C, M, SENSE, W, RESULT); 
value C,M; integer C,M; Boolean SENSE; 
integer array W, RESULT; 

comment This algorithm solves an integer programming prob- 
lem described in [1]. Given is a fixed weight vector w = (wi, 
W2,°+** , Wm), Where the w; are nonnegative integers, where m 
is a positive integer, and where 


1 = wi < we < 088 Wy 


For any nonnegative integer c (representing cost), an m-dis- 
tribution of c relative to w is an m-tuple (a1 , a2, °°: ,@m) such 
that the a; are nonnegative integers, and such that > =: acwi 
= c. The m-distribution (a1 , dz, --- , @m) is minimal if, for any 
m-distribution (61, b2, -:: , bm) of ¢ relative to w, we have 
DSmiai < DM1b;. The m-distribution (a1, a2, -+: , am) is 
standard if it is obtainable as follows: 


Cm = C 
Cy = Cipr — Giga XK Weg (@=m—1, m—2, --- , 1) 
as = Ci/ws (=m, m—1, «-- , 1) 


(where all divisions are integer divisions). 

If MINDIST(C, M, SENSE, W, RESULT) is called with a 
nonnegative integer C, a positive integer M, and an array 
W = (W[1], W[2], --- , W[M]), then the resulting array 

RESULT = (RESULT(1), RESULT(2], --- , RESULT{[M]) 
is a minimal M-distribution of C relative to W. If, before calling 
MINDIST, SENSE is set to true, then MINDIST retains 
SENSE as true if and only if RESULT is also a standard M- 
distribution of C relative to W. 


REFERENCE: 


1. Coane, S. K., anp Griut, A. Algorithmic solution of the 
change-making problem. J. ACM 17 (Jan. 1970) 113-122; 


begin 
integer [, J, R,Q, SUM, SUN; 
integer array A[1:M], B{1:M]; 
if M = 1 then 
begin 
RESULT(I} := C; 
EXIT! : 
go to EXIT 
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end 
Q := C/W[M); 
if (QXW[M]) > C then Q :=Q — 1; 
R:= C — W[M] XQ; 
if M = 2 then 
begin 
RESULT(1] := R; RESULT(2] := Q; 
EXIT? . 
go to EXIT 


end; 


LOOP: 
MINDIST (R+JXW[M], M—1, SENSE, W, B); 
if J ~ 0 then go to NOT ZERO; 


BETA: 
for J := 1 step1 until M—1do A[J] := B(J); 
A[M] := 0; 
GAMMA: 
if J = Q then 
begin 


for J := 1 step 1 until M do RESULT) := Al); 
EXITS: 


go to EXIT 
end; 
SUM := 0; 


for J := 1 step 1 until M do SUM := SUM + All); 
if (W(M|XxSUM—R—JXW[M))/(W[M]—W[M-—1]) < 0 then 
begin 
for [J := 1 step 1 until M — 1 do RESULT{|I] := Ald); 
RESULT[M] := A(M] + Q -—J; 


EXIT4: 


go to EXIT 
end; 
J :=J4+1; 
go to LOOP; 
NOT ZERO: 
SUM :=0; SUN := 0; 
for J := 1 step 1 until M do SUM := SUM + Al[q]}; 
for J := 1 step 1 until M — 1 do SUN := SUN + Bid); 
if SUM < SUN then 
begin A[M] := A[M] + 1; go toGAMMA end; 
SENSE := false; 
go to BETA; 
EXIT: 
end PROCEDURE MINDIST 


Remark on Algorithm 397 [H] 
An Integer Programming Problem [S.K. Chang and A. 
Gill, Comm. ACM 13 (Oct. 1970), 620-621] 


Stephen C. Johnson and Brian W. Kernighan (Recd. 
15 Sept. 1971) 
Bell Laboratories, Murray Hill, NJ 07974 


Editor’s note: The first correction was also noted by K.W. Coull 
of the University of Alberta.—L.D.F. 


COLLECTED ALGORITHMS (cont.) 


Key Words and Phrases: integer programming, change-making 
problem 
CR Categories: 5.41 


The published algorithm contains two substantial errors. 
1. Five lines after the label EX/T3, the line 
if (W[M|xSUM— R-—JXW([M})/(W([M]—W[M—1]) < 0 then 
should be replaced by 
if (WIM—1]XSUM—R-JXW[M]) < (W[M]—W([M—1]) then 
The use of W[M-—1] instead of W[M] corrects an error which also 
appears in the J. ACM article {1] upon which Algorithm 397 is 


based. 
2. Four lines after the label NOT ZERO, the line 


if SUM < SUN then 
must be replaced by 
if SUM < SUN then 


When this change is made, the algorithm correctly solves the test 
case described in [I], although producing a different answer than 
was published there. 

The algorithm would be clarified if, three and four lines after 
the label EX/T1, the statements 
Q := C/W{M], 
if (QXW([M}) >C then Q:=Q-1; 
were replaced by 
Q:=C+ W(|M; 


References 
1. Chang, S.K., and Gill, A. Algorithmic solution of the change- 
making problem. J. ACM 17 (Jan. 1970), 113-122. 


397-P 2- 


0 


COLLECTED ALGORITHMS FROM CACM 


ALGORITHM 398 
TABLELESS DATE CONVERSION* [Z] 


Ricuarp A. STonE (Recd. 2 Jan. 1970 and 6 April 1970) 


Western Electric Company, P.O. Box 900, 
Princeton, NJ 08540 
* Patent applied for. 


KEY WORDS AND PHRASES: date, calendar 
CR CATEGORIES: 5.9 


procedure calendar (y, n, m, a); 
value y, 7; integer y, n,m, d, t; 


comment calendar is called with the year in y and the day of the 
year in n. The month number is returned in m, and the day of the 
month is returned in d. The first section of the procedure changes 
the dates so that February has 30 days. The second section uses 
the fact that 30.55 (m+2) — 91 passes through the number of 


days preceeding each month. 


Error detection: m will be in the range 1-12 if and only if n 


is in the correct range; 
begin 
t:= if (y + 4)*4 = y then 1 else 0; 
comment The following statement is unnecessary 
if it is known that 1900 < y < 2100; 
t := if (y+400)+400 = y V (y+100)*100 ¥ y then ¢ else 0; 
d:=n-+ (ifn > (69+) then 2 — i else 0); 
m := ((d+91)+*100) + 3055; 
d := (d+91) — (m+3055) + 100; 
m:=m— 2 
end calendar 


Remark on Algorithm 398 [Z] 


Tableless Date Conversion [Richard A. Stone, Comm. 
ACM 13 (Oct. 1970), 621] 


J. Douglas Robertson [Recd. 16 Dec. 1970 and 30 
Mar. 1971] 
200 Oakcrest Drive F-161, Lafayette, LA 70501 


Key Words and Phrases: date, calendar, Fortran statement func- 
tion, arithmetic statement function 
CR Categories: 3.15, 4.9, 5.9 


As a companion to Algorithm 398, I offer a relatively compact 
algorithm for calculating the day of the year on which a particular 
date falls given the year, month, and day of the month. The algo- 
rithm is written below as a Fortran arithmetic statement function, 
where J is the year; J is the month, (1 = Jan, ..., 12 = Dec); 
and XK is the day of the month. 

IDAY(LJ,K) = 3055*(J+2)/100—(J+10)/13+2—91 
+ (1 — U—1/4*4+3)/4+ —1/100*«100+-99) /100 
—- (I—1/400*400+ 399) /400)*(J+10)/13+K 
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The above, along with Stone’s Algorithm 398, Robert G. 
Tantzen’s Algorithm 199 [2], and the two algorithms by H.F. 
Fliegel and T.C. Van Flandern [1] constitute a comprehensive set 
of algorithms for processing calendar dates. A useful addition to 
this set would be an algorithm for Zeller’s Congruence (calculates 
the day of the week on which a particular date falls) as described 
in [3]. It appears below as a Fortran arithmetic statement function, 
where J is the year; J is the month, (1 = Jan, ..., 12 = Dec); 
and K is the day of the month. 


IZLRU,J,K) = MOD ((134(J+10—(J+10)/13+*12)—1)/S+K+77 
+. 5*(I+ (J—14)/12— + (J— 14) /12)/100*100) /4 
+ U+(J—14)/12)/400— 7+ (J—14)/12)/100*2,7) 


References 

1. Fliegel, H.F., and Van Flandern, T.C. A machine algorithm 
for processing calendar dates. Comm. ACM 11 (Oct. 1968), 657. 
2. Tantzen, Robert G. Conversions between calendar date and 
Julian day number, Algorithm 199. Comm. ACM 6 (Aug. 1963), 
444. 
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ALGORITHM 399 sae 
SPANNING TREE [H] Bat ee 
if V([j] = 0 then V(t] := Vij] :=c:=c4+1 

Jouko J. SEPPANEN (Recd. 6 Jan. 1970 ‘aia 8 May 1970) ae nr fo ar 
Computing Center, Helsinki University of Technology, Vii] := VU 

Otaniemi, Finland end 

: else if V[j] = 0 then 
KEY WORDS AND PHRASES: graph, tree, spanning tree begin 
end 

procedure spanning tree(v, €, I, J, D, f); else if V[i] ¥ Vil then 

valuev,e; integerv,e,p; integer array /,J,T; begin 

; ; . Tk—n] := ky t:= Vit}; 7 := V9; 

comment This procedure grows a spanning tree 7 for a given for r := 1 step 1 until v do 

undirected loop-free graph G = (N, E) of v vertices and e edges. if V(r] = 7 then V{r] := 

If G is disconnected a spanning forest will be grown. end graft 

The edges (I[k], J(k]) € HE fork = 1,2, --- , e are assumed to elec i= nb 1 
be stored in the arrays J[l:e] and J[l:e]. At each stage of the end edge; 


algorithm one edge is considered whereby one of four possible 
conditions will arise. If neither of the vertices is included in a 
tree, this edge is taken as a new tree and its vertices numbered 
by an incremented component number c. If one vertex is in a 
tree, the edge will be grown to this tree. If the two vertices are in 
different trees, these will be grafted into a single tree by renum- 
bering the vertices of the other component. Finally, if both 
vertices are in the same tree, the edge completes a fundamental 
cycle of the graph with respect to the spanning tree and conse- 
quently will not be considered further. At the end, the indices 
of the edges in the spanning tree are stored in the array T[1:v—p] 
where p is the number of trees in the forest. The procedure can 
also be used to find a minimal spanning tree by sorting the edges 
into ascending order before calling the procedure. 

The main loop in the procedure is executed e times. For cases 
where the ratio e/v is high it could be worthwhile to introduce 
an additional variable, say d, in the program, for keeping a 
count of the number of edges included in 7’. When d has attained 
the value of v — 1 the algorithm could terminate. 

REFERENCES: 

. Berge, C., aNnp GuourLa-Houri, A. Programmes, Jeux et Re- 
seaux de Transport. Dunod, Paris, 1962, pp. 179-182. 

2. Berce, C., anp GHourta-Hourt, A. Programming, Games and 
Transportation Networks. Methuen, London, and Wiley, New 
York, 1965, pp. 177-180. 

. Kruskau, J. B., Jk. On the shortest spanning subtree of a 
graph and the travelling salesman problem. Proc. Amer. 
Math. Soc. 7 (1956) 48-50. 

4. Opruca, A. Algorithm 1. Mintree. Computer Bull. (Sept. 

1964) 67. 
5. Knutu, D. E. The Art of Computer Programming, Vol I Fun- 
damental Algorithms. Addison-Wesley, Reading, Mass., 1968. 
pp. 370-371; 
begin 
integer 2,7, k,c,n, 17; 
integer array V[1:v]; 
c:=n:= 0; 
for k := 1 step 1 until v do V[k] := 
for k := 1 step 1 until e do 
begin 
= Ik]; 3 := JIk]; 
if Viz] = 0 then 


p:=v—-—et+n 
end spanning tree 


—_ 
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ALGORITHM 400 . 

MODIFIED HAVIE INTEGRATION [D1] 

Grorce C. Watuick (Recd. 26 Jan. 1970 and 25 Apr. 
1970) 

Mobil Research and Development Corporation, Field 
Research Laboratory, P.O. Box 900, Dallas, TX 75221 


KEY WORDS AND PHRASES: numerical integration, Havie 
integration, Romberg quadrature, modified Romberg-quadra- 
ture, trapezoid values, rectangle values 

CR CATEGORIES: 5.16 


DESCRIPTION: 
The Havie integration method for the approximate evaluation of 
the definite integral 


B 
T= 
[ F(x) de (1) 


as implemented in ACM Algorithm 257 [4] is based upon the paral- 
lel generation of the Romberg table of trapezoidal 7;* values [1] 
and the table of rectangular 2;* values also used by Krasun and 
Prager [8]. At each step in the development of the tables the dif- 
ference | T;* — R;* | is examined. If | T;* — R;* | < ¢ the process 
is said to have converged and the algorithm returns a value of 


THY = 4(Tj*+R;*). — (2) 


For some F(X), e.g. F(X) = e* and F(X) = 2/(2+sin 107X), 
the R;*, T;* pairs converge more rapidly than the Romberg se- 
quence of 7';* values. (This is the same class of F(X) for which a 
simple nonadaptive Simpsons Rule algorithm [5] is competitive 
with the Havie algorithm.) For other F(X), the Havie algorithm 
is slightly less efficient than the Romberg algorithm. 

Like Romberg quadrature, Havie integration requires the evalu- 
ation of the rectangular values 


B-AS 1B 
or [4 + (j-#) 54 : (3) 


R& = 


Rutishauser [6] recognized that this repeated addition of small 
terms to a large partial sum can lead to serious roundoff error. 
He suggested a procedure for the evaluation of the R,* which sig- 
nificantly reduces this error. The method, used by Fairweather [2] 
in a modified Romberg algorithm, leads to a significant improve- 
ment in accuracy for large orders of extrapolation. 

In the modified Havie integration algorithm HRVINT the R,* 
are evaluated using a 3-level version of the Rutishauser procedure. 
The arguments X of the generating function F(X) are evaluated 
as in eq. (3) rather than by accumulative addition as in Algorithm 
257. , 

In the argument list for HRVINT, F is the name of the generat- 
ing function FUNCTION F(X) which returns a value of F(X) 
corresponding to a specified value of X, A, and B represent the 
lower and upper limits of integration, and MAX is the maximum 
order of extrapolation to be permitted, MAX < 16. Values of 
MAX > 16 are interpreted as MAX = 16; the value of MAX is not 
changed by the subprogram. Computation is terminated when 


| T;* — Ri* | < ACCs | 7;* | 
or when the order of extrapolation MFIN = MAX. Here ACC is a 


measure of the desired relative accuracy, ACC > 0. Upon exit 
HRVINT is the approximate value of the integral, FAC is a meas- 
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ure of the final relative accuracy achieved 
FAC = | 7;* — Rj* |/| 7;* | 
and MFIN is the order of extrapolation. 


_ Test case. HRVINT was tested in Fortran IV on a CDC 6400 
computer using single-precision floating point arithmetic (14+ 


TABLE I. A Comparison oF THE Havin AND MODIFIED 
Havig ALGORITHMS 


B 
i= F(X) dX 
[, Feo 


(m = Extrapolation Order, m < 16; N.S.F. = Number of 
Significant Figures) 


Numerical Evaluation 


value Havie Modified Havie 
FX) mes (digits Specified |__| 
10-16) relative I I : 
accuracy | (digits! m| of | (digits| m| cf 
10-14) = | 10-14) < 
ea? 0:0 | 5.0 | 45139 55 | 10--10-2 | 46726 | 3] 10 | 46726} 31! 10 
10-10-10 | 45089 | 4] 11 | 45080] 4] 11 
10-0 = |. 45110 | 5 | 12 | 45111 | 5 | 12 
10-2 =| 45128 | 6 | 12 | 45131 | 6] 12 
10-13 45184 | 6| 12 | 45187] 6 | 13 
10-4 =| 39757 | 16 | 9 | 45187; 7] 18 
1075 += | 39757 | 16 | 9 | 45136 | 10 | 13 
In x 1.0 {10.0 | 29940 46 107° 29845 | 8| 11 | 20846| 8] 11 
10- | 29937 | 8| 13 | 29030; 8] 13 
10-10-12 | 29987 | 9 | 13 | 29040] 9} 14 
1073 | 29987 | 9 | 13 | 29040) 10| 14 
10-" | 29556 | 16] 11 | 299401 10| 14 
(1 +2)71 | 0.0} 1.0 | 55994 53 10-9 56353 | 6 | 11 | 56354] 6| 11 
10- | 55996 | 6 | 13 | 55907; 6] 13 
10" —- | 55900 | 6 | 13 | 55901 | 6 | 138 
10-2 =| 56988 | 7| 12 | 55991 | 7 | 13 
10-8 ~—| 55987; & | 12 | 58901 | 7 | 13 
10-10-15 | 53242 | 16 | 10 | 55991] 9 | 13 
(1 + 24)-1 | 0.0 | 1.0 | 38001 10 | 10-8-10-7 | 35683} 8&| 10 | 35634| 5| 10 
10-8-10- | 33993 | 6 | 13 | 33095 | 6] 13 
10-11-1012 | 33984 | 7/ 12 | 38089/ 7] 13 
10-3 =| 30854 | 16 | 10 | 339087] 7/ 13 
10-14-10-15 | 30854 | 16 | 10 | 33988] 9} 13 
as 0.01] 1.1 | 68595 04 10-8 71022 | 13} 10 | 71829] 13] 10 
10-? 68136 | 13 | 11 | 68647] 13| 11 
107 | 68076 | 13} 10 | 68589] 13] 12 
10" =| 64508 | 16 | 10 | 68500] 14] 12 
1072-10-13 | 64508 | 16 | 10 | 68589 | 14| 12 
10-4-10-18 | 64508 | 16 | 10 | 68584 | 16] 12 
ans 9.01] 1.1 | 89506 64 10-8 89368 | 13] 11 | 89604} 13] 11 
1079 go199 | 13 | 11 | 89526 | 13] 12 
10- =| ggg57 | 14 | 10 | 89503 | 14] 13 
1071-10-12 | 86878 | 16 | 10 | 89502 | 14] 13 
10-3 — | 86878 | 16 | 10 | 89502 | 15] 13 
1074-10-15 | 86878 | 16 | 10 | 89499 | 16 | 12 
2-8 0.01) 1.1 | 29246 64 10-8 29556 | 13 | 11 | 29767/ 13] 10 
10--10-10 | 28828 | 14] 11 | 29247] 13] 14 
1o-" ~—- | 27557 | 16 | 10 | 29245 | 14] 13 
1012-10-13 | 97557 | 16 | 10 | 29244 | 15) 13 
10-4 =| 27557 | 16 | 10 | 29244 | 16| 138 
105 | 27557 | 16 | 10 | 29242] 16| 13 
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decimal digits). Corresponding integral values were also obtained 
using a Fortran version of the standard Havie Algorithm 257. The 
results of these tests are summarized in Table I. 

For modest accuracy requirements, the two algorithms are seen 
to be equivalent. For both algorithms the maximum accuracy 
achievable is limited by truncation and roundoff error. Since the 
Rutishauser modification serves to reduce the magnitude of such 
errors, the modified Havie algorithm can, in many cases, return 
optimum integral values that are from 1 to 2 significant figures 
more accurate than those returned by Algorithm 257. 

In the routine use of the algorithms it is possible to specify an 
accuracy requirement that cannot be satisfied. When this condition 
obtains, the algorithms are forced to proceed to the maximum per- 
mitted extrapolation order. With Algorithm 257 error accumula- 
tion accompanying such an overspecification can lead to a serious 
decline in evaluation accuracy. With the modified Havie algorithm 
IIRVINT this loss is minimized and in most cases virtually elimi- 
nated. 


Acknowledgment. The author wishes to thank Mobil Research 
and Development Corporation for permission to publish this in- 
formation. 
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FUNCTION HRVINT(CF sA¢B oe MAX ACC FAC eMFIEN) 
C HAVIE INTEGRATION WITH AN EXPANDED RUTISHAHSER- 
TYPE SUMMATION PROCEDURE 
OIMENSTON TC1L7) ¢UCE7) + TPREVIL7) ,UPREV(IT) 
C TEST FOR MAX GREATER THAN 16 
MUX=MAX 
TF (MAX-16)10210,5 
5 MUX=1L6 
INITEALIZATION 
10 ENPT=0.5¥*(FCA)+F(B)) 
SUMT=0.0 
MFIN=1 
Nel 
H=B~A 
SH=H 
C BEGIN REPFTITIVF LOOP FROM ORDER 1 TO ORDER MAX 
15 TCL) SH*¥ CENPT+SUMT) 
SUM=0. 
NN=N4+N 
FN=NN 
EM=SH/FN 
C BEGIN RUTISHAUSER EVALUATION OF RECTANGULAR SUMS 
C INITIALIZATION 
TF(NN=16) 20420, 25 


oo 


hae 


oO 


a 


20 NZ=NN 
GA TO 30 
25 NZ=16 
TFONN=256)30430935 
30 NA=NN 
60 TO 40 
35 NA=256 
TF ONN-4096)40140045 
40 NB=NN 
GO TO 50 
45 NB=4096 
C DEVELOPMENT OF RECTANGULAR SIIMS 
50 DO 70 KC=1+9NN+4096 
SUMB=0. 


KK=KC+NA=1 
DO 65 KBSKCyKKy256 
SUMA=0. 
KKK=KB+NA~1 
NN 60 KA=KAeKKK 416 
SUMZ=0. 
KFR=KA+NZ7~-1 
DN 55 KZ=KAdKFR_2 


ZKZ=KZ 
55 SUMZ=SUMZ+F (A+Z7KZ7*EM) 
60 SUMA=SUMZ+SUMA 
65 SUMB=SUMA+SUMB 


70 SUM=SUMB+ SUM 
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C END OF RUTISHAUSER PROCFDURE 
UCL) =H*StIM 


K=1 
C REGIN EXTRAPOLATION LOOP 
75 FAC=ABS(T(K})-U(K)) 


TF (T(K))80+85 480 
C TEST FOR RELATIVE ACCURACY 


80 LF (FAC—ARSCACC¥T(K)) 1904904200 
C TEST FOR ABSOLUTE ACCURACY WHEN T(K)=0 
85 TFC FAC-ABS (ACC) 195 9959100 
90 FAC=FAC/ABS(T(K)) 
C ENTEGRAL EVALUATION BEFORE EXIT 
95 HRVINT=0.5%( TIKI tU(K) ) 
RETURN 
100 LF (K=-MFIN) 10541159115 
105 AK=K+K 
D=2.**AK 
DMA=D-1.0 


C BEGIN EXTRAPOLATION 
T(K+1L)=(D0*T(K)-TPREVIK)/DMA 
TPREV(K)=T(K) 

U(K+1)=(D¥*U(K )-UPREV(K))/0MA 
UPREV(K)=U(K) 

C END EXTRAPOLATION 
K=K+1 
TF (K=MUX)75 91109110 

C END EXTRAPOLATION LOOP 

110 FAC=ABS(T(K)-U(K)) 
TF(T(K)990¢95490 

C ORDER IS INCREASEN BY ONE 

115 H=0.5*H 
SUMT=SUMT+SUM 
TPREVIK)=T(K) 
UPREV(K)=UCK) 
MFIN=MFIN+1) 
N=NN 
GO TO 15 

C RETURN FOR NEXT ORDER EXTRAPOLATION 
END 


Remark on Algorithm 400 [D1] 

Modified Havie Integration 

[George C. Wallick, Comm. ACM 13 (Oct. 1970), 622— 
624] 


Robert Piessens [Recd. 17 Apr. 1973] 
Applied Mathematics and Programming Division, Uni- 
versity of Leuven, B-3030 Heverlee, Belgium 


Recently, Casaletto et al. {1] tested a number of automatic in- 
tegrators by calculating 50 test integrals with different specified 
tolerances. We shall refer to these integrals as #1, #2,..., #50. (A 
list can be found in {1] or [2].) One of the aims of their tests was to 
give a summary of the number of failures (when the computed value 
was not within the requested tolerance) and overflows (when an 
upper bound on the number of integrand evaluations prevented the 
specified accuracy from being reached) of each integrator. We have 
examined some other recently published integrators in a similar way. 
Our study reveals that HRVINT fails more frequently than the other 
integrators. For example, for the specified relative accuracy ACC = 
10-3, HV RINT fails on #26, #31, #34, #45, and #47, and for ACC = 
10-4, on #20, #26, #31, #32, #34, #45, and #47. It is worth while to 
note that #20 and #32 are integrals with very smooth integrand. 

Most failures can be avoided by changing the statement labeled 
75 to 
75 IF (MFIN—2) 100, 100, 76 
76 FAC = ABS(T (K)—U(K)) 

Indeed, with this alteration failures occur only on #47 (for both ac- 
curacies ACC = 10-% and 1074). 


References 

1. Casaletto, J., Pickett, M., and Rice, J. A comparison of some 
numerical integration programs. SIGNUM Newsletter 4, 3(1969), 
30-40. 

2. Gentleman, W.A. Implementing Clenshaw-Curtis quadrature, 
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ALGORITHM 401 

AN IMPROVED ALGORITHM TO PRODUCE 
COMPLEX PRIMES [A]] 

Pauu BratTury (Recd. 25 Feb. 1970) 

Département d’informatique, Université de Montréal, 
C.P. 6128, Montréal 101, Quebec, Canada 


KEY WORDS AND PHRASES: 
complex numbers 
Ck CATEGORIES: 5.39 


number theory, prime numbers, 


integer procedure cprimes(m, PR, PI); 
value m; integer m; integer array PR, PI; 

comment The procedure generates the complex prime numbers 
located in the one-eighth plane defined by 0 < y < x. Any prime 
found in that area has seven more associated primes: —xz + yt, 
ae — yt, ty + zt. These associated primes must be generated 
externally to cprimes. The first complex prime generated by 
cprimes is 1 + 1, which exceptionally lies on x = y and has only 
three associated primes. 

The algorithm generates a list of complex primes in order of 
increasing modulus: the parameter m of the call is the highest 
modulus to be included in the list and should satisfy m > 2. 
PR and PI will contain respectively the real and imaginary 
parts of the generated list, with PR > PI > 0 for each prime. 
The value of the procedure is the number of primes generated. 

Algorithm 311 [1], seve 2, is used to generate the rational 
primes less than m?,. Then it is known (see, for instance [2]) 
that a rational prime p of the form p = 4n + 1 can be expressed 
as p = a? + b?, and factorized as (a-+bi)(a—bz) in the complex 
plane, where a + 67 and a — bi are complex primes. For our 
present purpose we choose a > 6 and include only a + 067 in the 
list. A rational prime p of the form p = 4n + 3 remains prime 
in the complex plane, so we include p + 07 in the list if p < m. 
Finally, the complex prime 1 + 7 may be thaught of as one of 
the factors of the remaining rational prime 2 = (1+7)(1—7). 

Although this algorithm and Algorithm 372 [3] are not directly 
comparable, since they produce the list of complex primes in a 
different order, the accompanying remark suggests that the 
present algorithm is often to be preferred. 

REFERENCES: 

1. Cuartres, B. A. Algorithm 311, Prime number generator 2. 
Comm. ACM 10 (Sept. 1967), 570. 

2. Harpy, G. H., anp E. M. Wricut. An Introduction to the 
Theory of Numbers, 4th ed. Clarendon Press, Oxford, 1965, 
Chs XII and XV. 

3. DunHAM, K. B. Algorithm 372, An Algorithm to produce 
complex primes, CSIEVE. Comm. ACM 13 (Jan. 1970), 
52-53 ; 

begin 

integer a, b, Cc, d, é, 1, Jd; Pp; q; 

integer array P2[1:0.7m f 2/In(m)], 

P3{1:1.4Xm/ln(m)); 
e€ := sieve 2(mf 2, P2); 

PR := Pll] :=a:=ec:= 1; 


b := 0; 
for d := 2 step 1 until e do 
begin 

p:= Pd); g:=p— 1]; 


if (q+4) X 4 ¥ q then 
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begin . 
if p < m then 
begin b := 6 +1; 
end 
else 
begin 
Ql: 


P3[b] := p end 


ifa < b then 
begin 
if P3{a] fT 2 < p then 
begin 
c:=e+1; 
a:=a+l; 
go to Ll 
end 
end; 
q := entier(sqrt(p/2)+1); 
for 7 := gq step 1 until p do 
begin 
j i= sqrt(p—if 2); 
ifi 72+ j 7 2 = pthen goto L2 
end 
comment Note that the jump to Z2 is always made before 
the cycle is terminated; 


PR{c] := P3la]; 
PlI{c] := 0; 


L2: 
ec:=c+]1; 
end 
end; 
[3: 
ifa < b then 
begin 
c:=ecert+]; 
a@a:=a+t+l; 
go to L3 
end; 
cprimes := ¢ 
end cprimes 


PR{c] := 1; Plfe] := j 


ll 


PR{c] : 
PiI{c] : 


P3{a]; 
0; 


ll 


REMARKS ON 

ALGORITHM. $372 [A]1] 

AN ALGORITHM TO PRODUCE COMPLEX 
PRIMES, CSIEVE [K. B. Dunham. Comm. ACM 13 

. (Jan. 1970), 52-53] 

ALGORITHM 401 [A1] 

AN IMPROVED ALGORITHM TO PRODUCE COM- 
PLEX PRIMES [P. Bratley. Comm. ACM 13 (Nov. 
1970), 693] 

Pauw Bratiery (Recd. 25 Feb. 1970) 

Département d’informatiqué, Universite de Montréal, 
C.P. 6128, Montréal 101, Quebec, Canada 

KEY WORDS AND PHRASES: 


bers, complex numbers 
CR CATEGORIES: 5.39 


number theory, prime num- 


Algorithm 372 was run on the CDC 6400 at the University of 
Montreal. The variable 7 is undefined if the for-loop at label A is 
completed. The statement 


t:=jg+1; 
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should be added immediately before label B. Algol purists may 
also care to remove redundant semicolons after go to A and go to 
B, and the redundant parentheses in one if-statement. With these 
changes the algorithm produced correct results for several values 
of m. 

The comment in Algorithm 372 is slightly inaccurate. The first 
prime generated by the algorithm is 1 + 7, which does not have 
PR > PI, and which has not seven but three associated primes. 

Tt is not possible to compare the speeds of Algorithm 372 and 
Algorithm 401 directly since they generate primes in a different 
order. However, the following test was run. A value of m was 
chosen, and Algorithm 401 was used to list all the complex primes 
with modulus less than m. The time taken and the number of 
primes produced were noted. Then Algorithm 372 was used to 
produce an equal number of primes, the time taken again being 
noted. Times observed are shown in Table I. 


TABLE I 
oe Algorithm 401 . Time taken by Algorithm . 
Limit . T tak Rat 
madndas Sate ns Tesh STE fo draduce the 2ame. somes taken 

25 60 0.278 0.331 1.2 
50 189 1.577 2.140 1.4 
75 373 4.217 7.602 1.8 
100 623 8.618 20.214 2.4 
3.4 


150 1266 23.732 79.481 


The conclusion from the figures in Table I is that if the speed 
with which the complex primes are generated is of paramount 
importance then Algorithm 401 should be preferred to Algorithm 
372. 

As written Algorithm 401 will use more memory than Algorithm 
372 since it is convenient and perspicuous to use sieve2 in an un- 
modified form, which makes it necessary to store temporarily all 
the rational primes less than m?. However, if space is tight then 
steve2 can easily be modified so as to generate rational primes one 
at a time on successive calls, and in this way the use of the long 
array P2 can be avoided. If this modification is made Algorithm 
401 will in fact use less store than Algorithm 372, which wastefully 
stores many useless values in PM. It is also to be noticed that the 
factors 0.7 and 1.4 occurring in the declarations of P2 and P3 may 
be diminished for large m: all that is necessary is that P2 should 
be long enough to hold the rational primes less than m?, and that 
P3 should be long enough to hold the rational primes which are 
not greater than m and which are of the form 4n + 3. Some space 
may be saved similarly in sieve2, which is called from Algorithm 
401. 
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402-P 1. Rl 
ALGORITHM 402 begin 
INCREASING THE EFFICIENCY OF z= alp); . 
QUICKSORT* [M4] > xx then go to right 
M. H. van Empen (Reed. 15 Dee. 1969 and 7 July 1970) pe ge cae 
Mathematical Centre, Amsterdam, The Netherlands _ . right: 
* The algorithm is related to a paper with the same title and by for g ‘= ¢— 1 while ¢ > p do 
the same author, which was published in Comm. ACM 18 (Sept. begin 
1970), 563-567. 2 1= ala; 
if z < zz then go to dist 
KEY WORDS AND PHRASES: sorting, quicksort end; 
CR CATEGORIES: 5.31, 3.73, 5.6, 4.49 Qq:=p; p:=p—1; z:=2; « := alpl; 
dist: 
ifz > z then 
procedure gsort(a, ll, u1); begin 
value J1, ul; integer ll, ul; array a; y= 2; alp] i= 2 := 2; 
comment This procedure sorts the elements a{l1], a{/1+1], --- , alg) i= zi=y 
a[ul] into nondescending order. It is based on the idea described ae: 
in [1]. A comparison of this procedure with another procedure, if x > xz then ; 
called sortvec, obtained by combining C. A. R. Hoare’s quicksort Begin ea eit ie pend, 
[2] and R. 8. Scowen’s quickersort [3], in such a way as to be if nee then 
optimal for the Algol 60 system in use on the Electrologica X-8 begin zz := z; iz := q end; 
computer at the Mathematical Centre is shown below. Here go to Left; 
‘repetitions’? denotes the number of times the sorting of a oer F ; 
sequence of that “‘length’’ is repeated; ‘‘average time’’ is the af Bie Nee Te then 
time in seconds averaged over the repetitions; “gain” is the ness alp| Fee Ole) ie EO, 
difference in time relative to time taken by sortvec. it Ge tz Nz # 22 then 
begin a{g) := 2z; afiz] := z end; 
procedure length repetitions average time gain ifu-—q>p-—Ithen 
sortvec 30 23 .09 begin 11 := 1; ul := p—-—1; 1:=q+1lend 
qsort 30 23 .06 + .37 else 
sorivec 300 16 1.25 begin ul := u; UL:=q +1; u:= p— lend; 
gsort 300 16 1.03 +.17 if ul > 11 then sort; 
sortvec 3000 9 17.43 if u > 1 then go to part 
gsort 3000 9 15.25 +.13 end 
sortvec 30000 2 232.46 = =. end of sort; 
gsort 30000 2 197.96 +.15 if ul > 11 then sort 
end of gsort 
REFERENCES: 
1, van Empen, M. H. Increasing the efficiency of quicksort. 
Comm. ACM 13 (Sept. 1970), 563-567. 
2. Hoare, C. A. R. Algorithm 64, quicksort. Comm. ACM 4 
(July 1961), 321-322. 
3. Scowen, R. S. Algorithm 271, quickersort. Comm. ACM 8 
(Nov. 1965), 669; 
begin 
eee Me me — Remark on Algorithm 402 [M1] 
eeovedure nay , Increasing the Efficiency of Quicksort [M.H. Van 
begin Emden, Comm. ACM 13 (Nov. 1970), 693-694] 
integer |, u; 
PS thy: eeu Robert E. Wheeler [Recd. 6 July 1971] 
aa E.I. du Pont de Nemours and Company, 
p:=l, q:=u; x:=alpj; 2 := alg); tava 
ifecS #then Wilmington, DE 19899 
begin y := 2; alp] := 2 := 2; alg) := 2 := y end; 
ifu—l>1then It will happen during execution of this algorithm that sequences 
begin will be encountered which are already in nondescending order 
BL 2; Wis Pp i= 2; 12 := q; and which should not be further sorted. Changes to the algorithm 
left: which accomplish this are indicated below. For a Fortran version 


for p := p + 1 while p < q do of this algorithm running on a Univac 1108, these changes de- 
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creased running time by 1.25 percent when sorting random arrays 
of length 500 and by 2.7 percent when sorting random arrays of 
length 50. 


Line Change to: 


2 integer p, q, ix, iz, (,/; 

9 p:=l; q:= 4; x:=alp]; z:= alg]; i:= 0; 
i= Ep Ht 

36 begin xx := x; (¢:=i+1; ix:= pend; 

38 begin zz := 2; i:=i+1; iz:=qend; 


48.5 if 7 +j begin 
50.5 end; 
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ALGORITHM 403 

CIRCULAR INTEGER PARTITIONING [A1] 

M. W. CoLEeMAN AND M.S. Taytor (Reed. 30 June 1970) 
Aberdeen Proving Ground, MD 21005 


KEYWORDS AND PHRASES: partitions, combinatorics, 
Statistical design of experiments 
CR CATEGORIES: 5.39, 5.5 
DESCRIPTION: 
The partition, when expressed as a K-tuple (X1 ,°7', XK), may 


be thought of as a K-digit number in the base V number system. 
The procedure C/RPI then functions as a counter which generates 
successive K-digit numbers in the base V number system. How- 
ever, since all K-digit numbers do not correspond to circular 
partitions, it is possible to have the procedure generate only a 
subset of K-tuples for consideration, using the following criteria: 

(a) The digits are constrained to sum to V, consequently, the 
K digits are not independent. Thus the procedure need only 
operate on the K — 1 most significant digits, the least significant 
digit being an easily computable function of the other K — 1 digits. 

(b) Since the numbers are sequentially increasing, a given 
number is a cyclic permutation of a previously generated number 
if a cyclic rotation of its digits produces a number with a smaller 
value. Thus the most significant digit, X, , provides an effective 
minimum value for any of the digits. 

(c) Given that the digits must sum to V and the minimum value 
for any digit is X1 , the value V — X,* (K — 1) provides an effec- 
tive maximum for any digit. 

(d) Since the maximum and minimum values depend on the 
most significant digit, X: , the procedure is finished when X, has 
increased to the point where the minimum digit size exceeds the 
maximum digit size, ie. when X1 > V — X, * (K — 1). This 
easily reduces to X, > V/K, providing an easy method for termi- 
nating the K-tuple generation as early as possible. 

Therefore, the procedure efficiently generates the totality of 
circular partitions since it can greatly restrict the number of K- 
tuples that must be considered. 


REFERENCES: 


1. Davin, H. A., ano F. W. Wouock. Cyclic designs. Annals of 
Math. Stat. 36 (1965), 1526-1534. 

2. Niven, I., Mathematics of Choice. Random House, New York, 
1965, ch. 6. 


ALGORITHM: 
SUBROUTINE CIRPI (Vy Ke X) 


THIS SUBROUTINE GENERATES ALL K-TUPLES SUCH THAT. coos 
&) THE SUM OF THE K ELEMENTS OF THF K-TUPLE IS Vy 
B) EACH OF THE ELEMENTS IS AN INTEGER GREATER THAN O» AND 
C) NO K-TUPLE IS A CYCLIC PFRMUTATIGN OF ANY OTHER K-TUPLE. 
THE K-~TUPLE IS STORED IN THE ARRAY X» WITH ONE ELEMENT 
PER ARRAY ELEMENT. EACH K-TUPLE IS PROCESSEN BY THE USER 
{USING THE SUBROUTINE *PROCES") BEFORE THE NEXT K-TUPLE IS 
GENERATED. THE SUBROUTINE *PROCES* MUST NOT CHANGE THE 
CONTENTS OF THE ARRAY X. 


aanaannaaarnnan 


INTEGER XEK)e Ve Vhe V2_ Ce SUM 
V-K+l1 

V/K 

K-1 

K-2 

SUM = KI 


< 
N 
wou og tt 


aa 


INITIALIZE THE ARRAY X WITH THE FIRST K-TUPLE, 


aan 


OAMAaan 
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00 100 I = ly Ki 
xX({I) = 1 
100 CONTINUE 
GO TO 115 
GENERATE THE NEXT K-TUPLE WHICH SATISFIES THE GIVEN 
CONDITIONS, A) — C). 
110 c= 
SUM = X(L) 
NO 113 1 = ly K2 
[1 = K-I 
XCEL) = XCLTLIFE 
IF ¢xX(11) .LT. Vl) GO TO tbl 
XCIL? = XOL) 
GO TO 112 
111 c=0 


112 SUM = SUM#X(T1) 
113 CONTINUE 
IF (C .EQ. 0) GO TO 1L5 
X(L) = XCL)41 
IF (€X€1) «GT. V2) RETURN 
DO 114 Tl = 2- KL 
XCIL) = XL) 
114 CONTINUE 
SUM = X€L)*K1 
VL = V-SUM 
115 SUM = V-SUM 
IF (SUM LT. X11)) GO TO 110 


XCK) = SUM 
CHECK TO SEE IF THE K-TUPLE IS A CYCLIC PERMUTATION OF 
ANY PREVIOUSLY GENERATED K-TUPLES. IF IT 1S, GENERATE THE 
NEXT CANDIDATE, OTHERWISE, CALL THE SUBROUTINE *PROCES* TO 
PROCESS THE K-TUPLE BEFORE GENERATING THE NEXT ONE. 
120 00 122 I = 2,.K 
IF {(X(1) .GT. X€1)) GO TO 122 
IF (XC1) LT. X€1)) GO TO LO 
Il = J4l 
DO 121 12 = 24 K 
16 (11 «GT. K) TL = Tt-K 
IF (XCUL) 2GT. x€12)) GO TA 122 
IF (X€I2) .LT. X€12)) GU TO 110 
Il = 11#l 
12k CONTENUE 
GO TO 130 


t22 CONTINUE 
130 CALL PROCES (X%, K) 
GO TO 110 
END 
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ALGORITHM 404 

COMPLEX GAMMA FUNCTION [814] 

C. W. Lucas Jr.* anp C. W. Terriny (Reed. 13 Feb. 1970 
and 19 June 1970) 

Physics Department, College of William and Mary in 
Virginia, Williamsburg, VA 23185 
*William and Mary Predoctoral Fellow. This work was partly 


supported by the National Aeronautics and Space Agency, 
Contract NGL 47-006008. 


KEY WORDS AND PHRASES: gamma function, poles of 
gamma function, Stirling’s asymptotic series, recursion formula, 
reflection formula 

Ck CATEGORIES: 5.12 


DESCRIPTION: 


CGAMMA evaluates in single precision the gamma function 
for complex arguments. The method of evaluation is similar to the 
one employed by A. M.S. Filho and G. Schwachheim in evaluating 
the gamma function with arbitrary precision for real arguments 
(1]. First: the real part of the argument of the gamma function is 
increased by some integer M, if necessary, so that Stirling’s 
asymptotic series for the logarithm of the gamma function may 
be used with high precision and a small number of terms. Then the 
recursion formula for the gamma function 


NZ) =T4@ + 1/4 


is used to step down to the original gamma function. 

The conditions on the value of T = Z + M used in Stirling’s 
asymptotic series are: 

1. Real(T) > 10 

2. Arg(T) = arctan(Imaginary(T)/Real(T)) < 7/4 

This second condition ensures that the error incurred in using 
Stirling’s asymptotic series with a finite number of terms is less 
than the value of the next term in the series [2]. 

The only condition on the argument Z is that it must not be 
too close to a pole of the gamma function, i.e. Z = 0, —1, —2,--- 
A rough:empirical relation was found between the number of 
significant figures obtained by Stirling’s asymptotic series and 
the distance 6 in the complex plane from Z to the nearest pole by 
approaching the poles at 0 and —1 from several directions. If 6 
= 10-" (n an integer > 3) this relation is (minimum number of 
significant figures) = 7 — n. With 6 = 10-4, for instance, Stirling’s 
asymptotic series gives three or more significant figures depend- 
ing on the direction of Z from the pole. The upper limit on the 
size of Z for which CGAMMA will work is a function of the com- 
puter system. For the IBM 360 system where the largest size 
number that can be handled is about 10% the upper limit for real 
Z is about +57, for Z on the line Imaginary (Z) = +Real(Z) it is 
(63 + 687), for Real(Z) > 0 and (—382 + 327) for Real(Z) < 0, 
and for Z on the imaginary axis it is +1077. 

CGAMMA has been tested in several ways. The reflection 
formula 


ee 
AG — 2) 3 
as ) sin(nZ) 
and the relation 


Tin + 1) = n! (mn integer) 
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have been employed as checks. Also log(gamma(Z)) has been 
compared with tabulated valued in reference [2] for a number of 
values of Z. These tests lead us to conclude that CGAMMA gives 
four to five significant figures for Z outside disks of radius 6 = 
10-3 centered on the poles. If the subroutine is written in. double 
precision, we have found that about eight more significant figures 
will be obtained everywhere for an IBM 360 system, and near the 
poles 


(minimum number of significant figures) = 15 — n 


where 6 = 10-”. The range of the subroutine remains the same. 
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ALGORITHM : 


[Warning. System dependent constants are used in assigning 
values to IOUT, PI, TOL, SUM—L.D.F.] 


FUNCTION CGAMMA(Z) 

COMPLEX ZyZMeTs TT2SUMy TERM, DEN+CGAMMA SPI SA 
DIMENSION C(12) 

LOGICAL REFLEK 

C SET IOUT FOR PROPER OUTPUT CHANNEL OF COMPUTER SYSTEM FOR 

C ERROR MESSAGES 
ToUuT = 3 
PI] = (3.141593,0.0) 

X = RFAL(Z) 
Y = AIMAG(Z) 

C TOL = LIMIT OF PRECISION OF COMPUTER SYSTEM IN SINGLE PRECISION 
TOL = 1.0F-7 ; 
REFLEK = «TRUE. 

C DETERMINE WHETHER Z 1S TON CLOSE TO A POLE 

C CHECK WHETHER TON CLOSE TO ORIGIN 

TF(X.GE-TOL) GN TO 20 
FIND THE NEAREST POLE AND COMPUTE DESTANCE TO 1T 
XNIST = X-INT(X-.5) 
2M = CMPLX(XDIST+Y) 
IF (CABS(Z2M).GE.TGL) GN TO 10 
IF Z IS TOO CLOSE TO A POLE» PRENT ERROR MESSAGE AND RETURN 
WITH CGAMMA = (1.E7,0.0F0O) 
WRITFCITOUT,900) 2 
CGAMMA = (1.467,0.E0) 
RETURN 
FOR REAL(Z) NEGATIVE EMPLOY THE REFLECTION FORMULA 
GAMMA(Z) = PI/(SIN(PIT*Z)*GAMMA(1~-2)) 
AND COMPUTE GAMMA(1=Z). NOTE REFLEK IS A TAG TO INDICATE THAT 
THIS RELATION MUST BE USED LATER. 
0 IF(X.GE.0.0) GO TO 20 
REFLEK = FALSE. 
Z = (1.0,0.0)-2 
X = 1.0-X 


a 


aa 


ePanan 


Y - 
C IF Z 1S NOT TOO CLOSE TO A POLE, MAKE REAL(Z)>10 AND ARG(Z)<PI/4 
20 M = 0 : 
40 IF(X»GE.~LO-) GO TO 50 

X =X + 1.0 ; 


oT 

G 
50 I S(Y).LT.X) GO TO 60 

x 1.0 

M 1 


60 LX(X*Y) 


+ 
+ 
50 
pP 
xT 
DEN = T : 
C COFFFICIENTS IN STERLING'S APPROXIMATION FOR LN(GAMMA(T)) 
C{1l) = 12/12. 
C(2) = -1./360. 
C(3) = 12/1260. 
C(4) = -1./1680. 
C(5) = 12/1188. 
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C(6) = ~691./360360. 
C(7) = 1./156. 
C(8) = -3617./122400. 
C(9) = 43867./244188. 
C(10) = -174611./125400. 
C(ll) = 77683./5796, 
SUM = (T= (654000) )¥*CLNGCTI-~THCMPLX (25 ¥*ALNG(26*3014159)50.0) 
J=1 
70 TERM = C(J)/DEN 


C TEST REAL AND IMAGINARY PARTS OF LN(GAMMA(Z)) SEPARATELY FOR 

C CONVERGENCE. IF 2 IS RFAL SKIP IMAGINARY PART OF CHECK. 
“FCABS (REAL (TERM) /REAL (SHUM) )-GE*TOL) GO TN 80 
TFCY.FQ.0.0) GN TO 100 
TFCABS(AITMAG( TERM) /AIMAG(SUM))-LT.TOL) GO TO 100 


80 SUM = SUM + TFRM 
J=z=J+ 1 
DEN = DEN*TT 


C TEST FOR NONCONVERGENCE 
TF(JeFO.12) GN TO 90 


GO TO 70 . 
C STIRLING'S SERIES NID NOT CONVERGE. PRINT ERROR MESSAGE AND 
C PROCEDF. 
90 WRITECTOUT,9I0O) 2 


C RECURSION RELATION USED TO ORTAIN LN(GAMMA(Z))} 
C LN(GAMMA(2)) = LN(GAMMA(Z4+M)/(Z*(Z+1) ¥o ee *(Z24M-1))) 


Cc LNOGAMMA (Z+M)-LN(Z)-LNGZ+1)-0 6 e-LN(Z+M-1) 


100, TEE MLFQ.9) GN TO 120 
NM 110 J = 14M 
A = CMPLX(1*1Le-1e70.0) 
L110 SUM = SUM=CLOGIZ 4A) 
C CHECK TO SEF TF REFLECTION FORMULA SHOULD BE USED 
120 TE(RFEFLEK} GA TO 130 
SUM = CLOG(PI/CSIN(PT*2Z))-SUM 
Z = (1605060) -2 
130  CGAMMA = CFEXP(SUM) 
RETURN 
900 FORMAT LX 214674 10X¢49HARGUMENT OF GAMMA FUNCTION IS TAN CLASF TO 
1 A POLE) 
910 FURMAT(44H ERROR = STIRLING'S SE® TES HAS NOT CONVFRGFD/14Xy4HZ = ¢ 
17F 14.7) 
FN 


Certification and Remark on Algorithm 404 [S14] 
Complex Gamma Function [C.W. Lucas Jr. and C.W. 
Terril, Comm. ACM 14 (Jan. 1971), 48] 


G. Andrejkova and J. Vinar, Computing Center, 
Safarik University, Kosice, Czechoslovakia 


The following changes were made in the algorithm: 
a. The function subroutine heading was changed to read 


COMPLEX FUNCTION CGAMMA(Z) 


in accordance with the standard. 

b. The convergence tests following statement number 70 involve 
the computation of the quantity REAL(TERM)/REAL(SUM). 
This can lead to overflow if Z is real and near to a pole. For these 
reasons the two statements were replaced by 


IF (ABS(REAL(TERM)) .GE. TOL+*ABS(REAL(SUM))) GO TO 
80 


and 


IF (ABS(AIMAG(TERM)) .GE. TOL*ABS(AIMAG(SUM))) GO 
TO 100 


c. For similar reasons the statement 

SUM = CLOG(PI/CSIN(PI*Z)) -—SUM 

was replaced by 

SUM = CLOG(PI)—CLOG(CSIN(PI*Z)) —SUM 


With these modifications the algorithm was translated on MINSK 
22M using the FEL Fortran compiler (with seven significant digits 
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in single precision and 15 in double precision) and ran satisfactorily. 
The following tests were performed: 

a. The logarithms of CGAMMA(Z) for z = x+iy with x = 1.0 
(0.1)10.0 and y = 0.0(0.1)3.0 were checked against the values given 
in {1]. An overall accuracy of five to six digits was observed. The 
imaginary part frequently had one more accurate digit than the real 
part. 

b. The behavior in the vicinity of poles was tested by computing 
the values of CGAMMA(Z) in eight evenly spaced points on circles 
of decreasing diameter. The value of 1.E—7 for the minimum diam- 
eter was found adequate. 

c. The values of CGAMMA(Z) were computed for z = x++/y with 


v = 0.0(1.0)23.0, y = 0.0 
x = 0.0, y = 0.0(1.0)26.0 
x = y = 0.0(1.0)25.0 
x = ~y = 0.0(1.0)25.0 
—x = y = 0.0(1.0)12.0 
6. —x = —y = 0.0(1.0)12.0 
in all cases the final value is the last for which the program did not 


run into overflow or, in the last two cases, try to take a logarithm 
of too small a number. 


NL eee an ae 
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KEY WORDS AND PHRASES: eigenvalues, matrix roots, pen 
cil roots 
CR CATEGORIES: 5.1, 5.3 


procedure PENCIL(A, B,m,n, LAMDA, Sp, Par, Tol); 
value A,B; realarray A,B, LAMDA; integerm,n, Sp, Par; 
real Tol; 
comment PENCIL finds the generalized eigenvalues LAMDA 

which solve x(A —\B) = Oand (A—\AB)y = 0 and simultaneously 
reduce the rank of (A—\B), where A and B are m by n matrices, 
see [1, 3, 4]. PENCIL converts the m by n problem of finding the 
rank-reducing numbers of (A—)B) into an ordinary 7 by r 
eigenvalue problem by a sequence of elementary transforma- 
tions. The theory is developed in [3] and [4]. These techniques 
are to be thought of as a combinatorial solution to an unsolved 
general problem. Our techniques may be numerically unsound 
for ill-eonditioned problems. There are at most k = min(m, n) 
such generalized eigenvalues. Sp is the number of generalized 
eigenvalues found. The real parts of the roots are stored in 
LAMDA (Il, j), and the imaginary parts in LAMDA (2, 7), j = 
1...Sp. LAMDA is declared external to this procedure and 
should be dimensioned [1:2, 1:k]. The procedure sets the param- 
eter Par: Par = 0 indicates there are no roots, otherwise Par = 
+1. The tolerance value 7J'ol governs the accuracy of the pivot- 
ing routine used in the procedure REDUCE. REDUCE is a pro- 
cedure applied to a matrix X of rank r to find matrices Pl and 


P2 so that 
I, 0 
PIXXX P2= | | 
0 O 


The input parameters of PENCIL must be A, B, m, n, and Tol. 
The following supplementary procedures are required: RE- 
DUCE, SWAP, Matmul, EIG. The purpose of each of these pro- 
cedures is explained in the head comment of each. A routine 
for finding eigenvalues of a square matrix should be supplied by 
the user to be called from procedure EIG; 

comment Examples. We show several examples of the general- 
ized eigenvalue problem and how the procedure PENCIL pro- 
cessed these examples for input into the user-supplied eigen- 


A 
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value routine, here named HIGENVALUES, which is called in 
PENCIL by EIG. The format for the examples is (a) the original 
A and B matrices are shown, (b) the derived A, matrix (to four 
digits) whose cigenvalues are the rank-reducing numbers of 
(A —\B) which is input to HIG and then to EIGENVALUES is 
shown, and (c) any pertinent comments about that example are 
made. 


IO 2B es aes Aes ne | 
ae a ie a | 14 1-1 1 
=/ 3 1 11 -1|)Be=]-1 #1 146-1 1 
12 19 1 2-1 -1 12 1 
1 1-1 1 «15 ~~ 1 1 =#21-1 «0 


7220 © .0248 0871 .2308 —.1426 

0160 .8663 .1882 0636 0015 

A; = 0781  .2339 7967 =— .0357 .2016 
3107 =.0554 +=—.0699 8551 —.0791 

—.1792 .0261 1447, —.0173 =: 1.4020 


This example contains no complications since both A and B are 
square of full rank. A; is effectively B-!A. The example is the 
first shown in [2, Sec. 7]. 


23 1 2 A 
A=1]3 5 2 B= )|2 2 2 (2) 
3.4 2 2-1 1 


This example, from unpublished notes by J. H. Wilkinson, calls 
PENCIL recursively. A and B above are transformed to the 
one by one matrices 


A’ = [.23077] B’ = [1.1538] 


for re-entry to PENCIL. The final output from PENCIL is the 
derived matrix Ai = [.2]. 


2 3 2 1 0 0 
A=/]3 5 2 B=]0 1 0 (3) 
22 2 00 0 
Here m—r—q=n-—r—q=0so that there were no recursive 
01] 
calls of PENCIL. On exit from PENCIL Az = ; 3 | 
2121 1 -1 11 
A= B= | (4) 
212 1 1-1 11 
This system has no roots which reduce the rank (A—\B). The 


failure is an example of Theorem 2.3(a) of [4] when both £12 
and £21 exist. 


A=15 9 Be i | (5) 
11 rE 


This system has no roots which reduce the rank (A—\B). The 


failure is an example of Theorem 2.3(b) of [4] where both #12 and 


E21 exist. 
bie oe -1-1 0 0 1 1 | 
—11361-1 a 0 1-1 1-1 O 
a 01160 0 oe 1 0 ee ae, (6) 
12481 1 9-1 8-3 1-2 


This system has no roots which reduce the rank of (A—)B), but 
that fact is not discovered by PENCIL until a recursive call 
is made on 


A’ = [.2353 .2353 0] B’ = [.4706 1 —.3529]. 
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The failure is an example of Theorem 2.3(a) of [4] when £21 is 


degenerate. 
=i <1. 0-4 -—1 0 1 2 
0 112 SS ns a) | 
2 3-1 4 Ga a. 3 
Am | 4 668 BP iy: ca Aaa Go (7) 
1101 1-1 0 1 
at <1) 1 1 0-1 ~2 


This system has no roots which reduce the rank of (A—\AB), but 
that fact is not discovered by PENCIL until a recursive call is 


made on 
Sits BY gs ee . 
.2858 1 


Except for the entry in the fifth row and third column of B, 
this example is the transpose of example (6). The failure is an 
example of Theorem 2.3(b) of [4] when E12 is degenerate. 


| Pee 
Bs 4 B =|? 


43 5 0 100 
We ae “10 000 
0000 0000 (8) 


2 
an] d 
3 5 


Examples (8), (9), and (10) all reduce to the same derived eigen- 
problem. Each, however, tests paths to different exits from 
PENCIL. Heres + t = 0. 


0 0 100 0 
0 0 B=]0 10.0 
2 0 00 0 0. () 


See comment at example (8). Here s = 0, i.e. E21 is degenerate 
and ¢ is found to be zero. 


od fees 
_|3 5 0 _{0 10 
ee as 

000 00 0. (10) 


2 
3 5]. 


See comment at example (8). Here ¢ = 0, i.e. E12 is degenerate 
and s is found to be zero. 


2 —1 1 23 1 
a-[3 2 | [8 5 | 
2 -1 1 3.4 2 
3 0 1 
A; =] ~-7 2 -l 
4 —2 0 


In all other examples rank(A) > rank(B). Here rank(A) < 
rank(B). The derived eigenproblem has one nonzero root, 5, 
and two zero roots. Note that this problem is the same as ex- 
ample (2) except that A and B are interchanged. Ordinarily in- 
terchanging the roles of A and B yields the reciprocals of the 
eigenvalues. When one problem has zero eigenvalues, the inter- 
changed problem has no corresponding reciprocal eigenvalue. 
Thus in example (2) we find only one solution, the reciprocal of 5; 
comment Here we relate our work to that reported in the litera- 
ture. Gantmacher [1, Chap. XII] has shown that every m X n 
matrix of the form (A—dB) can be transformed by elementary 
row and column operations to a canonical form. (We call this 
form the Gantmacher Normal Form, G.N.F.) That is, there exist 
nonsingular m X m matrix P and n X n matrix Q so that P(A— 
B)Q has a quasi-diagonal form which is the direct sum of as 


Cow 


(11) 
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many as (p+q+r+2) blocks as follows: 
g 
A{O), Leg a1 asia Legty ’ 
Les BORA Linegs Nv aes Ns, ’ eae 


g 
All other entries in the G.N.F. are zero. The block h[0] has h 
rows and g columns and all its elements are zero. The block 
Le, +; has €94: rows and (e,,; + 1) columns with structure 


e+l 


The block Daca has (ni4;+1) rows and m,; columns with struc” 
ture 


The final block (J—)J) is an ordinary square eigensystem in Jor- 
dan normal form. 
For a given matrix (A—)B) let 


. {mr ‘) 
w= maxje,m, entier 
t 2 y, 


where e;, 7;, u: are defined from the G.N.F. Then PENCIL ap- 
plied to (A—)B) will require no more than w recursive calls to 
derive the reduced eigenproblem. We have run many examples 
with various combinations of L, L7™, and N blocks to test our 
procedures. Since the L, L™, and N blocks contribute no solu- 
tions, these examples are uninteresting to reproduce here. If the 
G.N.F. of the original problem contains only L, L™, and N 
blocks, there are no solutions. If the G.N.F. contains (J—AJ) as 
well, the output of our procedures for EJG is the matrix whose 
Jordan normal form is J. 
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begin 
integer q,7, 8, t, 7, j,k, lime, limr; 
begin 
array Pl{l:m, 1:m], P2{l:n, 1:n]; 
Par := +1; Sp :=0; 
k := ifm <n then melsen; 
for 7 := 1 step 1 until 2 do 
‘for j := 1 step 1 until k do LAMDA[i, j] := 0; 
REDUCE (P1, B, P2,m,n,r, Tol); 
ifr = 0 then 
begin Par :=0; goto Endp; end; 
Matmul(P1, A, A,m,m,n); Matmul(A, P2, A, m,n, n); 
Matmul(P1, B, B,m,m,n); Matmul(B, P2, B, m,n, n); 
comment NOTE:; The last two matrix multiplications to- 
gether, by definition of P1 and P2, change B to 


I, 0 
0 01° 


To avoid the multiplications at this point, an r by r identity 
matrix B can be generated directly. Note that r is deter- 
mined by the immediately preceding call of REDUCE; 


end; 
comment At this stage 
A:=P1XAX P2, B:=P1XBX P2= : : , 


B is ‘“‘reduced’’ and the corresponding operations have been 
performed on A; 
if (m—r) = 0A (m—r)=0) then 
begin 
EIG(LAMDA, A,r); goto Endp; 
comment Calculations for examples (1) and (11) exit here, 
and for example (2) cease here after one recursive call. See 
discussion in the ‘“Examples’’ comment; 


end; 

lime := if (n—r) = 0 then 1 else n — 7; 
limr := if (m—r) = 0 then 1 else m — 7; 
begin 


array C12(l:7r, l:limc], C21[1:limr, 1:r], C22[(L:limr, 1:limc], 
Pl (l:limr, lilimr], P2[1:lime, 1:lime]; 
if (n—r) ¥ 0 then 
begin 
for 2 := 1 step 1 until r do 
for j :=7r+1 step 1 until n do 
C12[t, j—7] := Alt, 7]; 


end; 

if (m—r) ~ 0 then 

begin 
for? := r-+1 step 1 until m do 
begin 


for j := 1 step 1 until r do C21[i—r, j] := Alt, J]; 
if (n—r) ~ 0 then 
begin 
forj :=7r+1step1 until n do C22[i—r, j—r] := Ali, j]; 
end 
end 
end; 
comment A has now been partitioned and the parts are 
referred to below as 


ron-—?r 
A C12 |r 
C21 C22 In —1; 


if ((n—r) = 0 V (m—r)=0) then 

begin g :=0; goto Mul; end; 

REDUCE(P1, C22, P2, limr, lime, q, Tol); 

Matmul(C12, P2, C12, r, lime, lime); Matmul(P1, C21, C21, 
limr, limr, r); 
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Matmul(P1, C22, C22, limr, limr, imc); Matmul(C22, P2, C22, 
limr, lime, lime); 

comment See ‘‘Note’’ comment above to generate C22 
directly without matrix multiplications; 

comment (C22 has been ‘‘reduced’’ and the requisite opera- 
tions have been performed on C12 and C21. That is | 


C22 := Pl X (22% P2 = By : 


C12 := Pl X C12, and C21 := C21 & P2. Thus A now looks 


like ° 
7 n—- 7 
rf A, C12 | 
- I, 0 
c21 . hb — 1; 
Si 0 


if g = 0 then go to Mul; 
begin 
array D2i{lir, 1:1); 
Matmul (C12, C21, D21, 7, g, 1); 
for 7 := 1 step 1 until 7 do 
for j := 1 step 1 until r do 
Alt, j] := Afé, j] — D2i[Z, 7); 
end; 
Dstep: 
if ((m—r—q)=0 A (n—r—q)=0) then 
begin 
EIG(LAMDA,A,71r); goto Endp; 
comment Calculations for example (3) cease here. See 
discussion in the ‘‘Examples’’ comment; 
end; 
Mul: 
limr := if (nm—r—q) = 0 then 1 else m —- r — q; 
lime := if (n—r—q) = 0 then 1 else n — r — q; 
begin 
array E12(1:r, l:limc], E21[l:limr, 1:7]; 
if (n—r—q) ~ 0 then 
begin 
fori := 1 step 1 until r do 
for j := q+ 1 step 1 until n — rdo 
Eli, j—g] := C12fé, J); 
end; 
if (m—r—q) ~ 0 then 
begin 
for 2 := ¢ +1 step 1 until m — rdo 
for j := 1 step 1 until r do 
E2i\[i—q, j] := C2llt, 7}; 
end; 
comment The columns of C12 above J, and the rows of C21 
to the left of J, are annihilated. The remaining submatrices 
are now called £12 and #21, respectively. 


B 0 °| 
A=1]0 I, 0 45 
E210 O 
begin 


array Pl, PA[lir, l:r], P2[l:lime, 1:lime], 
P3[lilimr, 1l:limr); 

if (n—r—q) ~ 0 then REDUCE (P1, E12, P2, r, lime, t, 
Tol 

else); ¢ := 0; 

if (m—r—q) #0 
then REDUCE (P3, E21, P4, imr, r,s, Tol); 


else s := 0; 
if ((r=t) V (r=s)) then 
begin 


comment Set parameter for no solutions; 
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par :=0; goto Endp; 


comment Calculations for examples (4-7) (after one 


recursive call for (6) and (7)) cease here. See discus- 
sion in the ‘‘Examples’’ comment; 
end; 
if (s+?) = 0 then 
begin 
EIG(LAMDA, A, 7); go to Endp; 
comment Calculations for examples (8-10) cease here. 
See discussion in ‘‘Examples’’ comment; 
end; 
if (n—r—q) ~0 then 
begin 
Matmul(P1, 4, A, 7,7, 7); Matmul(P1, B, B, 7, 7, 1); 
end; . 
if (m—r—q) # 0 then 
begin 
Matmul(A, P4, A,r, 7, 7); Matmul(B, P4, B, r, r, 7); 
comment £12 and £21 have been ‘‘reduced”’ and the 
requisite operations have been performed on B. That 
is 


El2 := Pl X E12 X P2= E : 


E21 := P3 X E21 X P4 = B a 
and B, := Pl X I, X P4. Thus A looks like 


r q t(n—r—q—-t) 
ss | 
A, 0 r 
0 0 
0 lq 0 q 
I, 0 8 
0 0 
¢ 4 
: (m—r—q—s) 
and B looks like 
B, 0 0 0 r 
0 0 O 0 q 
0 0 0 0 8 
0 0 0 0 (m—r-q—s) 
end 
end 
end 
end; 


comment Thecolumns of A, above J, and the rows of A; to the 
left of J, are annihilated, and the remaining (r—s) X (r—t) 
submatrix is called G. The corresponding r — s rows and r — t 
columns of B are called H. The following statements build 
the matrices G and H; 
begin 
array G, H(l:r—t, 1:r—s]; 
for? := t+ 1 step 1 until r do 
for j := s +1 step 1 until r do 
begin 
Gli—t,j —s]):= Afi, jl]; H{i—t,j—s] := Blt, j] 
end; 
PENCIL(G, H, r—t, r—s, LAMDA, Sp, Par, Tol); 
end; 
Endp: 
end PENCIL; 
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procedure REDUCE(I1, X, I2,m,n, dex, Tol); value X; 
real array X, I1, 12; real Tol; integer m,n, dex 

comment REDUCE applied to an m by n matrix X of rank dex 
finds an m by m matrix J1 and ann by n matrix J2 such that 


Taez | 

0 05" 
The rank is found by REDUCE and returned in dex. Gaussian 
elimination with complete pivoting is used until the (dex + 1)st 
pivot element would be less than Tol, a parameter to be supplied 
by the user to PENCIL. This procedure is supplied to make 
the PENCIL routine complete. Users concerned with increased 
numerical accuracy should write their own routines paying 
attention to multiple precision, and ill-conditioning. Note that 
X is called by value and is not altered. When preserving X is 
not important, PENCIL can be made to run faster by elimi- 
nating value X in REDUCE and the matrix multiplications in 
PENCIL that directly follow the calls to REDUCE; 

begin 
integer 7,7, k, 1, lum, p,q; real div; 
real array CVEC, TEMP{l1:n, 1:n], I3[1:m, 1:m}; 
integer array rvec({1:m], mvec(l:n]; 
ifm > n then lim := nelse lim := m; 


NXX xX 12= 


dex := 0; 
for 2 := 1 step 1 until m do 
begin 


Tift, 7] := 1; rvec{t] := 12; 

for 7 := 1 step 1 until m do if 7 # j then /1[7, 7] := 0; 
end; 
for 7 := 1 step 1 until n do 


begin 
mvec{t] := 1; 
for j := 1 step 1 until n do 
begin 


if 7 ~ j then /2[t, j] := TEMP{i, j] := CVEIC[t, j] :=0 
else [2(7, 7] := TEMP(i,j] := CVEC{, 7] := 
end 
end; 


Rowop: 


forz := 1 step 1 untilm do 
for j := 1 step l-until m do 
if i = j then J3[2, j] := 1 else J3[7, 7] := 0; 
dex := dex +1; 
if dex < lim then 
begin 
SEARCH (X, dex, k,l, m, n, rvec, mvec); 
comment X(k, 1) is the pivot element; 
ISWAP (rvec{dex}, rvec{k]); ISWAP(mvec{dex], mvec{l]);, 
for 7 := 1 step 1 untiln do SWAP(CVEC{i, dex], CVEC(i l)); 
if abs(X[rvec[dex], mvec{dex}]) < Tol then 
begin dex := 0; go to Endr end; 
for i := 1 step 1 until m doif i # dez then 
begin 
div := X[rvec[t], mvec[dex]]/X(rvec[dex], mvec[dex]); 
I(t, k] := —div; 
for j := dex step 1 until n do 
X [rvec{t], mvec[j]}] 
:= X[rvec(t], mvec{j]] — (divx X[rvec[dex], mvecl[j]]); 
end; 
I8ldex, k] := 1.0/X[rvec[dex}, mvec{dezx]]; 
for 7 := dex step 1 until n do 
if 7 ~ dex then 
X (rvec(dex], mvec[j]] 
:= X(rvec(dex], mvec{j]]/X[rvec{dex], mvec[dex]]; 
X[rvec|dex], mvec{dex]] := 1.0; 
if k * dex then SWAP(I3[dex, dex], I3[k, dex)); 
Matmul(J8, 21, I1, m, m, m); 
for 1 := dex + 1 step 1 until m do 
for j := dex + 1 step 1 until n do 
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if abs (X[rvec[t], mvec[j]]) > Tol then go to Rowop; 
end; 
ifm <nV dex < lim then 
begin 
integer p,q; real mul; 
for 7 := 1 step 1 until dex do 
for 7 := dex + | step 1 until n do 
if abs(X[rvec{z], mveclj]]) > Tol then 
begin 
mul := TEMP{i, j] := —X(rvec[t], mvec[j]]; 
for p := 1 step 1 until m do 
X{rvec[p], mvec[j]] 
:= X[rvec[p], mvec(j\] + (mulXX[rvec[p], mvec{t]]}); 
Matmul(I2, TEMP, 12, n,n, n); 
for p := 1 step 1 until n do 
for g := 1 step 1 until n do 
if p = q then TEMP)(p, q] := 0 
else TEMP[p, q] := 1; 


end 
end; 
Matmul(CVEC, I2, 12, n, n, n); 
Endr: 


end REDUCE; 
procedure SWAP(r,s); realr, s; 
comment SWAP interchanges real variables r and s; 
begin 
real temp; 
temp := Tr; ri= 8; & i= lemp; 
end SWAP; 
procedure [SWAP(r,s); integer 1, s; 
comment JSWAP interchanges integer variables r and s; 
begin 
integer lemp; 
temp := 7; r:=s8; s:= lemp; 
end ISWAP; 
procedure Matmul(X, Y, Z, u,v, w); 
real array X, Y, Z; integer u, v, wv; 
comment Matmul causes the matrix product X times Y to be 
stored in matrix Z. X is u by v, Y is v by wv, and Z is u by w. 
For improved accuracy inner products should be accumulated 
using double precision arithmetic; 
begin 
integer 7,7, k; real array 7'EMP{(l1:u, 1:w]; 
for 7 := | step 1 until u do 
for j := 1 step 1! until w do 
begin 
TEMP(i, 7] :=0 
fork := 1 step 1 until v do 
TEM P(t, j] := TEMP{i, j]} + Xi, k} X Yk, J]; 
end; 
for 7 := 1 step 1 until u do 
for 7 := 1 step 1 until w do Z[i, j] := TEMP{i, j|; 
end Matmul; 
procedure SEARCH (Y, Lim, k, l, m, n, vect, vecj); 
array Y; integer Lim, k,l, m,n; integer array vect, vecj; 
comment SEARCH finds the largest element in the m by n 
array Y starting at Y[Lim, Lim], searching the remaining sub- 
array. Vectors vect and vecj record the row and column swaps 
which have occurred previous to the call of SEARCH. k and l 
are the row and column indices, respectively, for the largest 
element in the array searched. 
begin integer 7, 7; 


k:=l:= Lim; 

for 7 := Lim step 1 until m do 
for j := Lim step 1 until n do 
begin 


if abs(Y[veci[i], vecj[j]]) > abs(Y¥(veci(k], vecj{l]]) then 
begin k :=1i; l:=j; end 
end 
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end SEARCH; 
procedure EIG(LAMDA, X,7r); 
real array LAMDA, X; integer r; 
comment EIG calls a procedure which finds the eigenvalues of 
the r by r matrix X and stores them in the 2 by 7 matrix LAMDA, 
real parts in LAMDA[I, j], imaginary parts in LAMDA{2, 7]; 
begin 
EIGENVALUES (X, LAMDA, 1); 
Sp := 7; 
end E/1G; 


Remark on Algorithm 405 [F2] 

Roots of Matrix Pencils: The Generalized Eigenvalue 
Problem [A.M. Dell, R.L. Weil, and G.L. Thompson, 
Comm. ACM 14, (Feb. 1971), 113-117] 


Richard M. Heiberger [Recd. 19 May 1971, 29 July 
1971, and 8 Sept. 1971] 
Department of Statistics, Harvard University* 


Key Words and Phrases: eigenvalues, matrix roots, pencil roots 
CR Categories: 5.1, 5.3 


Algorithm 405 calculates rank-reducing numbers which are 
similar to, but not identical to, generalized eigenvalues. An eigen- 
value of A with respect to B, as defined in this Remark, satisfies 
the equations 


xT(A — XB) = 0, (A — AB)y = 0 (1) 


for appropriately dimensioned vectors x and y. A rank-reducing 
number Xo, as defined by Thompson and Weil [3], further satisfies 


Rank (A — 0B) < Rank (A — AB) (2) 


for some value \ # Xo. The distinction is meaningful only if the 
matrices A and B are of less than full rank. 

The definition (1) is the simplest generalization of the ordinary 
eigenvalue problem in that the only new concept is the replacement 
of an identity matrix with an arbitrary matrix B. This form of the 
problem arises in many physical contexts, usually with A and B 
square symmetric, and B positive definite (see [4] for examples). 
Dell, Weil, and Thompson find that in their context the additional 
condition (2) is desirable since rank-reducing numbers are always 
discrete, finite in number, and related to a Jordan-like canonical 
form. 

In order to insure that all eigenvalues, as defined here by (1), 
are discrete, one further condition than given in Algorithm 405 
must be tested. It is necessary that 


Rank (A — dB) = min (m, n) (3) 


for at least one value of \. In the special case that m = n (square 
matrices) the condition (3) is equivalent to 


det (A — \B) ¥ 0 (4) 


for at least one value of \. When this condition is violated, the 
spectrum of eigenvalues is continuous; that is, for every complex 
number \ there exist vectors x and y such that (1) is satisfied. 
Discrete rank-reducing numbers may exist even when the rank 
condition (3) is violated. Example 8 accompanying Algorithm 405 
does not satisfy condition (3) and therefore does not have discrete 
eigenvalues although it does have discrete rank-reducing numbers. 

The procedure PENCIL is similar to the algorithm developed 
by Fix and Heiberger [1] for the generalized eigenvalue problem 
when A and Bare Hermitian matrices. We showed that the spectrum 


* This work was supported by the Cambridge Project through 
the Department of Statistics, Harvard University. Author’s current 
address; Department of Statistics, Iowa State University, Ames, 
IA 50010. 
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of Ax — Bx = 0 consists of stable and unstable eigenvalues, 
which undergo, respectively, small and large changes in response to 
small changes in A and B. We proved that our algorithm isolates 
and accurately computes the eigenspace associated with the stable 
eigenvalues. We did not attempt to extend our proof to non-Her- 
mitian and rectangular matrices, for which Algorithm 405 may 
also be used. Our proof explicitly does not apply to rank-reducing 
numbers unless the rank condition (3) is satisfied. Instead it sug- 
gests that the computed solution may be inaccurate, as the first 
example in [1] shows. We programmed in APL [2] and Fortran 
(unpublished). 

The following changes to Algorithm 405 will modify it to 
calculate either eigenvalues or rank-reducing numbers at the user’s 
option. The user of rank-reducing numbers will be warned if the 
rank condition (3) is not satisfied, and there may be numerical 
inaccuracy in his solution. 


Page 113, column 1. Replace procedure heading with: 
procedure PENCIL (A,B,m,n,LAMDA,Sp,Par,Tol,eigrrn) ; 


Page 113, column 1, preceding first comment insert: 
integer array eigrrn; 


Page 113, column 1, first comment. Replace the sentence: 

The input parameters of PENCIL must be A, B, m, n, and Tol. 
with the following: 

The input parameter eigrrn[1] is used to direct the program 
to calculate either eigenvalues or rank-reducing numbers. If eigrrn{1] 
= 0, then eigenvalues will be calculated. If eigrrn[1] = 1, then rank- 
reducing numbers will be calculated. The parameter eigrrn[2] 
must be set to 0 as an input parameter. As an output parameter 
eigrrn({2] indicates whether the rank condition (3) is satisfied. If 
eigrrn(2] = 0, the condition is satisfied. If eigrrn[2] = 1, the con- 
dition is violated. When the rank condition is violated and eigen- 
values are being calculated, the parameter Par is set to 0 indicating 
no roots, and the procedure is terminated. When the rank condition 
is violated and rank-reducing numbers are being calculated, the 
procedure continues calculations as at present, but the user is warned 
that there may be numerical inaccuracy in the solution. The input 
parameters of PENCIL must be A, B, m, n, Tol, eigrrn{1], and 
eigrrit{2]. 


Page 116, column 1, preceding line — 11. Insert: 

if ((a—r—q—t0) A (m—r—q—s+0)) then 

begin 

comment Set parameter for continuous spectrum; 
eigrrn(2] := 1; 

if eigrrn({1] = O then 
begin 
comment Set parameter for no solution; 

Par := 0; goto Endp; 
end; 
comment Beware of possible numerical inaccuracy; 
end; 

Page 116, column 1, line —4. Replace with: 
PENCIL(G,H,r—t,r—s,LAMDA,Sp, Par,Tol,eigrrn); 
There are several typographical errors. The following lines 

should read as given below. 


Page 115, column 2, lines —8 and—7: 
Tol) 
else ¢ := 0; 
Page 115, column 2, line —5: 
REDUCE(P3,E21,P4,limr,r,s,Tol) 
Page 116, column 1, line 1: 
Par:= 0; goto Endp; 


I would also suggest that the following value parts be added 
for more efficient execution. 
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Procedure PENCIL Procedure Matmul 
value 4,B,m,n,Tol; value u,v,w; 
Procedure REDUCE Procedure SEARCH 
value X,m,n,Tol; value Lim,m,n,veci, vec); 
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Algorithm 406 


Exact Solution of Linear 
Equations Using Residue 
Arithmetic [F4] 


Jo Ann Howell (Recd. 23 Mar. 1970 and 2 July 1970) 
The University of Texas at Austin, Center for Numerical 
Analysis, Austin, TX 78712 


Key Words and Phrases: residue arithmetic, symmetric residue, 
modulus, mixed-radix representation, symmetric mixed-radix 
representation, mixed-radix conversion, prime number, linear 
equations, Gaussian elimination, matrix inversion, determinant, 
adjoint matrix, ill-condition 

CR Categories: 3.15, 5.14 


Description 


Purpose. The subroutine EXACT solves the matrix equation 
AX = B for X, where A is an N by N integer matrix, B is an N by 
M integer matrix, and X is an N by M real matrix. Residue arithme- 
tic is used to obtain the exact solution, consisting of the rational 
components of X, i.e. det(A) and the elements of Y = A?4/B, and 
the rounded solution, computed as the quotient of the rational 
components and stored in the array X. The subroutine can be used 
to solve systems of linear algebraic equations, to invert matrices, 
and to compute determinants and adjoint matrices. 

Method. A methd similaor to the one described in [1, 2, and 3] 
is used to solve a system of linear algebraic equations AX = B, 
using residue arithmetic. However, since there are differences we 
shall describe them here. In [1] the concept of residue modulo m 
refers to the least nonnegative remainder of the integer x after divi- 
sion by m. The definition here, on the other hand, is preferable as a 
matter of computational convenience reflected by the definition of 
the FORTRAN MOD function. 

Definition. Given any integer x and any modulus m, if 


(i) r==x (mod m), 

(ii) |r| <m, and 

(iii) sgn(r) = sgn(x), 

then we write 

r=|Xx\|m 

and say r is a residue of x modulo m, 

It is easily shown that this residue is also unique. 

Since our definition of residue here differs from that in [1], we 
must point out that each of the theorems in [1], relative to the non- 
negative residue system, has an analog in the residue system defined 
here. However, in some of the analogous theorems it may be neces- 
sary to use the congruence symbol = in place of the equality 
symbol =. Thus, Algorithm I in {1} and [2] and Algorithm IT in 
[3] can be completely described using our definition of residue. 

We should point out that there are related discussions in [5], 
[6], and [7]. 

The subroutine EXACT uses Algorithm II to solve AX = B 
using the residue system described here. First, the following pre- 
liminary calculations are carried out by the program before solving 
the system of equations. (i) The number, /.S, of moduli required to 
obtain a solution is predicted by subroutine LOGBND, as described 
in [2]. The program computes 


N N MN 
BOUND = log (: be (S ai; TI IT jou |), (| bir | ¥ 0) 
i=1 jel l=n1 k=s} 


and IS chosen so that 


BOUND < SUMLOG 
= log(MM(1)) + -++ + log(MMUIS)). 
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where the MM(I) are the stored moduli. (ii) The elements of A and 
Bare reduced modulo MM (1), J = 1, -:- , IS. 
The subroutine SOLVE solves the residue Syston 
AX = B (mod MM(D), I = 1, ,1S 
for the residue I dercieeisan aes (1}) of d and the elements of Y, 


d~ {\d|uma),|d|ume,--+,|d| mucus} 

and . 

vis ~ (vis lay, | Yes lays + | Yas Laemcray}, 
where 

d = det(A) 

and 

Y = AaB, 


The computation is performed by means of Gaussian elimination 
for residue arithmetic [1] using the residue system described here. 
Then, the residue representations for d and the elements of Y are 
converted to their symmetric residue representations (see [3]), 
d~ {/d/uma), /d/mmoys+++ 5 /d/mmuus)} 

and 

vis ~ {/ is (uma), / is (Myst) / is /mcts)}- 

Next, subroutine MXRADX converts the symmetric residue 
representations for d and the elements of Y to their corresponding 
symmetric mixed-radix representations [3], 
d~ (81, Bo, -++, Brg) 
and 
Vig ~ (Ovijy ) Wijny 114, Aijrg) 

The conversion is accomplished by means of a mixed-radix con- 
version process described in [3]. 

From their symmetric mixed-radix representations, d and the 

elements of Y are directly obtainable, as follows: 


2 
d = 8, + &MM(1) + 63 [] MM(k) + --: 
k=1 
Is—2 Is—1 


+8rs-1 il MM({k) + 81s I MM(k) 
1 i 


and 
2 

Yip = iz, + aig MM(1) + ceijs Il MM(k) + -:- 
=1 


IS—2 Is—1 
+ aijr5 1 Il MM(k) + eiizg II MM(k). 
==] ish 


Since each of these quantities may overflow a fixed-point word, they 
are stored as ‘‘multilength’’ numbers. In other words, d and each of 
the elements of Y are stored in several words, with NDIGIT digits 
in each word. On return from EXACT, these multilength numbers 
are stored in MULTL, with the elements of Y (stored columnwise) 
in the first M*N rows of MULTL, and d in the (M*N + 1)th row of 
MULTL. The lowest order digits are in the first column, and the 
highest order digits are in column LCOUNT. Thus, the exact solu- 
tion of AX = B, consisting of the elements of Y (stored columnwise) 
and the determinant of A, may be printed out as follows (assuming 
NDIGIT < 7): 


WRITE (1, 10) | 
10 FORMAT(24H MULTILENGTH DIGITS OF Y/) 
MTN = M*N 
MNI = MTN-+1 
Li = LCOUNT+1 
DO 201 = 1, MTN 
20 WRITE (1, 30)(MULTLU, L1—J), 
30 FORMAT (1X, 1018) 
WRITE (1, 40). 
40 FORMAT (//17H DETERMINANT OF A/) 
WRITE (1, 30)(MULTL(MNI, L1—J), J=1, LCOUNT) 


J=1, LCOUNT) 


Program Call. Subroutine EXACT is completely self-contained 
(composed of eight subroutines EXACT, SOLVE, MXRADX, 
MLTLTH, CHECK, INVERS, RESIDU, and LOGBND), and the 
calling sequence, which has 22 parameters, is 
CALL EXACT (A, N, IN, B, M, IM, IMPIN, IMIN1, NDIGIT, 
KPRIME, NOPRIM, NO2, X, DET, IER, MULTL, LCOUNT, 
ATEMP, MM, RY, W,V) 

Communication to EXACT is solely through the parameter list 
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which is described in comments at the beginning of the subroutine 
EXACT. 

Cautions to User. 1. The user should test JER before attempting 
to print results. An error code of 1 may arise if 


(a) |det(A)| > Il KPRIME(I), 
I=] 


where r is the number of primes, KPRIME(J), for which 
det(A) # 0 (mod(K PRIME(J))), 


(b) max |yi;| > I] KPRIME(D, 
tj I=1 


where r is defined as in (a), 
(c) KPRIME(I) is not a prime (for some J), 
(d) KPRIME(I) = KPRIME(J), J #T. 
2. This algorithm is of limited use due to the fact that 4 and B must 
be integral, due to the limitations given in 1(a)—1(d) above, and due 
to the algorithm’s inherent slowness. It is not intended as a substi- 
tute for other well-established procedures for solving systems of 
linear algebraic equations. However, it may be useful in obtaining 
the exact solution of an ill-conditioned system of equations which 
has integral coefficients or a system which has rational coefficients 
which can be scaled to make it integral. In fact, as Knuth [8, p. 
256] states, this method is “substantially faster than anv other 
known method for obtaining exact solutions.” 

Test Results. Subroutine EXACT was tested on a cpc 6600 
computer on which the maximum size of integer variables which 
can be used in arithmetic operations is 48 bits (~14 digits). The 
maximum size of real variables is 48 bits with an 11-bit exponent. 
The results are summarized below. The following parameters were 
used as input for both test cases: 


IN = 10 NDIGIT = 7 
IM = 10 NOPRIM = 10 
IMPIN = 20 #£.NO2 = 20 
IMINI | = 101 
10000019 
10000079 
10000103 
10000121 
KPRIME =| 10000139 
10000141 
10000169 
10000189 
10000223 
10000229 
(i) Input to EXACT: 
N = 10 M=1 
10 9 8 7 6 5 4 3 2 10° 
998 76543 2 «1 
8 8 8 76543 2 1 
777765 43 2 «41 
*< 6 6666543 2 #1 
Ape 5 5555543 2 1 
444444432 1 
33333333 2 #1 
232. 2 2 2 2.2 A 
1131%1d1%1d1d1d421@é«21 
B= ey. 


Output from EXACT: 


—1.9999998E-+07 
1.9999998E-++-07 


oS 


—1.0000000E +00 
2 .0000000E +00 


DET = 1.0000000E-+00 
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MULTILENGTH DIGITS FOR Y 


—1 — 9999998 
1 9999998 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 ~1 
0 2 


MULTILENGTH DIGITS FOR DETERMINANT A 


0 1 


(ii) Input to EXACT{4]: 
N = 5 M= 5 


5 300 —2100 


4200 "sa | 


—60 —2400 18900 —40320 25200 


210 6300 —52920 117600 —75600 


52920 —35280 


A= 
—280 —6720 58800 —134400 88200 | 
126 2520 —22680 
B=]. 
Output from EXACT: 


1 .OOODDDD00000E +00 
5 .OO0000000000E — 01 
X = | 3.333333333333E—01 
2. SO0000000000E — 01 
2 .000000000000E — 01 


1 .000000000000E ++ 00 
3 .333333333333E—O1 
2. 5S00000000000E — 01 
2 .QO0000000000E — 01 
1 .666666666667E — 01 


1 .OODOD0000000E -++-00 
2 .000000000000E — 01 
1 .666666666667E — 01 
1 .428571428571E—01 
1 .250000000000E — 01 


DET = 5.3343360000E+10 
MULTILENGTH DIGITS FOR Y 


$334 3360000 
2667 1680000 
1778 1120000 
1333 5840000 


1066 8672000 
5334 3360000 


1778 1120000 
1333 5840000 
1066 8672000 
889 560000 
5334 3360000 
1333 5840000 
1066 8672000 
889 560000 
762 480000 
5334 3360000 
1066 8672000 
889 560000 
762 480000 
666 7920000 
5334 3360000 
889 560000 
762 480000 
666 7920000 
592 7040000 


1 .000000000000E +00 
2 . 500000000000E — 01 
2 .000000000000E — 01 
1 .666666666667E—01 
1.428571428571E—01 


1 .O00000000000E +00 
1 .666666666667E —01 
1.428571428571E—01 
1. 250000000000E — 01 
1.111111111111E—01 
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MULTILENGTH DIGITS FOR DETERMINANT A 
5334 3360000 
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Algorithm 


SUBROUTINE EXACTCAsNe INeBeMe IMe IMPIN, IMINI eNDIGIT» 
LKPRIMEyNOPRIMyNU2 eX eDET, LERe MUL TL ol COUNT pATEMP MMy 
2RYV eWeV) 

DIMENSICN ACINe IND e BLING IMD XC ING IMD SATEMPL ING IMPIND » 
LMULTLOEIMINL sNOPRIM) ¢MMCNOPRIM) CRYCIMINI), 
Z2KPRIMECNOPRIM) »WINO2),VENO2Z) 

INTEGER Ags ATEMPePPyWeV 
COMMON/PLEN/ IBy PPaNZeLSe LFLAGs LQUITsNORES 


THIS SUBROUTINE SOLVES THE MATATX EQUATION AX=B 
‘FOR X AND FOR THE EXACT SOLUTICIN, Y=ACADJI)*B 
AND DET Aw KESTOUE ARITHMETIC IS USED TO OBTAIN 
THE SOLUTION. 


A IS THE N BY N CQEFFICIENT MATREX AND MUST BE 
CF TYPE INTEGER. 
N IS THE ORDER OF THE MATRIX A (N GREATER THAN 1). 
IN IS A DIMENSION PARAMETER WHICH DEFINES THE 
CEIMENSION OF Aw IT MUST BE EQUAL TO OR GREATER 
THAN Ne 
B IS THE N BY M MATRIX OF THE RIGHT-HAND SIDE AND 
MUST BE OF TYPE INTEGER. 
M IS THE NUMBER OF COLUMNS OF B AND X (M GREATER 
THAN 0). 
IM 1S A DIMENSEON 9ARAMETER WHICH DEFINES THE 
SECOND OEFMENSION OF THE 2-DIMENSTONAL ARRAYS 
A AND X. IT MUST BE EQUAL TO OR GREATER THAN Mo 
IMPIN IS A DIMENSION PARAMETER WHICH IS IM + IN. 
IMENL IS A DIMENSTON PARAMETER WHICH IS [M * IN + 1. 

NOIGIT IS THE NUMBER OF DIGITS STORED IN EACH WORD 
CURING MULTELENGTH ARITHMETIC OPERATIONS. IT IS 
MACHINE DEPENDENT AND MUST BE CHOSEN SO THAT 
10 ** (2 * NDIGIT) IS LESS THAN OR EQUAL TO THE 
LARGEST REPRESENTABLE INTEGER FOR THE COMPUTER 
BEING USED. 

KPRIME 1S THE LINEAR ARRAY OF NOPRIM MODULI. THE 
MODULI MUST BE PRIMES, CHOSED AS LARGE AS 
POSSIBLE AND SO THAT KPRIME(I) * KPRIME(J) DOES 
NOT OVERFLOW AN ENTEGER WORD, FOR ALL I AND Je 

NOPRIM ES A DIMENSION PARAMETER WHICH DENOTES THE 
NUMBER OF PRIMES (MODULI) STORED IN KPRIME. 

NO2 IS A DIMENSEON PARAMETER WHICH IS 2*NOPRIM. 
X 1S THE N BY M FLOATING-POINT MATRIX WITH THE 
M SOLUTION VECTORS AS COLUMNS. IT IS THE 
ROUNDED QUOTIENT OF THE RATIONAL COMPONENTS 
CF X. 
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COLLECTED ALGORITHMS (cont.) 


DET IS THE FLOATING POINT DETERMINANT DF A. 
TER IS AN ERKOR COLE WHICH IS 
O IF THE SYSTEM [S SOLVED SATISFACTORILY ,s 
1 UF THERE ARE NOT ENOUGH MODULI AVAILABLE 
TQ SOLVE THE SYSTEM, 
2 TF THE COFFFICIENT MATRIX IS SINGULAR 
MOLULO EACH OF THE NCPRIM MOOULT (IN WHICH 
CASE X AND DET ARE NOT COMPUTED)» 
3 IF GNE OR MORE OF THE ENPUT INTEGER 
ARGUMENTS IS INCORRECT (LE. NoMyINe IMs 
IMPINe IMINL»NO2). 
MULTL IS THE MATRIX {N WHICH THE MULTILENGTH OIGITS 
CF Y€IeJ) AND MET A ARE STORED. THE ELEMENTS 
CF Y ARE STORED BY COLUMNS IN THE FIRST M * N 
ROWS OF MULTL» AND DET A IS STORED IN THE 
(M * N + LITH ROW. LOW ORDER CIGITS ARE IN 
COLUMN OVE OF MULTL» AND HIGHEST ORDER DIGITS 
ARE EN COLUMN LCOUNT. [T SHOULD BE DIMENSTONED 
EMINL BY NOPRIM. 
LCOUNT IS THE CULUM¥ NUMBER EN MULTL WHICH CONTAINS 
THE HIGHEST ORDER MULTILENGTH CIGITS. 
ATEMP IS THE IN BY IEMPIN MATRIX OF TYPE INTEGER USED 
BY EXACT TO HOLO THE AUGMENTED MATRIX (AyB) 
(IN RESTOUE FORM. 

MM TS THE LINEAR ARRAY USED BY SOLVE TO HOLD THE 
MODULI WHICH WERE USED TO SOLVE THE SYSTEM OF 
EQUATIONS. IT SHUULD BE OIMENSTONED THE SAME 
AS KPRIME. 

RY IS THE LINEAR ARRAY USED BY EXACT TO STORE THE 
FLOATING-POENT ELEMENTS OF Y AND THE FLOATING- 
POINT DETERMINANT OF Aw [TS DIMENSION SHOULD 
BE IMINL. 

W IS THE LINEAR ARRAY OF TYPE INTEGER USED RY 
MLTLTH TO HOLD A MULTILENGTH NUMBER WHILE 
PERFORMING MULTILENGTH ARITHMETIC OPERATIONS 
CN IT. IT SHOULD GE DIMENSIONED NO2. 

VIS THE LENEAR ARRAY OF TYPE INTEGER USED BY 
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CHECK ENPUT PARAMETERS FOR CONSISTENCY 
IF(N .LE. L -ORe N oGTe IN) GO TO 80 
IF(M oLE. 0 OR» M oGT. IM) GO TO 80 
IFCIMPIN NEw IM+IN) GO TO 80 
TFCIMIN] NE. IM*INe1L) GO TO 80 
IF(NO2 .NE. 2*NUPRIM) GO TO 80 
NORE S=M4N41 
IB=LO*®*NNIGIT 
SUMLOG=C. 
NZ IS THE NUMBER OF PRIMES FOR WHICH 
THE RESIDUE SYSTEM IS SINGULAR 
NZ=0 
IF IQUIT IS NOT EQUAL TO O, THEN A IS SINGULAR 
MODULO EACH OF THE STORED PRIMES 
TQUIT=0 
IS WILL COUNT THE NUMBER OF PRIMES 
USEO SUCCESSFULLY 
1S#2 
ICOUNT WILL COUNT THE NUMBER OF PRIMES TRIED 
{COUNT=1 
COMPUTE A LCWER BOUND ON THE NUMBER 
OF REQUERED MODULI 
CALL LOGBND{AeNe INe BoM, EMe BOUND) 
COMPUTE RESTOUE OF A AND B 
AND STORE BCTH IN ATEMP 
10 PP=KPRIMECICOUNT) 
P=PP 
DO 20 I=1,N 
00 20 J=1lyN 
20 ATEMP (I, J)=MODL ACI» J) e PP) 
00 30 T=1leN 
00 30 J=1l,™ 
JJ=NeJ 
30 ATEMP UL s JJD=MOD(BCUE es SD e PP) 
IFLAG=0 
SOLVE THE RESIDUE SYSTEM AX=8 (MOD PP) 
FOR YSA(ADJ)*B (MOD PP) AND DET (MOD PP) 
AND STORE RESULTS IN MULTL 
CALL SOLVECATEMP,MULTL oe Ng INeMMeMeIMPIN, IMINI »NOPRIM) 
TF IQUIT IES NOT EQUAL TO Oy THEN THE SYSTEM IS 
SINGULAR MOCULO EACH OF THE STORED PRIMES, 
ANO HENCE» CANNOT Bt SOLVED BY THIS PROGRAM. 
RETURN AN ERROR COOt OF 2. 
TFC LQUIT .EC. 0) GO TO 40 
TER=2 
RETURN 
{F IFLAG 1S NOT EQUAL TO O, THEN A IS SINGULAR 
MODULO KPREIMECECOUNT). CHOOSE ANOTHER PRIME» 
LeE. KPRIMECICOUNT4¢1)»5 AND TRY TO SOLVE 
THE SYSTEM AGAIN. 
40 LFC(TFLAG .«NE~ 0) GO TO 50 
SUMLOG=SUMLOG+ALOG(P) 
“TEST TO SEE IF THE REQUIRED NUMBER 
OF PRIMES HAVE BEEN USED 
IF(SUMLCG .GE. BOUND) GO TO 60 
ISsiSel 
50 ICOUNT=ICOUNT#1 
IF ALL PRIMES HAVE BEEN TRIED 
AND STILL ANOTHER IS REQUIREC? 
COMBINE RESULTS ANC CHECK SOLUTION 
IFC ECOUNT .LE.’ NOPRIM) GO TO 10 
[S=[S-1 
COMBINE RESULTS BY CONVERSION 
TO SYMMETRIC MIXED~RADIX 
60 CALL MXRADX(MULTL eMMeRY pLCOUNTSNDIGITs IMINL »NOPRIM, 
INO2 Ww) 
C CHECK SOLUTION BY COMPUTING AY AND DB 
CALL CHECK(A Ne INeBeMeI Me TERsMULTL» IMIN] sNOPRIM,y 
INO2sWeV) 
TFUTER 2EQe 1) RETURN 
C COMPUTE THE SOLUTICN X = (1L/DET)*¥Y 
DET=RY(AORES) 
INDE X=0 
00 70 J=LeM 
DO 70 I=lgN 
INDEX=INDEX+41 
70 XC Ps JI=RVCINDEX) /DET 
RETURN 
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CHECK FOR COMPARING THE VALUES OF TWO MULTILENGTH 
NUMBERS. IT SHOULD BE OIMENSTONED THE SAME AS We 


406-P 4— 


C RETURN ERROR COME OF 3 FOR EWCONSISTENT 
C INPUT PARAMETERS 
80 TER=3 
RETURN 
END 
SUBROUTINE SOLVE CATEMPsMULTL »No INe MMe, I MPIN, IMINI 
LNOPRIM) 


NIMENSTCN MMINCPRIM) eMULTLOEMINI yNOPRIM) pATEMPUINs IMP EIN) 


INTEGER ATEMP,PPeREST QU 
COMMON/ PLEN/ TB, PPeNZ eo IS se LFLAG, EQUITeNORES 
C THIS SUBROUTINE SOLVES THE RESTOUE SYSTEM 
C AX=B {MOD PP) FOR Y (MOC PP) AND DET (MOD PP}. 
IDET=1 
C FIND A PIVOTAL ELEMENT RELATIVELY PRIME TO PP 
MPN=M4N 
00 4O J=leN 

DO 10 L[=JyN 
TFCMODCATEMP( Ey J) ePP) .NE. OF GC TO 29 
IF(I .EQ. Nd GO TO 100 

10 CONTENUE 
C PERMUTE ROWS IT AND J 
20 IF(ET e£Qe J) GN TC 40 

IDET=-IDET 

DO 30 JJ=JSeMPN 
ITEMP=ATEMP (JyJJ) 

ATEMP (Se JJ) =ATEMP( Io JJ? 
30 ATEMP(1, JS) =1 TEMP 
C ACCUMULATE CETERMINANT 
40 IDET=IDET*ATEMP( Jy J) 

IOET=K"ODLIDET.PP) 

C FIND ENVERSE OF PIVOTAL ELEMENT 

IXSINVERSCATEMP (Je J) 9 PP) 

CG MULTIPLY ROhW J BY INVERSE OF PIVOTAL ELEMENT 

DO 50 JJ=JeMPN 
ITEMP=ATEMP (Je JSD IX 

50 ATEMP (Je SJ) =MODCI TEMP, PP) 
C REPLACE LTH ROW BY LTH ROW-JTH ROWe (L NOT EQUAL J) 

DG 70 L=leN 
{F(t .-EG. J) GO TO 70 
IK=ATEMP(L, J) 

DO 60 JJ=Jd_y MPN 
ITEMP=ATEMP(S, JJ) #IEK 
TTEMP=MODCITEMP,PP) 
TTEMP=ATEMP(L pe JJ)-ITT EMP 
ATEMP(LsJJ)=MOD(T TEMP, PP) 

60 CONTINUE 
70 CONTINUE 
80 CONTIAUE 
C STORE SYMMETRIC RESIDUE DIGETS IN MULTLe 
C AND MODULUS IN MM 
NI=Nel1 
INDEX=0 
DO 90 J=N1LeMPN 

DO 90 IT=1sN 
INDEX=INDEX+#1 
ITEMP=ATEMP( EJ) * IDET 

90 MULTLCINDEXs 1S)=RESIDUCIE TEMP »PP) 
MULTLEINCRES»sIS)=RESLOUCIDET,PP) 
MM(TS)=PP 
RETURN 
100 NZ=NZ+#1 
IFLAG=1 
C TEST TO SEE IF ALL PRIMES HAVE FAILED 
IFC(NZ .GT. NOPRIM-1) I[QUIT=1 
RETURN 
END 
SUBROUTINE MXRAOX(MULTL »MMyRY gL COUNT »NDIGIT Ss IMEIN], 
LNGPRIMsAC2eW) 
DIMENSICN MM(NOPREIM) eMULTLOIMINL sNOPREIM) eR YC IMINL) » 
1w(NQ02) 
INTEGER RESIDU,WePP 
OOUBLE PRECISION ACCeACC1LeACC2, TEX 
COMMON/PLEN/IByPPaNZe tS, LFLAG,IQUIT,NORES 
SUBROUTINE *XRADX COMPUTES THE SYMMETRIC 
MIXED-RADIX CIGITS CF Y AND CET A FROM 
THEIR SYMMETRIC RESINUE DIGITS. 
IF(IS .£Q. 1) GE TO 150 | 
COMPUTE SYMMETRIC MIXED-RADIX ,DIGITS 
AND STORE TKEM IN MULTL 
00 10 152,1S 
KK=I-1 
DO 10 J=IeIS 

IX=ENVERS(MMEKK) » MM (0D) 

DO LO K=leNURES 
LTEMP=MULTLOIK,JI-MULTL (Ke I-1) 
ITEMP=ITEMP#*IX 

10 MULTL (Ky JI=RESTOUCITEMP gMM(J)) 


an Can 


C COMPUTE Y AND D FROM 
C THEIR SYMMETRIC MIXED-RADIX CIGITS 
C USING MULTILENGTH ARETHMETIC 
LCOUNT=C 
20 00 140 T=lLeNORES 
WELIEL 
00 30 K=2,IS 
30 W(K)=0 
C COMPUTE YCL)=Ceee(MULTL (Ts 1S )*#MMCTS—1L)+ 
C MULTLO Tp TS~1)) MACE S—-2) 4000 +MULTL(192)) 
C MM(LI+MULTLOT SL) 
WOL)=hO1)*MULTLOT ep ES) *®MMCTS—-1) 
CALL MLELTHING2 WwW) 
J=IS 
40 JeJ-1 
TFE(J st» 1) GO TO 60 
WOLI=HKOLI#MULTLOT 9d) 
CALL FLTLTHING2,W) 
DO 50 K=1L,IS 
50 W(K)=W(K)D*MM(J-2) 
CALL PLITLTH(NG2—W) 
GO TO 490 
60 WCLISKHOLI+MULTILGUT 2 J) 
CALL MLTLTH(NO2»W) 
G STORE MULTILENGTH OLGETS OF YIT) 
C IN MULTL (Te J) eJ=1,1S 
C STORE MULTILENGTH DIGITS OF TET A 
€ IN MULTLONORESs J) e Jaleo IS 
DO 70 J=lelIS 
70 MULFIL (Le Jdawtd) 
C COMPUTE YC)? IN FLOATING-PT. FROM MULTILENSTH WIGITS 


K=IS 


COLLECTED ALGORITHMS (cont.) 


C 
c 


C 


oOerar 


80 IF(weK) .NE. 0) G2 TO 90 
[F«K .FQ. 1) GO Tad 100 
K=K-1 
GO TO 40 
90 IF(K eLee 1) GO TC L100 
ACCEW(K)*13¢Ww(K-1) 
TEX=NCIGIT*®(K-2) 
60 TO LLO 
100 RVC ETE) swt) 
GO TO 130 
110 tFCK eLE. 2% GG TO 120 
ACCL=hw(K-2) 
ACC2=10.N0¥**#NOIGIT 
ACCH=ACCH+ACCL/ACC2 
129 ACC1=1.0D¢1¥** TEX 
RY( EL) =ACC#ACCL 
130 TF(K oLE. tCOUNT) GO Tu 140 
LCOUNT=K 
140 CONTINUE 
RETURN 
150 DO 160 T=lsNORES 
160 RYCEPSMULTLOI,1L) 
RETURN 
END 
SUBROUTINE MLTLTHINC2 sh) 
DIMENSION WEtNnd2) 
INTEGER W,PP 
CCMMON/MLEN/ I By PPeNlev [Se TFLAGZLQUITyNORES 
TEUIS 269-6 1) RETURN 
L=1S-1l 
CISTRIBUTE THE DIGITS IN W SO THAT 
EACi} ELEMENT OF W CONTAINS NCIGIT DIGITS 
DO 10 K=lel 
WOK #LI=RWIK IST ew (Kel) 
Lo WOK Y=-W(KI/TB*IBtW IK) 
K=I1S 
ALL THE ELEMENTS OF W SHOULO HAVE THE SAME SIGN. 
20 TFIWEK) 160530040 
30 TF(K wEC. 1) RETURN 
K=K-1 
GO Fa 2c 
40 DO 50 Kelel 
(F(w(K) .GE. 0) GO TO 50 
WCKD=hWUK)418 
W(K+L) =WOKeL)-1 
50 CONTINUE 
RETURN 
60 00 70 K=lel 
TFtwtK) «LE. 0) GO TO 70 
W(K)sh (K-18 
W(KtLI=WIKeld el 
70 CONTINUE 
RETURN 
ENO 
SUBROUTINE CHECK(AsN,INsBeMes EMP EER eMULTLy IMINI, 
LINOPRIM,NO2,W,V) 
DIMENSICN VONO2) ¢eMULTLETMINT eNOPRIM) SACING IND 
LBCINS EM) SWEINO2) 
INTEGER WeVeAyb eg PP 
COMMON/NLEN/IBy bP eNZe 1S es TFLAG,IQUITsNORES 
SUBROUTINE CHECK CHECKS THE SOLUTION BY COMPUTING 
A®y AND (DET A)*8 AND CUMPARING THE RESULTS. 
Y 1S STORED PY COLUMNS EN FPULTL 
DET IS STORED IN MULTLINORES Td eTHlstS 
LL=IS 
KK=1Se1 
90 90 T=1—N 
INDEX=0 
00 90 L=lyM 
MULTIPLY ROW ft OF A RY COLUMN L OF Y 
00 10 K=LeN02 
10 WK )=0 
[S=kK 
DO 40 J=leN 
EACEX=INDEXeL 
JJzaCt JVB 
TIl=-JJ*®IB+ACT sd) 
TFCLEL .FQ. 1) GO TC 30 
DC 290 K=2,LL 


WIK) SWOKD#MUL TL CINDEX KD ® EE tMULTLO INDEX gK~1I* Jd 


CALL MLILTH(NO2 ow) 

20 CONT UNUe 

30 WCLI=LPL*MULTLOINCEXs Ld ewe) 
WOKK)=JJ*#MULTLOINDEX LL) +W(KK) 
CALL MLTLUH(NG2sH) 
TECLABS(WKES)) .LT. 18) GO TO 40 
TFCLS .GE. NO2) GO TO 40 
TS=(S¢#) 
WOES)=WCTS-1L)/18 
WCES-1L)=-WOESD*®IB¢WEES-1) 

40 CONTINUE 


C STORE THE PROOUCT IN V 


OO 5 K=lelS 
50 VK )=W(K) 


C MULFIPLY BeIst) BY CET AND STURE IN W 


JIFHCTeLIZIC 
{L=-JJ*1B+B UI ,l) 

TFC(LL .EQ. 1) GG TO 70 
DO 60 K=2ekL 


60 W(KY=MULTLINORES SK) FLL +MULTLINORES sK-LI* IS 


70 WCL)=ET*MULTLONORES SL) 
WOKK DZ JJ*MUL ILE NORES LL DP +W (KK) 
CALL MLILTHINO2 sw) 


C TEST EQUALITY OF W AND V 


OO #O J=tsTS 
TFCWOS) ewE. Visi) GO TO 100 
40 CCNTINUE 
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90 CONTINUE 
[F SOLUTION CHECKS, RETURN [ERsC0 
ELSe, RETURN TER=L 
TER=0 
TS=LL 
RETURN 
100 TER=1 
RETURN 
—NO 
FUNCTION [TNVERS (Ke) 
LC INVERS COMPLTES AN INVERSE OF K (MOD M) 
C HY THE EUCLICEAN ALGCRITHN 
t=K 
L=M 
J=1 
INVERS=C 
10 KK = T/L 
NN=MOOC ToL) 
IFINN «EQ. 9) GL TO 20 
fat 
L=NN 
NN=-KK*INVERS+J 
J=LNVERS 
INVERS=NAN 
GO TO 1C 
20 IF(L .GE. 9) GC TO 30 
INVERS=-INVERS 
C RETURN A POSITEVE VALUE. 
30 IFCINVERS «GE. GO) RETURN 
INVERS=M4ENVERS 
RETURN 
END 
INTEGER FUNCTICN RES{DU(K,)M) 
REST DU=YGDIK eM} 
C THE FUNCTION RESTDU COMPUTES THE SYMMETRIC 
C RESIOUE OF K ¢(MCD M) 
C TeEe. -M/2 LESS THAN RESEDU LESS THAN M/2 
IF(RESTCUIL0,205 30 
10 TF(2*RESINU+tM 2GE. 0) RETURN 
RESIDU=RESTDU+M 
20 RETURN 
30 TF(-2*RESTIDU+M .GE. 0) RETURN 
RESIDU=RESTDU-M 
RETURN 
ENO 
SUBROUTINE LOGBND(AsNe IN ByMy [Me BOUND) 
DIMENSICN ACTING IND, RE TN, IM) 
INTEGER AgB 
CC BOUND IS A LOWER BOUND FOR THE 
tC LOG OF THE PRODUCT UF THE MOCULI 
BOUND=0. 
DO 20 [=1,N 
ALPHA=0. 
00 10 J=leN 
TEMP=A(1,J) 
TEMP=TEMPeTEMP 
10 ALPE A=ALPHA+TEMP 
20 BOUND =SQUND+ALOG( ALPHA) 
BOUND=BCUND/2. 
CO 30 J=teM 
DO 30 I=1,N 
ALPFA=LABS(B(C1,J)) 
[F(ALPHA «EQ. 0.) GO TO 30 
BOUNT=ZROUNT FALOS (ALPHA) 
30 CONTINUE 
40 ROUND=SCUNDS4 4L0G(2.) 
RETURN 
END 


om a 


Remark on Algorithm 406 [(F4| 

Exact Solution of Linear Equations Using Residue 
Arithmetic [Jo Ann Howell, Comm. ACM ]4 
(Mar. 1971), 180-184] 


Jo Ann Howell [Rec’d 6/10/71 | 
Department of Computer Science, Yale University, 
New Haven, CT 06520 | 


The following statement should be added to subroutine 
MXRADX before the last RETURN statement (after statement 
160): 


LCOUNT = 1 
Without this statement, LCOUNT is undefined whenever IS = 1. 


COLLECTED ALGORITHMS FROM CACM 


Key Words and Phrases: differential equations, stiff differential 
equations , 
CR Categories: 5.17 


Description 


This subroutine integrates a set of up to N ordinary differential 
equations one step of length H, where H may be specified by the 
user, but is controlled by the subroutine to control the estimated 
error within a specified tolerance, if possible. 

A multistep predictor corrector method is used whose order is 
automatically chosen by the subroutine as the integration proceeds. 
Either:an Adams’ method or methods suitable for stiff equations 
can be selected. The starting procedure is automatic and the in- 
formation retained by the program about previous steps is stored in 
such a way as to make the interpolation to a nonmesh point 
straightforward. (See the description of the parameter Y in the 
subroutine.) The methods used are described from a mathematical 
point of view in the papers referenced in [1]. 

The programs may call on up to three subroutines. They are 


DIFFUN(T, Y, DY) 
PEDERV(T, Y, PW, M) 
MATINV (PW, N, M, J) 


The first, DIFFUN, must be provided always, and it must 
evaluate the derivatives of the dependent variables Y with respect to 
the independent variable T and place the results in DY. Y-is di- 
mensioned 8 by N, and the function values are in Y(1, J) for J = 
1 to N. 

MATINV must be provided if stiff methods are requested. It 
should invert the matrix PW which is of size N by N. The (J, J) 
element of PW is stored in position PW(I+M*(J—1)), that 1s, 
PW is dimensioned as an M by M array. (The value of M used by 
DIFSUB is equal to the value of N used on the first call to DIFSUB 
when the user supplied parameter JSTART is 0.) If stiff methods 
are not used, MATINV is never called, so it is sufficient to provide a 
dummy subroutine to satisfy the loader if Adams’ methods are used. 
The parameter J in MATINV should be set to a + 1 on return if 
the inversion is successful, —1 if the matrix is thought to be nearly 
singular. 

If large systems of stiff equations are to be integrated, the in- 
version should be done in two stages. The call to MATINV after 
statement 300 should be replaced by a call of an LU factorization 
program; e.g. subroutine DECOMP (2, p. 68]. 
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Algorithm 407 


DIFSUB for Solution of 
Ordinary 
Differential Equations [D2] 


C.W. Gear [Recd. 29 Dec. 1969 and 10 April 1970] 
Department of Computer Science, University of Illinois, 
Urbana, IL 61801 


The set of statements 
DO400 I=1,N 


400 SAVE(9, 1) = D 


should be replaced by a call to the second stage of a Gaussian 
elimination program; e.g. subroutine SOLVE [2, p. 69]. The net 
result must be to solve the N by N linear system PW*X = Y where 
the array Y is in SAVE (/, 1), J = NS + 1 to NS + N and the 
unknown array X is to be returned to SAVE (9, 1), 1 = 1toN. 

Tests have indicated that MATINV is called about ten times 
less frequently than the code represented by the above DO loop is 
executed. The cost of the change would be the overhead of the call 
to SOLVE which is independent of N; the saving due to the change 
would be about 5N*/6 multiplications and overhead operations 
each time that DECOMP is called instead of MATINV. The break 
point will depend on the computer and compiler used, but the 
change will lead to time saving on most computers when N exceeds 
about 5. 

PERDERYV is another optional subroutine called only if the 
method flag MF is set to 1. (See the description of the parameters.) 
If it is not used it can be replaced by a dummy subroutine to 
satisfy the loader. When used, it should compute the partial deriva- 
tives of the differential equations with respect to the dependent 
variables. The partial of the /th equation with respect to the Jth 
variable should be stored in PW(I+M*(J—1)). For example, if 
the two equations 


yi =\Njyol 
yo = —y2 + 6y; 


were being solved by method type 1, PEDERV should compute 
as follows: 


PW) = Y(1, 2)*7*+#2 
PW(2) = 6.0 

PW(1+M) = Y(1,1)*T*+2 
PW(2+M) = —2.0«Y(1,2) 


If the first value of N used in a call to DIFSUB was 2, then the left 
hand sides of the last two assignment statements could better be 
written PW(3) and PW(4) for speed. 

The DOUBLE PRECISION statement may be removed if a 
single precision version is required. If it is left in, all variables begin- 
ning with the letters A to H and Q to Z are double-precision 
floating-point, those beginning with P are single-precision floating- 
point. (In particular the matrix PW is computed and inverted in 


COLLECTED ALGORITHMS (cont.) 


Table I 


enema PS SL ES 


METHOD TYPE 0 
ERROR RQ PRESENT ERROR MAXIMUM ERRPR NO. STEPS FN EVALS MAT LNVS AVERAGE STEP 
0.190D-03 0.2806D-06 0.2806D-06 42 120 0 0.87510-04 
0.100D-03 0 .33610-06 0.3361D-06 211 623 0 C.1607N=03 
O.1CO0-03 INTEGRATION ABANDONED WHEN NO. OF FN EVALUATIONS EXCEEDED 5000 AT T = 
C.100D-04 0.10330-07 0.1033D-07 50 143 0 0.7043D-04 
C.1000-0% 0.51750-06 0.51 750-06 218 645 0 0.15540-03 
0.1000-04 INTEGRATION ABANDONED WHEN NO. OF FN EVALUATICNS EXCEEDED 5000 AT T = 
0.100D-05 0.1863D-07 0.18630-07 60 1738 0 0.57240-04 
041 00D-C5 04452 50-06 0.45250-06 182 548 0 0.1833D-03 
Q.1000-05 INTEGRATION ABANDONED WHEN NO. OF FN EVALUATIGNS EXCEEDED 5000 AT T = 
«1000-06 0.60610-08 0.60610-08 7 228 0 0244500-04 
0.1000-C4 0-46080-07 0.4608L-07 203 5 74, 0 C.17470-03 
0» 100D-06 0.44190-07 0.4608D-07 1576 45 €8 0 9.21800-93 
0.1000-06 INTEGRATION ASANDONED WHEN NO. OF FN EVALUATICNS EXCEEDED 5000 AT T = 
0.1 000-07 0.1550D-08 0.1550D-08 85 269 0 O.37720-04 
0.109D-CV 0.5784D-08 0 .5784D-08 223 649 0 0.15420-03 
0.1000-C7 0.31620-07 0.31620-07 1396 3997 0 C.25020-93 
Owl000-CT INTEGRATION ABANDONED WHEN NO. OF FN EVALLATICNS EXCEEDED 5000 AT T= 
0.1000-C¥ 0.23789-09 0.2378D-09 106 323 0 ©.31200-04 
0.100D-¢8 C.3412D-09 0.34120-09 249 710 0 0.1415D-03 
0.100D-08 0.66800-10 0 .34120-09 142} 3523 ) 0.25520-03 
Q0.100D-C8 INTEGRATION ABANDONED WHEN NO. OF FN EVALUATICNS EXCEEDED 5000 AT T = 
04 100-09 0.3961D-10 0.39610-10 130 392 0 0.25760=0% 
0.1000-09 0.5042D0-10 0-5042D-10 284 B41 0 0.11990-63 
0.100D-09 0.2036D-09 0.20360-09 1420 4028 0 0.24830-03 
0.1000-09 INTEGRATION ABANDONED WHEN NU. OF FN EVALUATIONS EXCEEDED 5000 AT T = 
Table II 
METHOD TYPE 1 
ERROR RQ PRESENT ERROR MAXIMUM ERRPR NO. STEPS FN EVALS MAT INVS AVERAGE STEP 
0.1000-03 0235330704 0.3533D-04 39 &7 6 0.12370-03 
0.100U-03 0.75520-04 0.75520-04 66 155 10 0-65480-03 
0.1000-03 0.79560-04 0.79560-04 98 231 13 C.44010-02 
0.100D=¢3 0.9021D-04 0.90210-04 126 30 16 0.35C0C-O1 
0.1COD=03 0.11130-03 0.11130-03 146 346 20 0.3138D 90 
0.1000-¢3 0234470-04 0.11130-03 157 380 24 C.27860 OL 
0.1000-04 0.2202D-06 0.22020-06 53 121 6 0.8564N=04 
0.1C0D-04 0.13690-04 0.1369D-04 103 243 12 0.41560-03 
0.1000-04 0.2649D-05 0.1369D-04 155 366 17 0.2748D-02 
0.1000-04 0.20960-04 0.20960-04 192 453 21 0.22750-O1 
0.L00D—C4 0.1209D-0% 0.2096D-04 220 516 24 0.19570 00 
0.L000-04 0-1671D-06 0. 20960-04 242 5 6% 27 C.1995D OL 
0.1000-05 €.91000-07 0.9100D-07 79 179 7 0.57230-94 
0.1C0D-05, 0.26670-05 0. 26670-05 110 262 12 0.40030-03 
0.1000-95 0.22080-05 0 «26670-05 163 45 15 0.24990-02 
0.1009-05 0.28700-05 0.287CD-05 216 523 20 C.19140-01 
0.100D-05 0.29840-05 0.2984D-05 252 616 25 0+1664D 00 
0.1C0D-¢5 0.11990-05 0.29840-05 283 693 29 0.14800 O1 
0.1000-06 0.3261D-07 0.32610-07 BS 192 6 0.52450-04 
0.1000-C6 0.3073D-07 0.3261D-07 145 341 12 0.3049D-03 
0.1000-06 0.7833D-07 0.7833D-07 219 530 7 ©.1921D-02 
0.1000-06 0.5889D-07 0.78330-07 281 672 22 0.15390-01 
0.1 COL~C6 +1096D-07 0.78330-07 333 752 oF C.1297D 00 
0.1 000-66 0.25880-06 0.25880-06 393 930 32 0.10979 Ol 
0.1000-C7 0.4569D-08 0 .4569D-08 113 254 6 0.39950-04 
0.1C00-C7 0.546 80-08 0 .5468D-08 171 469 11 0.24450-03 
0.100U-07 0.14400-97 0.14400-07 269 654 17 0.1553D-02 
0.1000-C7 0.13810-07 0.1440D-07 359 837 23 0.12110-O1 
0.1000-07 0.13940-08 0.1440D-07 413 978 29 0.10300 00 
0.1G0D-07 0.75890-09 0.144C0-07 473 1114 34 0.90059 00 
0.1000-08 0 .64860-09 0.6486D-09 148 327 6 0.31150-04 
0.1000-08 0.16180-08 0.1618D-08 218 529 10 0.19400-93 
0.100D~¢8 0.51320-08 0.51320-08 345 849 13 0.1193D-02 
0.1000-08 C2532 50-08 0 .5325D-08 447 L1Cl 19 C.9L670-02 
0.100D-08 0.371 60-08 0.53250-08 528 1297 24 O.77710-01 
0.100D-08 0.22000-08 0 .53250-08 593 1447 29 0.70080 00 
0.1C00~C9 0.7502D-10 0.7502D~10 184 422 6 942394004 
0.1009-09 0.297 70-09 0.29770-09 273 666 11 0.15200-03 
0.1000-09 0.54160-09 0.5416D-09 439 1094 12 0.91560-03 
0.1000-09 0.9022D-09 0 .90220-09 575 1417 19 C.71000-02 
0.1000-09 0.63310-09 0.9022D0-09 681 1657 25 0.61L180-01 
0.1C00-09 0.351 50-09 0.90220-09 765 1856 32 0.5477) 00 
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CURRENT TIME 
0.0105012848 
N.LANL121513 
028512205128 
O.NLONTI“ASTL 
0.1992143720 
0. 8434239899 
O.0L01889615 
0.100453 L167 
0.9391NROL21 
OeNLOL4SS 1645 
021002597074 
LINALLS9B74 
Le96AGLT8 26} 
Ne ALO1462923 
21001053613 
1.0900859952 
Le LO TRH42 168 
0.901L00780205 
061904798094 
L.ONL1817994 
162618059881 
0-910N975698 
021998770225 
1.0009863103 
Le 27318563952 


CURRENT TIME 
0-.N010N7625419 
921014987087 
1.9166403388 
10.6057160621 
109. 5682945165 
1058.6211483326 
0690403624070 
9.1099973518 
169956549946 
19. 307336483) 
LOL232505658414 
1425642 7T07COT6 
DeD024367CL 

— 0610488469068 
~ 1460122667324 
1N.O0L LC 78S 8S7 
102.4771283917 
1025. 7769259724 
O.QOLONTL29C2 
0.1039585812 
L.O0178773065 
10. 3413882438 
102473 7507 1042 
1020.6588599969 
NVeALOV4GS TIES 

92 190N02034N 
1.91 56394503 
1021342196229 
109. 7578631819 
100321153401613 
069101868353 
061026173559 
169126228439 
10 .N934159546 
LOO. T8IR655954 
161406N73431L5249 
029L01LN1830 
061912469906 
169016548361 
1029607268383 
191.3735674223 
10166 5477264277 


COLLECTED ALGORITHMS (cont.) 


single-precision to save space and time. Its accuracy only affects the 
rate of convergence of the method slightly.) All variables beginning 
with letters J to N are integers. 

Because this program computes its own indices in the temporary 
storage array SAVE provided by the user in the call sequence, use of 
an optimizing compiler will reduce execution time considerably. (A 
version in which several more arrays of temporary storage must be 
provided in the call sequence has been compared with this. It uses 
these arrays to avoid computing indices, and consequently runs 
about 10 percent faster than this version on an IBM 360/91 using 
Fortran H, OPT = 2. However it is not as convenient for the user.) 

Generally the problem should be scaled so that the square of 
any values of the solution that are to be considered nonzero when 
multiplied by the test constant EPS discussed below remain within 
the range of numbers representable in floating-point. 

The following test problem proposed by F. T. Krogh (private 
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Let B be the diagonal matrix 


6 O 0 nf 

B=10 B. OO ) 
1) 0 Bs a 
) 0 ) Ba 


The differential equation 
y’ = Uz — UBUy 


is integrated from t = 0 to t = 10,000 where 
a4 Wy —1 


| _ | w? _ | we] _ . . |! 
y= ys? z= we |? w= rl ha Uy with y(0) = = 
vs we? Wa = 


communication) was run. Let U be the unitary matrix given by 


Ek 1 
1 1 
oe —1 

1 1 -1 


The solution is 


D 

= 

~~ 

-_~ 

— 
| 


(1 + 61)e*?") 


y =U| B&/( — (1 + Brde*?") 


Dw 

C3) 

pa 

oN 

— 
| 


Ba/( 


(1 + 6s)e%3") [° 
(1 + Bue") 


STEPS FN EVALS MAT INVS 


Table III 
METHCD TYPE 
ERROR RQ PRESENT ERROR MAXIMUM ERRPR NO. 
0.100D-03 0.3533D-04 0.3533D-04 39 
0.1. 000-03 0.7552D~04 0 .75520-04 66 
0.100D-93 0.7956D-04 0.79560-04 98 
0.100D-03 0.90200-0% 0.9020D-04 126 
0.100D-03 0.11130-03 0.11130-03 146 
0.1000-03 0.7901D-04 0.11130-03 157 
0.1000~-04 0.2202D-06 9.2202D-06 53 
0.100D-04% 0+1369D-04 0.13690-04 103 
0.100D-04 0.26490-05 0.1369D-04 155 
0.1000-C4 0.20820-04 0.20820-04 192 
0.1000--04% 0.120 70-04 0.2082D-04 220 
0.100D-04 0.1695D+05 0 «2082D-04 242 
0.1000-05 0.91000-07 0.91000-07 70 
0.1000-05 0.2667D-05 0.26670-05 110 
0.100D~C5 0.2298D-05 0. 2667D-05 168 
0.100D~05 0.2870D-05 0.28700-05 216 
0.1 000-05 042984005 0.29840-05 252 
0.100D-05 0.1291D-05 0.29840-05 283 
0.1C0D~C6 0.3261D-07 0 .32610-07 86 
0.1000-C6 0.3073D-07 0.3261D-07 145 
0.100D--06 0.78330-07 0.78330-07 219 
0.1000-06 0.58880-07 0.7833D-07 281 
0.1000-C6 0.1096D-07 0.78330-07 333 
921 00D--C6 0.258 6D-06 0.25860-06 393 
0.100L-C7 0.4569D-08 0.45690-08 113 
0.190L--07 0.5468D-08 0.54 68D-08 W71 
0100-07 0.1440D-07 0+14400-07 269 
0.1000~07 0.13180-07 0.14400-07 350 
0.100D-07 0.32460-07 0.32460-07 415 
0.102D—C7 0.3758D-07 0.3758D-07 467 
0.100D-08 0.6486D-09 0 .6486D-09 148 
0.1000-08 0.161 80-08 0.16180-08 218 
0.1000-08 0.5131D-08 0.51310-08 345 
C.100U-08 0.532 4D-08 0. 53240-08 447 
0 «1000-08 0.371 60-08 0.5324D-08 52:8 
0.1000-08 0.2195D-08 0.53240-08 593 
0.1COD-09 0.75020-19 0.7502D-10 184 
0. 1000-09 0.297 70-09 0.29770-09 273 
0.1900-09 0.546 80-09 0.5468D-09 439 
0. 1000-09 0.8977D-09 0.8977D=09 575 
0.1 000-09 0.62830-09 0.89770-09 681 
0.100D~09 0.353 70-09 0.8977D-09 765 


AVERAGE STEP 


CURRENT TIME 


Lil 6 C.9696D-04 O.NLN7T625419 
195 10 0.52050~-03 N.19014987087 
293 13 0.35920-02 129166402801 
367° 16 0-28S00-O1 10.6059403654 
426 20 0.2548D 90 108. 56 3268%6976 
477 24 0.23910 Ol L149.4008N35531 
145 6 0.71460-04 0.0103624079 
291 12 0.34710-03 0.1009973518 
434 17 0.23170-02 129956550430 
537 22 9.1919D~O1 10.390732 82637 
614 24 0.16540 CO 101. 546439959266 
677 23 C.1665D OL {127.4919557 261 
207 7 0.49490-04 De M1 O2436701 
310 2 0.33830-03 0.1048 84649068 
465 15 O.21 770-02 120122667324 
693 20 C.16600-O1 LOJOLIAQ65 5640 
716 25 C.14310 09 102.4764579999 
8C9 29 0.1268D Ol 1025. 9894923429 
216 6 0.46630-04 O.N1LNO7L2902 
389 12 0.26720-03 O.LAZBS5ASAI2 
598 17 0.1702D-02 L.O1L787RN838 
760 22 O.1L361D-OL 10. 3414927392 
9CO 27 0.11420 00 102. 7387715393 
1057 32 0.96550 09 1029.5539661127 
278 6 0.3650D0~04 O.NLO1467 707 
453 LL 0.22C890-03 Ge 1050920357 
722 17 C.14070-02 LeOL5G3A1221 
927 23 0.1993D-O1 10.134%2171393 
10S2 30 0.91S00-O1 100.35459265C8 
1232 35 C.81280 00 LDN1L. 42408 75568 
351 6 0. 29025-94 N.O101868321 
569 10 6-18C3D-03 ON. LO261 74287 
9ol - 13 C.11240-02 1.912619 869% 
1177 L9 0.8575D-02 10. 9927NTHNKA 
1393 24 C.72350-01 100. 7810184380 
1563 29 0.64850 00 L013. 5805314175 
446 6 0. 22650-94 O.OLOLOLIAL2 
710 11 0.1426D-03 O.1L012478 756 
1140 12 0.87920D-93 169922354668 
1499 19 6.67450-02 10.1198946311 
1728 . 27 6.5688C-O01 LOL. 6984171224 
2005 33 C.5102D 00 1023.03778204%92 
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Tables I-III show the results for 8; = 1000, 6B. = 800, 8B; = —10, 
8, = .001. The columns show the requested error (EPS), the error 
at the time of printing in the least accurate component, the maxi- 
mum such error to date, the number of steps, number of calls to 
DIFFUN (i.e. function evaluations), number of calls to MATINV, 
average step size and the current value of 7. The initial step was set 
to 10-4 and printing occurred at the first step to pass 10° for i = —2, 
—1, 0, 1, 2, and 3. The three different methods were used (MF = 0, 
1, and 2), but the integration was stopped if the number of function 
evaluations. exceeded 5000, as it did with Adams’ methods for this 
stiff problem. The problem was run for EPS = 107‘ fori = 4,5, ---, 
10. (Warning; this problem is critically stable. If an error in excess of 
about 10-* occurs, the solution of the perturbed problem may have 
a pole.) It should be noted that the results will depend slightly on the 
precision of the machine and the characteristics of the library pro- 
gram used for MATINV. 
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Algorithm c KFLAG A COMPLETION CODE WITH THE FOLLOWING MEANINGS... 
c +1 THE STEP WAS SUCCESFUL. 
c -1 THE STEP WAS TAKEN WITH H = HMINy BUT THE 
c REQUESTED ERROR WAS NOT ACHIEVED, 
SUBROUTINE ODIFSUB(Ne Ty Vs SAVE yHeHMINsHMAX» EPS _ MF» YMAXs ERROR #KFLAG 9 C -2 THE MAXIMUM ORDER SPECIFIED WAS FOUND TO 
1 JSTART»MAXDER yPW) Cc BE TOO LARGE. 
DOUBLE PRECISION ArDeEsHeReT oY eR eRz + BNDy EPS eEUP yEDWNe ENOL c -3 CORRECTOR CONVERGENCE COULD NOT BE 
L 2 ENQ2,ENQ3eHMAX sHMINy HNEWs HOLD» SAVE+ TOLD» YMAXy ERRORs RACUM c ACHIEVED FOR H .GT. HMIN. 
c c -4 THE REQUESTED ERROR IS SMALLER T.AN CAN 
C THE PARAMETERS TO THE SUBROUTINE DIFSUB HAVE c BE HANDLED FOR THIS PROBLEM. 
C THE FOLLOWING MEANINGS... c JSTART AN INPUT ENDICATOR WITH THE FOLLOWING MEANINGS... 
Cc Cc -1 REPEAT THE LAST STEP WITH A NEW H 
c N THE NUMBER OF FIRST ORDER DIFFERENTIAL EQUATIONS. N c O PERFORM THE FIRST STEP. THE FIRST STEP 
c MAY BE DECREASED ON LATER CALLS IF THE NUMBER OF Cc MUST BE DONE WITH THIS VALUE OF JSTART 
c ACTIVE EQUATIONS REDUCES» BUT IT MUST NOT BE c SQ THAT THE SUBROUTINE CAN INITIALIZE 
c INCREASED WITHOUT CALLING WITH JSTART = 0. c ITSELF. 
c T THE INDEPENDENT VARIABLE. c +1 TAKE A NEW STEP CONTINUING FROM THE LAST. 
Cc Y AN 8 BY N ARRAY CONTAINING THE DEPENDENT VARIABLES AND c JSTART IS SET TO NQy THE CURRENT ORDER OF THE METHOD 
c THEIR SCALED DERIVATIVES. Y(J+tiel) CONTAINS c AT EXIT. NQ IS ALSO THE ORDER OF THE MAXIMUM 
Cc THE J-TH DERIVATIVE OF Y(I} SCALFD BY Cc DERIVATIVE AVAILABLE. 
c HeEI/FACTORIAL(J) WHERE H IS THE CURRENT c MAXDER THE MAXIMUM DERIVATIVE THAT SHOULD BE USED IN THE 
Cc STEP SIZE. ONLY Y(l+I) NEED BE PROVIDED BY c METHOD. SINCE THE ORDER 1S EQUAL TO THE HIGHEST 
Cc THE CALLING PROGRAM ON THE FIRST ENTRY.« c DERIVATIVE USED, THIS RESTRICTS THE ORDER. IT MUST 
Cc IF IT IS DESIRED TO INTERPOLATE TO NON MESH POINTS Cc BE LESS THAN 8 OR 7 FOR ADAMS OR STIFF METHODS 
Cc THESE VALUES CAN BE USEN. IF THE CURRENT STEP SIZE c RESPECTIVELY. 
Cc 1S H AND THE VALUE AT T + £ IS NEFDED, FORM c Pw A BLOCK OF AT LEAST N¥*2 FLOATING POINT LOCATIONS. 
c S = E/H, AND THEN COMPUTE Cc 
Cc NQ DIMENSION Y(89N)+YMAX(N) ySAVE( LON) ERROR IN) PWN)» 
c YCL)(T+E) = SUM Yi dth oT) *S#eJ 1 A(8) sPERTST(79293) 
c J=0 Cc 
c SAVE A BLOCK OF AT LEAST 12*N FLOATING POINT LOCATIONS C THE COEFFICIENTS IN PERTST ARE USED IN SELECTING THE STEP AND 
Cc USED BY THE SUBROUTINES. C ORDER, THEREFORE ONLY ABOUT ONE PERCENT ACCURACY IS NEEDED. 
c H THE STEP SIZE TO BE ATTEMPTED ON THE NEXT STEP. Cc 
c H MAY BE ADJUSTED UP OR DOWN BY THE PROGRAM DATA PERTST /260040597e333910e42e13eTabTeol5vleOy 
c IN ORDER TO ACHIEVE AN ECONOMICAL INTEGRATION. 1 200912 0092400937 089 95 3033970608 BT 29Te 
c HOWEVER, IF THE H PROVIDED BY THE USER DOES 1 3209600990 L674 1205915098 el Oe le Oe 
Cc NOT CAUSE A LARGER ERROR THAN REQUESTED, IT 1 120092400937 089 753033970008 98769 Ty1LeOy 
c WILL BE USED. TO SAVE COMPUTER TIMEs THE USER IS 1 Lorber 005200 1667700041332000082674140% 
c ADVISED TO USE A FAIRLY SMALL STEP FOR THE FIRST 1 1009150920091 00903157 9007407 ¥00139/ 
Cc CALL. IT WILL BE AUTOMATICALLY INCREASED LATER. DATA A(2) / -1.0/ 
Cc HMIN THE MINIMUM STEP SIZE THAT WILL BE USED FOR THE IRET = 1 
Cc INTEGRATION, NOTE THAT ON STARTING THIS MUST KFLAG = 1 
c MUCH SMALLER THAN THE AVERAGE H EXPECTED SINCE IF (JSTART.LE2O) GO TO 140 
Cc A FIRST ORDER METHOD IS USED INITIALLY. c 
c HMAX THE MAXIMUM SIZE TO WHICH THE STEP WILL RE INCREASED C BEGIN BY SAVING INFORMATION FOR POSSIBLE RESTARTS AND CHANGING 
€ EPS THE ERROR TEST CONSTANT. SINGLE STEP ERROR ESTIMATES C H BY THE FACTOR R IF THE CALLER HAS CHANGED He ALL VARIABLES 
Cc DIVIDED BY YMAX(1) MUST BE LESS THAN THIS C DEPENDENT ON H MUST ALSO BE CHANGED. 
Cc IN THE EUCLIDEAN NORM. THE STEP AND/OR ORDER IS C € IS A COMPARISON FOR ERRORS OF THE CURRENT ORDER NQe EUP IS 
Cc ADJUSTED TO ACHIEVE THIS. C TO TEST FOR INCREASING THE ORDER, EDWN FOR DECREASING THE ORDER, 
c MF THE METHOD ENDICATOR. THE FOLLOWING ARE ALLOWED... C HNEW IS THE STEP SIZE THAT WAS USED ON THF LAST CALLe 
c 0 AN ADAMS PREDICTOR CORRECTOR IS USED. c 
C l A MULTI=STEP METHOD SUITABLE FOR STIFF 100 bO 110 I =-19N 
c SYSTEMS 1S USED. IT WILL ALSO WORK FOR DO 110 J = 19K 
Cc NON STIFF SYSTEMS. HOWEVER THE. USER 110 SAVE(JSeT) = YOdeT) 
Cc MUST PROVIDE A SUBROUTINE PEDERV WHICH HOLD = HNEW 
c EVALUATES THE PARTIAL DERIVATIVES OF IF (HeEQ.HOLD) GO TO 130 
c THE DIFFERENTIAL EQUATIONS WITH RESPECT 120 RACUM = H/HOLD 
Cc TO THE Y'S. THIS IS DONE BY CALL IRETL = 1 
c PEDERV(T+YePWyM)« PW IS AN N BY N ARRAY GO TO 750 
c WHICH MUST BE SET TO THE PARTIAL OF 130 NOOLD = NQ 
c THE I-TH EQUATION WITH RESPECT TOLD = T 
c TO THE J DEPENDENT VARIABLE IN PW(I9J)6 RACUM = 1.0 
c PW IS ACTUALLY STORED IN AN M BY M IF (JSTART.GT.O) GO TO 250 
Cc ARRAY WHERE M IS THE VALUE OF N USED ON GO TO 170 
C THE FIRST CALL TO THIS PROGRAM. 140 IF (JSTART.EQ.-1) GO TO 160 
Cc 2 THE SAME AS CASE ly FXCEPT THAT THIS Cc 
c SUBROUTINE COMPUTES THE PARTIAL C ON THE FIRST CALLy THE ORDER IS SET TO 1 AND THE INITIAL 
c DERIVATIVES BY NUMERICAL DIFFERENC ING C DERIVATIVES ARE CALCULATED. 
Cc OF THE DERIVATIVES. HENCE PEDERV IS c 
Cc NOT CALLED. NQ = 1 
c YMAX AN ARRAY OF N LOCATIONS WHICH CONTAINS THE MAXIMUM N32 = N 
c OF EACH Y SEEN SO FAR. IT SHOULD NORMALLY BE SET TO Nl = N*¥10 
c 1 IN EACH COMPONENT BEFORE THE FIRST ENTRY. (SEE THE N2 = NE + 1 
c DESCRIPTION OF EPS.) N& = N®*2 
Cc ERROR AN ARRAY OF N ELEMENTS WHICH CONTAINS THE ESTIMATED N5 = Ni +N 
Cc ONE STEP ERROR IN EACH COMPONENT. N6 = N5 + 1 
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nares ent fh iS SS Sn ssa Sess AO 


CALL DIFFUN(T sY¥¢SAVE(N291)) 
00 150 I = 19N 


Nik = Nl + 1 
150 Y(2,1) = SAVE(N1L191)*H 
HNEW = H 
K = 2 
GO TO 100 


Cc 
C REPEAT LAST STEP BY RESTORING SAVED INFORMATION.’ 
c 

160 IF (NQ.EQ.NONLD) JSTART = 1 


T = TOLD 

NQ = NOOLD 
K = NQ + 1 
GO TO 120 


SET THE COEFFICIENTS THAT DETERMINE THE ORDER AND. THE METHOD 
TYPE. CHECK FOR EXCESSIVE ORDER. THE LAST TWO STATEMENTS OF 
THIS SECTION SET IWEVAL .GT.O IF PW IS TO BE RE-EVALUATED 
BECAUSE OF THE ORDER CHANGE, AND THEN REPEAT THE INTEGRATION 
STEP IF IT HAS NOT YET BEEN DONE (IRET = 1) OR SKIP TO A FINAL 
SCALING BEFORE EXIT IF IT HAS BEEN COMPLETED (IRET = 2)- 


aAanaanaaga 


170 IF (MF.EQ.0) GO TO 180 
IF (NQO.GT.6) GO TN 190 
GO TO (221 9222422342244225,4 226) »NO 
180 IF (NO.GT.7) GON TO 190 
GO TO (21192129213921472154 2169217) ¥NO 


190 KFLAG = -2 
RETURN 
C 
C THE FOLLOWING COEFFICIENTS SHOULD BE DEFINED TO THE MAXIMUM 
C ACCURACY PERMITTED BY THE MACHINE. THEY ARE IN THE ORDER USED... 
Cc 
Cc -l 
C -1/29-1/2 
C =-5/129-3/4,-1/6 
C  ~3B/Be- 11/12 9-1/39-1/24 
C  =-251/720 9~25/245-35/724-5/484-1/120 
C -95/2884-137/1209-5/89-17/9649-1/409-1/720 
C -19087/60480 »-49/40 4-203 /2709-49/19 2 4-7/1449-7/14409-1/5040 
¢ 
, | 
C -2/39-1/3 
C -6/119-6/11,-1/11 
C -12/259-7/109-1/59-1/50 
C  -120/274 9- 225/274 4-85 /274y-15/2749-1/274 
C =180/4419-58/639-15/369-25/2529-3/2525-1/1 764 
c 


211 Al) = ~1.0 
GO TO 230 

212 Al) = -0.500000000 
A(3) = -0.500000000 


GO TO 230 

213 ACL) = -0.4166666666666667 
4(3) = -0.750000000 
A(4) = -0.1666666666666667 
GO TO 230 

214 ACL) = ~0.375000000 
A(3) = ~0.9166666666666667 
A(4) = -0.3333333333333333 
A(5) = -0.04166666666666667 
GO TO 230 

215 ACL) = -0.3486111111111111 
A(3) = -1.0416666666666667 
A(4) = ~0.4861111111111111 
A(5) = -0.1041666666666667 
A(6) = -0.008333333333333333 
GO TO 230 

216 ACL) = —0.3296611111111111 
A(3) = —1.1416666666666667 
A(4) = -0.625000000 
A(5) = ~0.1770833333333333 
A(6) = -0.0250000000 
A(7) = -0,001388888888888889 


GO TO 230 


217 


221 


222 


223 


224 


225 


226 


230 


240 


A(1) = -0.3155919312169312 
A(3) = ~1-235000000 

A(4) = -0.7518518518518519 
A(5) = -0.2552083333333333 
A(6) = -0.04861111111111111 
A(7) = ~-0.004861111111111111 
A(8) = -0.00019841 26984126984 
GO TO 230 

Af{l) = -1.000000000 

GO TO 230 

All) = -0.6666666666666667 
A(3) = -0.3333333333333333 
GO TO 230 

AQ1l) =°- 0.5454545454545455 
A(3) = Atl) 

A(4) = -0.09090909090909091 
GO TO 230 

A(1) = -0.480000000 

A{(3) = -0.700000000 

A(4) = -0.200000000 

A(5) = -0.020000000 

GO TO 230 

ACL) = =0.437956204379562 
A(3) = -0.8211678832116788 
A(4) = -0.3102189781021898 
A(5) = ~0.05474452554744526 
A(6) = -0.0036496350364963504 
GO TO 230 

Atl) = -0.4081632653061225 
A(3) = -0-9206349206349206 
A(4) = -0.4166666666666667 
A(5) = -0.0992063492063492 
A(6) = -0.0119047619047619 
A(T) = ~0.000566893424036282 
K = NQ+1 

IDOUB = K 

MTYP = (4 — MF)/2 

ENQ2 = .5/FLOAT(NQ + 1) 

ENQ3 = .5/FLOAT(NQ + 2) 

ENOL = O65/FLODAT(NQ) 

PEPSH = EPS 

EUP = (PERTST(NQsMTYP 92) *PEPSH) **2 
E = (PERTSTINO »MTYP 91] )*PEPSH)**2 


EDWN =(PERTST(NO gMTYP,3 }*PEPSH) ¥¥*2 
IF (EDWN.EQ.0) GO TO 780 

BND = EPS*ENQ3/DFLOATI(N) 

IWEVAL = MF 

GO TO ( 250 » 680 )yIRET 


C THES SECTION COMPUTES THE PRENICTFN VALUFS BY EFFECTIVELY 
C MULTIPLYING THE SAVED INFORMATION BY THE PASCAL TRIANGLE 
C MATRIX. 


260 


AAMAAANNDAAND 


270 


aAaAaA 


T=T+H 
DO 260 JS = 29K 
DO 260 Jl = JoK 
J2 = - JL + J - 1 
DO 260 [I = 1lyN 
Y(J2e1) = YOS2e1) + YOI2+1971) 


UP TN 3 CORRECTOR ITERATIONS ARE TAKEN. CONVERGENCE IS TESTED 
BY REQUIRING CHANGES TO BE LESS THAN BND WHICH IS DEPENDENT ON 
THE ERROR TEST.CONSTANT. 

THE SUM OF THE CORRECTIONS IS ACCUMULATED IN THE ARRAY 
ERROR(1)- IT IS EQUAL TO THE K-TH DERIVATIVE OF Y MULTIPLIED 
BY H**K/ (FACTORIAL (K-1)¥*A(K))» AND ITS THEREFORE PROPORTIONAL 
TO THE ACTUAL ERRORS TO THE LOWEST POWER OF H PRESENT. (H¥*K) 


DO 270 I = IeN 
ERROR(I) = 0.0 
DO 430 L = 143 
CALL DIFFUN (TyY¥ySAVE(N291)) 


IF THERE HAS BEEN A CHANGE OF ORDER OR THERE HAS BEEN TROUBLE 
WITH CONVERGENCE? PW IS RE-EVALUATED PRIOR TO STARTING THE 
CORRECTOR ITERATION IN THE CASE OF STIFF METHODS. IWEVAL IS 
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on 


AAMAAAAD 


AANNDAANMAAAAAAANAAND 


THEN SET TO -1 AS AN INDICATOR THAT IT HAS BEEN DONE. 


IF (IWEVAL.LT.1) GO TO 350 
IF (MF.EQ.2) GO TO 310 
CALL PEDERV(TyY+PWyN3) 
R = A(1)*H 
DO 280 I = 1s5N4 
280 ~* PW(I) = PWET)*R 
290 NIL = NB + 1 
N12. = N*¥N1] - N3 
O09 300 I = LeNl2eN11 
300 Pw(T) = 1.0 + PW(1) 
IWEVAL = =1 
CALL MATINV(PWyNyN3,51) 
IF (J1-GT.0) GON TO 350 


GO TO 440 
310 00 320 1 = 1lyN 
320 SAVE(9s1) = YOlel) 


D0 340 J = 1yN 
R = EPS*DMAXL(EPS+DABS(SAVE(94J))) 
Y(loJ) = VYlled) + R 
D = A(1)*H/R 
CALL DIFFUN(Ts¥ySAVE(NGy1)) 
DO 330 IT = lyN 


NIL = Fo + (J-1)*N3 
N12 = N5 + I 
N13 = Ni + TI 
330 PW(NIL) = (SAVE(N1L2591) - SAVE(N13,1))*0 
340 YOleJ) = SAVE(9,J) 
GO TO 290 
350 IF (MF.NE.O) GO TN 370 


ND 360 1 = LeN 
NIL = Nl + I 


360 SAVE(9s1T) = Y(2s1r) — SAVE(NI191)*H 
GN TN 410 
370 00 380 [ = laN 
N1ll = N5 + I 
Ni2 = Nl + I 
380 SAVE(NI1+1) = Yl2yI) - SAVE(N12,1)*H 
pO 400 T = 1LyN 
9 = 9.0 
DQ 390 J = LlyN 
Nil = T + (J-1L)¥*N3 
N12 = NS + J 
390 D = D + PWINLLI*SAVE(NI291) 
400 SAVE(9,1) = D 
410 NT = N 
N0 4620 FT = lyn 
Ytleol) = VOLloT) + ACLI¥SAVE(9e91) 
¥(2eT) = ¥(2e1) - SAVE(9,1) 
FRRGR(T) = ERROR(1) + SAVE(9¢I) 


TF (DABS (SAVE(9,1))eLEs(BND*YMAX(I))) NT = NT - 1 
420 CONTINUE 
IF (NT.LELO)} GO TO 490 


430 CONTINUE 
THE CORRECTOR ITERATION FAILED TO CONVERGE IN 3 TRIES- VARIOUS 
POSSIBILITIES ARE CHECKED FOR. IF H IS ALREADY HMIN AND 
THIS [S EITHER ADAMS METHOD OR THE STIFF METHOD IN WHICH THE 
MATRIX PW HAS ALREADY BEEN RE-EVALUATEDs A NO CONVERGENCE EXIT 
IS TAKEN. QTHERWISE THE MATRIX PW IS RE-EVALUATED AND/OR THE 
STEP ITS REDUCED TO TRY AND GET CONVERGENCE. 


440 T=T-H 


IF (CH.LE.CHMIN*®1].00001)) -ANDe((CIWEVAL - MTYP).LT.-1L)} GO TO 460 


IF ((MFLEQ.60).OR. (IWEVAL~NE~O)) RACUM = RACUM¥*0.2500 
IWEVAL = MF 


460 KFLAG = -3 

470 DO 480 I = 1yN 

14K 
SAVE(Je1) 


2 
3 
S 
a 
ro 
i= 

Wout 


480 Y(dSel) 


THE CORRECTGR CONVERGED AND CONTROL IS PASSED TO STATEMENT 520 
TF THE ERROR TEST IS OeKer ANO TO 540 OTHERWISE. 

TF THE STEP IS OK. IT IS ACCEPTED. IF I100UB HAS BEEN REDUCED 
TO ONE, A TEST IS MADE TO SEE IF THE STEP CAN BE INCREASED 

AT THE CURRENT ORDER OR BY GOING TO ONE HIGHER OR ONE LOWER. 
SUCH A CHANGE IS ONLY MADE IF THE STEP CAN BE INCREASED BY AT 
LEAST 1lele IF NO CHANGE IS POSSIBLE INDNUB IS SET TO 10 TO 
PREVENT FUTHER TESTING FOR 10 STEPS. 

IF A CHANGE IS POSSIBLE, IT IS MADE AND IDOUB IS SET TO 

NQ + 1 TO PREVENT FURTHER TESING FOR THAT NUMBER OF STEPS. 
IF THE ERROR WAS TOG LARGE, THE OPTIMUM STEP SIZE FOR THIS OR 
LOWER ORDER IS COMPUTED, AND THE STEP RETRIED. IF IT SHOULD 
FAIL TWICE MORE IT IS AN INDICATION THAT THE DERIVATIVES THAT 
HAVE ACCUMULATED IN THE Y ARRAY HAVE ERRORS OF THE WRONG ORDER 
SO THE FIRST DERIVATIVES ARE RECOMPUTED AND THE ORDER 1S SET 
TO le 


00 500 IT = 1,N 


500 D = D + CERRGR(T)/YMAX(T) )**2 


IWEVAL = 0 
IF (0.GT.E) GO TO 540 
IF (K.LT.3) GO TO 520 
00 510 J = 39K 

00 510 I = 1lyN 


aQaoano 
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510 Yodel) = Yldel) + ACJ)*ERROR(T) 
520 KFLAG = +1 
HNEW = H 
1F (TDOUB.LE+«1) GO TO 550 
I1p0UB = [D0UB - 1 
IF (1DGUB.GT.1) GO TO 700 
DO 530 I = 1yN 
530 SAVE(LOs1) = ERROR(T) 
GO TO 700 
540 KFLAG = KFLAG - 2 
IF (H.LE.(HMIN*1.00001)) GO TO 740 
T = TOLD 
IF (KFLAG.eLE.-5) GO TO 720 
550 PR2 = (D/E)**ENQ2*1.2 
PR3 = 1.&+20 
IF ((NQ.GEsMAXDER) -OR.(KFLAGLE.-1)) GO TO 570 
D = 0.0 
0D 560 I = lyN 
560 D = D + (CFRROR(I) - SAVE(1LO91))/YMAX (1) )**2 
PR3 = (D/EUP)**ENQ3*1.4 
570 PRL = 1.E&+20 
IF (NO.LE.-1) GO TO 590 
D = 0.0 
DO 580 I. = LyN 
580 D = D + CY(KyT)/YMAXCT) ) #2 
PR1 = (D/EDWN)#*ENOL*1.3 
590 CONTINUE 
IF (PR2.LEePR3) GO TO 650 
IF (PR3.eLTePRL) GO TO 660 
600 R = 1.0/AMAX1(PR1y_1leE-4) 
NEWQ = NQ - 1 
610 IDOUB = 10 
IF ((KFLAGSEQs1)sANDe(ReLTe(he1))) GO TO 700 
IF (NEWQ.LE.NQ) GO TO 630 
DO 620 I = LyN 
620 Y(NEWO+1¢e1) = ERROR(1)*A(K)/DFLOATIK) 
630 K = NEWO + 1 
IF (KFLAGSEQ.1) GO TO 670 
RACUM = RACUM&R 
IRET1 = 3 
GO TO 750 
640 IF (NEWO.EQO.NQ) GO TO 250 
NQ = NEWQ 
GO TO 170 
650 If (PR2.GT.PR1) GO TO 600 
NEWO = NO 
R = L.O0/AMAX1(PR291.E-4) 
GO TO 610 
660 R = 1.0/AMAX1(PRA91.E~4) 
NEWO = NQ + 1 
GO TO 610 
670 IRET = 2 
R = DMIN1(R»HMAX/DABS(H)) 
H = H#R 
HNEW = H 
IF (NO.EQO.NEWQ) GO TO 680 
NO = NEWQ 
GO TO 170 
680 Rl = 1.0 
DO 690 J = 25K 


Ri = R1L*R 
00 690 I = 1,N 
690 Y(JeoT) = Y€JoT)¥*R1 
IDOUB = K 
700 DO 710 T = LeN 
710 YMAX(T) = DMAXLE(YMAX(T) sDABS(Y (LI) )) 
JSTART = NQ 
RETURN 


720 IF (NQ.EQ.1) GO TO 780 
CALL DIFFUN (T,Y,SAVE(N2¢1)) 


R = H/HOLD 
DO 730 £ = lyN 
Y(lyI) = SAVE(1L+I) 


Nil = Ni + [ 
SAVE(2s1) = HOLD*SAVE(N11,1) 
730 Y(2,1) = SAVE(2,1)*R 
NO = 1 


740 KFLAG = -1 


JSTART 
RETURN 


THIS SECTION SCALES ALL VARIABLES CONNECTED WITH H AND RETURNS 
TO THE ENTERING SECTION. 


750 RACUM = DMAX) (DABS (HMIN/HOLD) »RACUM) 
RACUM = DMINL(RACUM,DABS(HMAX/HOLD) ) 
Rl = 1.0 
DO 760 J = 29K 
Rl = RL*RACUM 
DO 760 I = IyN 
760 Y(JSeT) = SAVE(Je1)¥*R1 
H = HOLD*RACUM 
DO 770 I = IyN 
770 YOl,t) = SAVEC1s1) 
IPOUB = K 
GO TO ( 130 » 250 » 640 ),IRET1L 
780 KFLAG = -4 
GO TO 470 
END 
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Certification of Algorithm 407 [D2] 

DIFSUB for Solution of Ordinary Differential Equa- 
tions [C.W. Gear, Comm. ACM 14 (Mar. 1971), 185- 
190] 


Paul J. Nikolai [Recd. 1 Mar. 1972, 21 July 1972] 
Aerospace Research Laboratories, Wright-Patterson 
AFB, OH 45433 


The program used for this certification was keypunched directly 
from the printed Fortran algorithm [2]. The algorithm was imple- 
mented on a CDC 6600 computer using Fortran Extended, Version 
3.0, Level 261A, OPT (Optimization Level) = 1. The DOUBLE 
PRECISION statement was deleted, and the built-in or intrinsic 
double precision function references were replaced by their single 
precision equivalents. Thus about 14 decimal digits (48 binary 
digits) were retained in the computations. An apparent bug in 
Fortran Extended required changing the statement 


N4 = Ne«2 
following statement 140 to the equivalent statement 
N4 = NeN. 


The test problem given in [2] was coded, compiled, and ex- 
ecuted to prepare three tables analogous to those given with the 
problem. The results are available from the present writer. In addi- 
tion to the computed error [1, eq. (16)] returned by DIFSUB, the 
tables include the corresponding true error obtained by computing 
the Euclidean norm of the difference between the dependent vari- 
able vector returned by DIFSUB and that computed directly from 
the known solution of the test equation normalized by the infinity 
norm of the latter. The number of steps and average step size reflect 
these items over the appropriate printing interval and are not cumu- 
lative as the corresponding values apparently are in the tables with 
[2]. H was set initially to 10-4, and MAXDER was set to 4. The 
tables compare quite favorably for the larger values of the requested 
error, the discrepancies over the smaller values being attributable 
to the drop in precision from 16 decimal digits on the IBM 360/91 
to roughly 14 on the CDC 6600. The results from the stiff methods 
are truly impressive. 

Several inconsistencies become apparent, unfortunately, if one 
should choose the value of H, the current step size, to be negative. 
For negative values of H the IF statement following statement 440, 
the IF statement following 540, and the arithmetic expression for R 
following 670 do not work correctly. We recommend replacing H 
and HMIN by ABS(A) and ABS(HMIN) in the IF statements and 
HMAX/ABS(HA) by ABS(HMAX/H) in the expression for R. 

DIFSUB with the above modifications has been incorporated 
into a general program for solving linear two-point boundary value 
problems for ordinary differential equations by the method of pro- 
jections [3]. Past- experience with the method of projections indi- 
cates that stiff equations arise often in applications. We currently 
feel that DIFSUB is our best hope for handling these problems. 
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COLLECTED ALGORITHMS FROM CACM 


Key Words and Phrases: matrix, sparse matrix, matrix 
manipulation 
CR Categories: 5.14 


Description 

It is frequently necessary to manipulate large sparse matrices, 
for example in electrical network problems. In such cases much time 
and memory space can be saved if only the nonzero elements are 
stored. A set of Fortran subroutines has been written for performing 
various operations on sparse matrices stored in compact form in 
core. Core storage requirement is reduced for any square matrix less 
than 66 percent dense. These subroutines have been tested on an 
1BM 360/50 using a ““WATFOR” compiler. 


Method of Storage. The nonzero elements are stored row-by- 
row (in one case column-by-column) in a single-dimensioned real 
array (A, say) while entries in an associated single-dimensioned 
integer array (M, say) contain the column indices of the correspond- 
ing elements. In addition the M-array contains certain control in- 
formation. 

The control information and column indices are packed into the 
M-array as indicated in Table I. By the “right half” of an integer 
word is meant the four least significant decimal digits, while the 
“left half’ means the next four digits. Thus it is assumed that the 
computer word length is sufficient to contain at least an eight deci- 
mal digit integer (i.e. 28 bits including sign). 

There should be-no gaps in the M-array; thus, if the number of 
rows is odd, the first column index will appear in the right half of the 
word which contains “number of elements in last row” in its left 
half. 

The total number of words needed in the M-array will be 
{4 + (number of rows) + (number of nonzero elements) + 1}/2 
{rounded down to nearest integer]. 
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Algorithm 408 


A Sparse Matrix Package 
(Part I) [F4] 


John Michael McNamee (Recd. 26 Nov. 1969 and 
15 July 1970) 
York University, Downsview, Ontario, Canada 


Note that the number of rows or columns may be as high as 
9999, while the number of elements stored may be 108 — 1. (This is 
more than can fit into the core of any existing computer.) 

As an example consider the matrix: 


[ 1 O 2 
2 3 0 
0 0 0 
0 1 0 
The A-array would be as follows: 


fe 90402 53 44S 
Ae Ts 24°25. 3 1 


while the M-array would be: 


I: 1, 2, 3, 4, 5, 6, 7 
M(D): 40003, 5, 20002, 00001, 10003, 10002, 20000 


As a second example consider a 100 & 100 matrix having an 
average of three nonzero elements per row (as might arise in an elec- 
trical network problem). The A-array requires 300 words and the 
M.-array (4 + 100 + 300 + 1)/2 = 202, for a total of 502. This is 
just over 5 percent of the area required to store the matrix in full. 

Thirdly, consider a 100 & 100 matrix having an average of 66 
nonzero elements per row. This requires a total of 6600 + (4 + 
100-+ 6600 + 1)/2 = 9952 words, just short of the 10000 needed for 
full storage. Thus it is economical touse the sparse method of storage 
for square matrices having up to 66 percent nonzero elements. Time 
is also saved up to a certain degree of ‘‘nonsparseness.”’ 


List of Subroutines. The subroutines described here are listed in 
Table II. 


Notes on the Subroutines 

1. Using RDSPMX a sparse matrix may be input on cards as 
follows. The nonzero elements only are entered row-by-row in order 
of ascending column number with a sentinel (which may be any 
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Table I. Storage of Control Information and Column Indices 


Word 
Number Left Half Right Half 
M(1) Number of rows Number of columns 
M(2) «Number of elements stored— 
M(3) Number of nonzero Number of nonzero 
elements in row 1 elements in row 2 
M(4) Number of nonzero 
elements in row 3 etc... 
Mi) Number of nonzero 
elements in last row 
M(I+1) Column index of first Column index of sec- 
element stored ond element stored 
M(I+2) Column index of 
third element stored etc. . 
M(J) Column index of last 


element stored 


number) after the end of each row. After each element, its column 
index is entered, the end-of-row sentinel having an index of the form 
90000 + J where J is the row number. At the end of the whole matrix 
there is an additional sentinel (any number) with an index 99999. 

The elements and column indices are entered four per card in 
the format 4 (£15.8, I5); i.e. 


1-15 first element in £15.8 format 
16-20 first column index in 75 format 
21-35 second element 
36-40 second column index 
41-55 third element 
56-60 third column index 
61-75 fourth element 
76-80 fourth column index 

etc. 


Columns 


The elements are preceded by a control card containing in /5 
format: 


Columns 1-5 number of rows in A 
6-10 number of columns in A 
11-15 number of nonzero elements in A 


For example the matrix 


1 2 3 
4 0 5 
0 0 0 
would be entered thus: 


Col. 5 |Col. 10 | Col. 15 
u t 


Card #1: 3 3 5 
\ Cols. | 1-15 20 21-35 |40| 41-54 60/61-75 [76-80 
\ 
a 
NS 
EAS 
Card (say) 
2 1.0E£00 1 2.0£00 2|3.0E00 310.0200 |90001 
3 4.0E00 1 5.0E£00 3|0.0£00 |90002/0.0£00 |90003 
4 0.0E00 199999 


Note the third row must have an end-of-row sentinel. 


The subroutine checks that this information agrees with the 
number of rows and elements actually entered, and that no column 
index exceeds the number of columns as stated. It also checks that 
column indices within a given row are entered in ascending order. 
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Table II. List of Sparse Matrix Subroutines. (¥, MX) means 
“matrix with elements stored in X¥ and control information 
and column indices stored in MX.”’ 


Name and Result See note 
Parameters Function stored in number 
RDS PMX (A,M, Read from (A,M) 1 
NA,NM) cards in non- 
packed form 

ADS PMX(A, Add (A,MA) (C,MC) 2 
MA,B,MB,C, and (B,MB) 
MC,NA,NM) 
MUS PMX (A, Postmultiply (4, (C,MC) 3 
MA,B,MB,C, MA) by the 
MC,NA,NM) transpose of (B, 

MB) 
TRS PMX(A,M, Transpose (A, (AT,MT) 11 
AT,MT,NA, M) 
NM,IP,NP) 
PERROW(4A,M, Permute rows of (AP,MP) 4(a), 5 
AP,MP,IP,NA, (A,M) accord- 
NM,NP) ing to permuta- 

tion in JP 
PERCOL(A,M, Permute col- (AP,MP) 4(b), 5 
AP, MP,IP,AT, umns of (4,M) 
MT,NA,NM, according to 
NP) permutation in JP 
ARS PMX(A,M, Add R times (AN,MN) 
AN,MN,R,IR, row IR of (A, 
IT,NA,NM) M) to row IT 
ACS PMX(A,M, Add R times (AN,MN) 
AN,MN,R,IR, column JR of 
IT,NA,NM) (A,M) to col- 

umn IT 
MRS PMX(A,M, Multiply row IR (AN,MN) 
AN,MN,R,IR, of (A,M) by the 
NA,NM) scalar R 
MCS PMX(A,M, Multiply col- (AN,MN) 
AN,MN,R,IC, umn IC of (A, 
NA,NM) M) by the sca- 

lar R 
ERS PMX(A,M, Exchange rows (AN,MN) 6 
AN,MN,IR,JR, IR and JR of 
J,NA,NM,NP) (A,M) 
ECS PMX(A,M, Exchange col- (AN,MN) 6 
AN,MN,IR,JR, umns JR and JR 
J,NA,NM,NP) of (A,M) 
MVS PMX(A,M, Move (4,M) (AN,MN) 
AN,MN,NA, 
NM) 
SMS PMX(A,M, Multiply all ele- (AN,MN) 
AN,MN,S,NA, ments of (4,M) 
NM) by the scalar S 
RVS PMX(A,M, Extract row IR V 7 
IR,V,N,NA, of (A,M) 
NM) 
CVS PMX(A,M, Extract column V 7, 11 
IC,V,AT,MT,N, IC of (A,MA) 
NA NM,IP) 
INS PMX(A,M, Read from (A,M) 8 
N,NA,NM) back-up storage 

(Fortran unit 

N) in packed 

form 
OTS PMX(A,M, Write (4,M) Fortran 8 
N,NA,NM) onto back-up unit NV 

storage in 

packed form 
WRS PMX(A,M, Print (A,M) in Printer 9 


TIT,NA,NM) 


edited form. 
TIT (10) con- 
tains 10 four- 
letter words de- 
scribing (4,M). 
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Reading is via unit IN, which is set to 5. This may be changed by 
the user if necessary. 

2. A subroutine to subtract (B,MB) from (4,MA) may be ob- 
tained by making the following minor changes to ADSPMX: 

(i) Replace first line by SUBROUTINE SUSPMX(A,MA,B,MB, 
C,MC,NA,NM) 

(ii) Replace statement number 9 by T = A(JA) — B(JB) 

(iil) Replace Ist line after statement number 10, and also 2nd line 
after statement number 12, by C(JA) = — B(JB) 

(iv) In statements 2, 4, 17 & 19 replace “... ADSPMX ...”’ by 
“|... SUSPMX ...” 

3. MUSPMxX requires (B,MB) to be stored column-by-column. 
If it is not in this form the user must first call TRSPMX. 

4(a). In PERROW old row IP(J) becomes new row I. NP is 
dimension of [P (equals number of rows in A). 

4(b). In PERCOL old column J becomes new column JP(J). 
NP is dimension of JP (equal number of columns). A47,MT are used 
internally. 

5. The subroutine ANTIP (see ancillary subprograms below) 
may be used to invert the permutation JP. 

6. J is used internally. NP is number of rows (for ERSPMX) or 
number of columns (for ECSPMX). It is the dimension of J. 

7. The row (or column) extracted from (A,M) by RVSPMX (or 
CVSPMxX) is stored in full in V; i.e. zero elements are included. N is 
dimension of V (equal number of columns or rows in A). 

8. It is often possible to write more efficient subroutines for 
cransfer to/from mass storage devices, using machine coding or 
special subroutines available on individual computer systems. 

9. WRSPMX produces a printout of the nonzero elements of 
(A,M), row-by-row, five elements per line. Each element is followed 
by its column index. Each row is preceded by the heading “row 
number J’’. TIT is printed at top of each page. 

10. In all the subroutines NA,NM are the dimensions of A,M, 
respectively. Checks are made that these limits are not exceeded. 

11. The array IP(NP) in TRSPMX or IP(N) in CVSPMX is 
used internally. 

12. All on-line writes are on unit LP, set to six at start of each 
subroutine. The user may change this number. 


Ancillary Subprograms 

(i) FUNCTION IND(M,I,NM) is used to extract the /th half- 
word from the array M. All the subroutines listed in Table II 
use this except RDSPMX, INSPMX. 

(ii) SUBROUTINE IPK(K,M,I,NM) is used to pack K into the 
Ith half-word of array M. All the subroutines listed in Table 
II use this except MVSPMX, SMSPMX, RVSPMX, 
CVSPMX, INSPMX, OTSPMX, WRSPMX. NM is the di- 
mension of M. 

(iii) ANTIPUIP,AP,N) may be used to invert a permutation array 
IP of N elements, storing the result in AP. For example sup- 
pose JP is (3,1,2), then AP will be (2,3,1). This may be useful 
in conjunction with PERROW and PERCOL. Note also that 
some subroutines call on others: namely, ERSPMX calls 
PERROW, ECSPMX calls PERCOL, CVSPMX calls 
TRSPMX, and RVSPMX. 


Possible Alterations 

(i) | On machines having word lengths of 36 bits or more (such as 
1BM 7000 series), an integer contains over ten decimal digits. 
Hence by a slight change to IND or IPK a five digit integer 
may be stored in each half-word of the M-array. (No change 
to the main subroutines is required.) Thus matrices with up to 
99999 rows or columns can be stored. At the cost of extra 
storage and changes to the main subroutines a similar effect 
can be obtained on the 1BM 360 by using a full-word for each 
column index (then JND and JPK are not needed). 

(ii) If the program does not have to handle matrices with more 
than 999 rows or columns, a further saving of space can be 
made by packing three (more on some machines) indices into 
each word of the M-array. This requires changes to most of 
the subroutines as well as to IND and IPK; e.g. in MRSPMX 
and MCSPMX second line before statement number 2 would 
be changed to J1 = (5 + NRA + NEA)/3. 
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(iii) On the rem 360 the same effect as packing two column indices 
per word can be obtained more easily by declaring the 
M-array to be half-length (two bytes per word), and using one 
(half-length) word per index. Then subprograms IND and 
IPK are no longer required. This requires considerable 
changes to all the subroutines, but may save time. 


Further Extensions. It is hoped to present subroutines for 
solving sparse systems of linear equations, and (perhaps) for solving 
eigen-problems of sparse matrices at a future date. 


Algorithm: 


FUNCTION IND(Me fg NM) 

RESERERA ERE ERE ES EKE EK 
UNPACKS I*TH COLUMN INDEX.ARRAY M CONTAENS TWO 4~-DIGIT 
INDICES PER WORD, LUWER INDEX IN UPPER FOUR DIGITS. 


DIMENSICN M{NM) 
J*TH WORD OF M CONTAINS I*TH INDEX. 
J = tl+1)/2 
L 1S O IF IT -£VEN»s L IF I ODO. 
L = 1-(1/2)*2 
KT CONTAINS UPPER 4 OIGITS OF M(J). 
KT = M(J)/10000 
IF (Ld) Lole2 
1 IND = M(J)-KT*10000 


a a fF AnAaAN 


2 INO = KT 


END 

SUBROUTINE [PKC KeMe Te NM) 

EKER KKKEREE EERE SEES E 
PACKS K (1*TH COLUMN INOEX) IN ARRAY My WHICH WILL 
CONTAIN TWO 4-DIGIT INDICES PER -WORD, LOWER INDEX 
UPPER 4 DIGITS. 


Aaa 


DIMENSTICN MONM) 

J = (1+#1)/2 

L = [-€1/2)*2 

IF (lL) lyle2 

1 MfJ) = MOJ)+K 
RETURN 

2 M(J) = M(J)+K*10000 
RETURN 


END 

A ARO OR AO RR EE HD ER EE EK 
SUBROUTINE ACSPMX(AsMeANeMNeReIRe TT eNAg NM) 
HEAT EA EI ERE EEE EE ER EEE EE EEE ERE 


a 


ADDS R TIMES COL IR TO COL IT OF MATRIX STORED IN As 
PLACING RESULT IN ANe MeMN CONTAIN CONTROL DATA AND COL 
INDICES FOR AyAN. 
NA IS DIMENSION OF AsAN. NM IS DIMENSION OF MyMN. 
REAL AgAN,Re AR 
INTEGER MyMNyERy TT» NAgNMg I pNRAyNCAcL yp JF eNIRgNIRAyJ20Ky 
* KLZTFLLP,J 
OEMENSICN ACNA) »MONM) ANCONA) o MNONM)D 
C LP £S UNIT NUMBER OF LINE PRINTERS 
LP = 6 
C CHECK THAT PARAMETERS WITHIN RANGE. 
IF (REG.0.0) WRITE (LP914) 
IF (IReLE.O.OR-IT.LE-0) GO TO 15 
C CLEAR MN. 
O00 1 I = LyNM 
1 MN(T) = 0 
C CHECK THAT AN DOES NOT OVER-WRITE Aj 
IF (M€1)9.EQ.0) GO TO 17 
C UNPACK AND TRANSFER CONTROL DATA. NRAgNCA ARE NUMBERS OF 
C ROWSeCOLS IA Aw 


aaoannc 


NRA = IND(Ms1,NM) 
NCA = INO(M,2,NM) 
MN(1) = MOL) 


C CHECK PARAMETERS WITHIN RANGE. 
IF (IReGT.NCA.ORIT~GTeNCA? GO TO 15 
C & COUNTS ELEMENTS OF AN. 
Ll = 1 
JF = 4tNRA 
J COUNTS ELEMENTS TO END OF ROW {(I-1). 
9 


c 
J z 
C £ COUNTS ROKS OF A. 
CO 13 I) # lsNRA 
C NIR IS NUMBER IN NEW ROK. 
NIR = 0 
C NIRA £S NUMBER IN ROW I OF A, 
NIRA = IND(Ms44IeNM) 
IF (NIRA.EQ.0) GO TO 12 


COLLECTED ALGORITHMS (cont.) 


C J2 COUNTS ELEMENTS TO END OF CURRENT ROW. 


J2 = J+NIRA 
C J1 COUNTS ELEMENTS UP TO FIRST ONE IN CURRENT ROW. 
J1 = J+l 
C PICK OUT ELEMENT EN COLUMN IR. 
AR = 0.° 
00 2K = JleJ2 
Kl = IND(MsK+JFeNM) 
IF (KLeNE.ER) GO TO 2 
AR = ACK) 
2. CONTINLE 
C PICK OUT ANO ALTER IF NECESSARY ELEMENT IN COL IT. 
C TRANSFER REST OF ROW TO ANsMN. IFL SET TO L WHEN ELEMENT 
C IN COL IT FCUND OR CREATED. 
TFL = 0 
K = JI-1 
3. OK = Kel 
Kl = IND(MyK+JFeNM) 
Al = AUK) 


IF (KL.GE.IT) GO TO 7 
C CHECK IF ARRAYS FULL. 
4 IF (LeLEeNAANDOJFtL.LE22*NM) GO TO 6 
WRITE (LP»5) 
5 FORMAT(21H IN ACSPMX ARRAY FULL) 
CALL EXIT 
COLUMN IT NCT YET REACHED. 
6 AN(L) = Al 
CALL TPK(K1LoMNy JF4i eg NM) 


oO 


L = tl 
NIR = NIRL 
Go TO 10 


7 #+IF (K1.GT.IT) GO TO 8 : 
K1 EQUALS IT, I.E. THERE IS A NON-ZERO ELEMENT IN COL. IT 


Cc 
C UF ROW I. 
TFL =1 
Al = AR#R+A(K) 
IF (AL.NE.0.0) GO TO 4 
Go To 10 
8 IF (IFL.NE.9) GO TO 9 
C Kl GREATER THAN IT ANO ELEMENT IN COL IT HAS NOT YET 
C BEEN FOUND, THUS COL IT HAS A ZERO ELEMENT. 
CFL zl 
K = K-1 
Al = ARR 
C A NEW ELEMENT EN COL IT IS CREATED IF Al NOT ZERO. 
Kl = IT 
IF (AL.NE.0.0) GO TO 4 
GO TO 10 


C K GREATER THAN IT AND ELEMENT IN COL IT ALREADY FOUND OR 
C CREATEDs JUST TRANSFER TO NEW ARRAY. 
9 Al = AUK) 
GO TO 4 


10 IF (K.eLT.J2) GU TO 3 
IF CIFLNE.OsOReARWEQ20.0) GO TO 12 
IF (LOLESNASAND.JFtL LE. 2*NM) GO TO 21 
WRITE (LP,5) 
CALL EXIT 
11 ANCL) = AR*R 
CALL IPK(IT MN, JSE4+LeNM} 
L = L+l 
NIR = NIR+1 
C ENO OF ROW. 
12 CALL EPKINIReMN»44+19NM) 
J = J2 
13 CONTINUE 
C END OF LAST ROW. 
MN(2)) = L-1 
RETURN 
C ERROR MESSAGES. 
14 FORMAT(20H IN ACSPMX R IS ZERO) 
15 WRITE (LP916) 
16 FORMAT(32H IN ACSPMX IR OR IT OUT OF RANGE) 
CALL EXIT 
17 WRITE (LP .18) 
18 FORMAT(35H IN ACSPMX OUTPUT OVER-WRITES INPUT, 
* 34H OR INPUT HAS NO ROWS AND COLUMNS.) 


CALL EXIT 
END : 
CRE RR I RRR RR ROR RR IO IIR SEE REE EER OE 
SUBROUTINE ADSPMX(AsMAeB yMByC eMC eNAy NM) 
c SAR ER IO RA EE RRR EE AOE EEE ER 
c 
C ADD TWO SPARSE MATRICES. 
Cc 
C AvBeC CONTAIN ELEMENTS OF FIRST,SECONOD AND SUM MATRICES. 
C MAgMBsMCe CONTAIN CONTROL DATA AND COL. INDICES FOR AyBsC. 
C NA IS DIMENSION OF AsBsCe NM IS DIMENSION OF MAeMB eMC. 
c 
REAL AgBeC 
INTEGER MAyMBeMC aL PyNRAgNCAsNRBeNCBye Ce KA KBy JB a KF ye 
* TeKAle JAg JIL 9 J2 »NOLDyNAy NM 
DIMENSICN ANA) s MAC NM) ¢B(NA) 9 MB(NM) 9 CCNA) MCONM) 
C LP IS UNIT NUMBER OF LINE PRINTER. 
Le = 6 
C CLEAR MC. 
00 1 IT = 1yNM 
1 mC(I) = 0 


C CHECK THAT C DOES NOT OVER-WRITE A OR Be 

ITF (MA(1)2EQ.02.0RMBI(1)-2EQ.0) GO TO 18 
C UNPACK CONTROL DATA. NRAsNCA ARE NUMBER OF ROWSs COLUMNS 
C IN Aw NRB»y NCB ARE ROWS, COLUMNS IN B. 


NRA =z {NO(MAyl NM) 
NCA = IND(MAy2_NM) 
NRB = INO(MB yl NM) 
NCB = INO(MB,2,_NM) 


C TEST FOR COMPATIBILITY. 
IF (NRASEQ.NRB) GO TO 3 
WRITE (LP,2) NRA,NRB ; 
2 FORMAT(32H IN ADSPMX NUMBER OF ROWS IN A lo 14e 
* 37H) OCES NOT EQUAL NUMBER OF ROWS IN Bly1492H).) 
CALL EXIT 
3 IF (NCA.JEQ.NCB) GO TO 5 
WRITE (LP 4) NCAsNCB 
4 FORMAT(31H IN ADSPMX NUMBER OF COLS IN Alyi4, 
* 36H) DCES NOT EQUAL NUM. OF COLS. IN Blel422H).) 
CALL EXIT 
Cc JC COUNTS ELEMENTS OF C. 
5 JC = 1 


(a) 


ao a Aanan 


OOOMOAAO O 
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KAeKB ARE NUMBERS EN FIRST I ROWS OF AoB. 


KA = 0 
KB = 0 
KF IS NUMBER OF CONTROL DATA IN AgB OR C. 
KF = 4+NRA 
JB COUNTS ELEMENTS OF 8B. 
JB a | 


1 COUNTS ROWS OF AsBeCe 
DO 15 T=LhyNRA 


KB = KB+IND(MB e441 ,NM) 
NIRA ITS NUMBER IN ROW I OF A.W 

NIRA = TNOCMAs4+I_NM) 

TF (NTRA.EQ.O) GO TO 12 

KAL = KAtl 

KA = KA+tNIRA 


JA COUNTS ELEMENTS OF Ao 
DO LL JA= KAlsKA 
6 Ji = [ND(MAsJA+KF NM) 
AT END OF B-ROW TRANSFER REST OF A-ROWe 
IF (JB.GT.KB) GO TO 7 
J2 = END(MB,JB+KF y NM) 
IF (JL-J2) 799,10 
IF A-INODEX LESS THAN B-INDEX TRANSFER A-ELEMENT TO C. 
T IF (JC.GT.NA) GO TO 16 
C(JC)} = AJA) 
8 TF (JC+KF.GT22*NM) GO TO 16 
CALL [PK(J1LeMCyJC+KFy NM) 
Jc = JC+l 
GO TO ll 
IF A-INDEX EQUALS B-INDEX ADO ELEMENTS ,PLACE SUM IN Ce 
9 T = A(JAI+B0JB) 
IGNORE SUM ELEMENT LF ZERO. 
IF (T-E€Q.0.0) GO TO Ll 
IF (JC.GT.NA) GO TO 16 


cise) = T 
JB = JBtl 
GO TO 8 


IF A-INDEX GREATER THAN B-INOEX TRANSFER B-ELEMENT TO C. 
10 IF (JC.GT.NA) GO TO 16 

C(JC) = BtJB) 

IF (JC+KF.GT.2*NM) GO TO 16 

CALL TPK(JS2oMC,JICHKF NM) 


JB = JB+tl 
Jc = JC+1 
GO TO 6 


11 CONTINUE 
END OF ROW CF A. TRANSFER REST OF ROW OF 8. 
12, «IF (J8.GT.KB) GQ TO 13 
IF (JC.GT.NA) GO TO 16 
Ctscd = BIJB) 
J2 = IND(MB,JB+KF,NM) 
IF (JC+KF.GT.2*NM) GO TO 16 
CALL IPKCJ2 MC, JC+KF 9NM) 


JC = JC+L 
J8 = JBel 
GO TO 12 

13 «TF (1.GT.1) GO TO 14 
NOLO = JC-1 

NIRC IS NUMBER IN ROW I OF C. 

NERC = JC-1 
Go TO 15 

14 NIRC = JC-1-NOLD 


NOLD = JC-1 
15) CALL TPKINIRC MC ,4+1_9NM) 


LAST ROW. STORE CONTROL DATA IN MC. 
CALL IPKINRAsMC,1LyNM) 
CALL [PKINCAsMC o2yNM) 
MC(2)) = SCHL 
RETURN 
ERROR MESSAGES. 
16 WRITE (LP917) 
17 FORMAT(41H IN ADSPMX SPACE FOR SUM MATRIX EXCEEDED.) 
CALL EXIT 
18 WRITE (LP,_19) 
19 FORMAT(33H IN ADSPMX SUM OVER-WRITES INPUT, 
* 34H OR ENPUT HAS NO ROWS AND COLUMNS.) 
CALL EXIT 
END 
SRE RARE RHEE ER REE HE EEE ER ERE EKER EERE ERE 
SUBROUTINE ANTIP(N»Pe¢AP) 
REE AE RE ERR EEE 


INVERT PERMUTATION IN Pe PLACING RESULT IN AP. 
N ES NUMBER OF ELEMENTS IN PyAP. 


INTEGER PUN) »APUN) 
LP IS UNIT NUMBER OF LINE PRINTER. 


LP = 6 
CHECK THAT CUTPUT DOES NOT OVER-WRETE INPUT. 
AP{L) = 0 


[F (P(1)}.-£Q.0) GO TQ 4 
DO 3 1 = 1yN 
J = P(T) 
IF (J) lols3 
1 WRITE (LPe2) I 
2 FORMAT(37H IN ANTIP PERMUTATION CONTAINS A NON-y» 
* 28HPOSITIVE NUMBER IN POSITION ,I5) 


CALL EXIT 
3 AP(J) = I 
RETURN 


ERROR MESSAGES. 
4 WRITE (LP,5) 
5 FORMAT(42H IN ANTIP OUTPUT OVER-WRITES INPUT OR PUL)» 
* 8H IS ZERO) 
CALL EXIT 
END 
RRR KKK KR EERE EEK EEK KEKE EERE REREEEARERKEEEEKEARE EEE EK 
SUBROUTINE ARSPMX(AyMeANsMNyRe LRe LT eNAg NM) 
BECK EK EERE KEKE AEE KEKE EERE EEA EEE EEE EEE KER 


ADD R TIMES ROW [IR OF SPARSE MATRIX TO ROW IT. 
AyM CONTAIN ELEMENTS, COLUMN INDICES OF INPUT MATRIX. 


ANeMN CONTAIN ELEMENTS, COLUMN INDICES OF NEW MATRIX. 
NA 1S DIMENSION OF AsAN. NM IS DIMENSTON OF My MNe 


COLLECTED ALGORITHMS (cont.) 


REAL A,AN,R 
INTEGER MyMNeIRe ITs TeNRAgNCAGNEAWNIRASIRe Tbe JT ade JFe 
* SLa ITZ ¢ INeJ2eKR KT UTL s ETL eK eNAeNMe STORLP 
DIMENSEICN ANA) eMONM) yAN(NA) ¢ MN(NM) 
C LP [S UNIT NUMBER OF LINE PRINTER. 
up = 6 
C CHECK PARAMETERS WITHIN RANGE. 
IF CIR LE.O.ORJIT.LE.0) GO TO 23 
C CLEAR MN. 
OO L IT = lyNM 
1 MNIED) = 0 
C CHECK THAT AN DOES NOT OVER-WRITE Aw 
1F (M01)2EQ.0) GO TO 25 
C UNPACK CONTROL DATA, STORE IN MN. 


NRA = INO (Ms,1—NM) 
NEA = M(2) 
MNCL) = MOL) 
0O 2 I = 1sNRA 
IF (IleEQ.IT) GO TO 2 
K = IND (Me 4¢1_,NM) 
CALL [PK(KyMN,4+1,NM) 
2 CONTINUE 


a 


CHECK PARAMETERS WITHIN RANGE. 
IF CIR.GT.NRA.ORSIT.GTeNRA) GO TO 23 


C JReJT ARE NUMBERS OF ELEMENTS BELOW ROWS IRVIT. 


JR = 0 
IF (UR.LE.1) GO TO 4 
IRL = IR-1 
00 3 T = L,fRl 
3 JR = JR+IND(M,4+1,NM) 
4 ST = 0 
JF = 44+NRA 
IF (¥T.LEwL) GO TO 7 
Iv. = IT-1 
00 5 I = leITl 
5 JT = JST+ENDO(Me441,NM) 
C TRANSFER ELEMENTS BELOW ROW IT. 
00 6 f = LeJT 
AN(T) = ACT) 
J = IND(M, JF+1,NM) 
6 CALL TPKOJeMNy JFtE SNM) 
7 JTO: = JT 
C AUD R TIMES ROW FR TO ROW IT. 
JT2 = JT+IND(M,4¢I1T NM) 
JT = JT+l . 
C JN COUNTS ELEMENTS UF NEW MATRIX. 
JN = JT 


C NIRR IS NUMBER OF ELEMENTS IN ROW IR OF Aw 
NERR = IND(M,4+IR,NM) 
IF (NIRR~EQ.0) GO TO 14 


Jl = JR+1 
J2 = JR+NIRKR 
00 13 T= JlyJ2 


a 


CHECK ARRAY LIMIT. 
TF) (JINJLESNAJAND. ( IN*+JF)LE2Z*NM) GO TO 8 
WRITE (LPe21) 


CALL EXIT 
8 KR = ENOD(M, JF +1 y NM) 
9 IF (JST.GT.JT2) GO TO 12 
KT = IND (Ms, JF4JTs NM) 


IF (KT.GE.KR) GO TO 10 
ANUJNDSACJT) 
CALL EPK(UKTsMNe ING JF NM) 


JN = JN+1 
JT = JT+1 
Go To 9 

10 IF (KT.GT.KR) GO TO 12 
S = ACJST)+#R¥AUT) 
IF (€$.£Q.0.0) GO TO ll 
ANLJND=S 
CALL TPKUKT sMNy JING UF oNM) 
JN = JNeLl 

ll JT = JT¢L 
GO TO 13 

12 > =«S = R¥A(T) 
IF (S.E&Q.0.0) GO TO 13 
ANT JND=S 
CALL IPK(KReMNy JN+JSF NM) 
JN = JN+1 


13 CONTINUE 
C TRANSFER REST OF ROW IT. 
14 IF [JT.GT.UT2) GO TO 16 
TF FINSLESNASANDe (ING JF) -LE22*NM) GO TO 15 
WRITE (LP ,21) 


CALL EXIT 

15 ANGJN) = ACJT) 
KT = ANDUMys JF+J3T NM) 
CALL EPK(KTeMNg JF SNe NM) 
JN. = JNeL 
JT = JT+l 
GO TO 14 

C JTL TS NUMBER IN ROW IT OF AN. 
16 JTL = JN-1-JTO 


CALL IPKUSTL eMNy 441 Te NM) 
~ STORE ROWS ABOVE IT. 
IF fIT.EQeNRAY GO TO 20 
Th = [T+l 
D0 19 T=ITL,NRA 
C K TS NUMBER IN ROW Ie 


a 


K = [NO (My 4¢1,NM) 
IF (K2EQ.0) GO TO 19 
JT = JT2 

JT2 = JT+K 


JTL = JT+#1 

BDO 18 J = JT1,JT2 
TF CJNCLESNASANDS(UNtJF)LE2*NM) GO TO LT 
WRETE (LP ,_21) 


CALL EXIT 
17 AN(JN) = ACJ) 
K = INDOM,J+JFeNM) 
CALL EPK(KeMNy JN+ JF eNM) 
18 JN = JNel] 


19 CONTINUE 
20 MN(2)) = JN-1 
RETURN 


Cc 


QOOannaannn Oo 


AANDAAOO AN o 


on 


AOAMAAN O 


aan 


HAMA OO 4 


NAAN 
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ERROR MESSAGES. 
21 FORMAT(21LHOARRAY FULL IN ARSPMX) 
22 FORMAT(20H IN ARSPMX R IS ZERO) 
23 WRITE (LP 24) 
24 FORMAT(33H IN ARSPMX IR OR IT OUT OF RANGE.) 
CALL EXIT 
25 WRITE (LP,26) ; 
26 FORMAT(35H IN ARSPMX OUTPUT OVER-WRITES INPUT, 
* 34H OR INPUT HAS NO ROWS ANO COLUMNS.) 
CALL EXIT 
END 
SHREK ERE EE KEE RARER EERE RE EEE ARE EEE EEE EERE ERE EK 
SUBROUTINE CVSPMXtUAsMeICoVeAToMT aNyNAgNMy IPD 
EERE EEE RE ERE EERE ERE RER EEE KEE EER KEE EE 
EXTRACTS COL IC OF SPARSE MATRIX IN AsSTORING RESULT IN V 
IN EXTENDED FORM, 1.€. ALL ELEMENTS INCLUDING ZEROS ARE 
REPRESENTED. 
M CONTAINS COLUMN INDICES OF A. 
AT»MT ARE USED INTEKNALLY. 
N IS DIMENSION OF Ve NA IS OIMENSTION OF AyAT. 
NM IS DIMENSION OF MeMT. 
1° US USED INTERNALLY BY TRSPMX. 


REAL AeATeV 
INTEGER MyIC MT eNgNAgNMy IP 
DEIMENSICN ACNA) eM(NM) sATONAD oMTUNM) VEN) STP END 
CALL TRSPMX(AgMsAT oe MT gNAoNMy I PeN) 
CALL RVSPMXCATe MT eT CoVeNeNAgNM) 
RETURN 
END 
BEEK K HEE AK HE EEE EEE EKER EERE KER ERKEEE EKER SE EEE KEKE 
SUBROUTINE ECSPMX(AgMyANeMNe I Rye SRe Je AT eo MT yNAgNMyNP) 
WARESEEKER REE EERE REE EERE REE EEE EK REKEEEE ERE KEKE 


RESULT IN AN. 

EXCHANGE COLUMNS IRyJR OF SPARSE MATRIX IN Ay STORING 
MeMN CONTAIN COLUMN INDICES OF A,AN. J 1S USED INTERNALLY. 
AT,MT ARE USED BY PERCOL. : 

NA IS DIMENSTON OF AsgANe NM IS DIMENSION OF MeMN. 

NP IS DIMENSION OF JeATeMTo 


REAL AyAN,AT 
INTEGER MeMNoIReIRe J eNCAgNAgNMy NP OMT 
DIMENSICN ANA) »M(NMD SANNA)» 
* MNONM) ¢ JENPDSATONP) oMTOUNP) 
SET UP PERMUTATION ARRAY J WITH ITRsJR INTERCHANGED. 
NCA IS NUMBER OF COLUMNS IN A. ; 


NCA = INO(M,2eNM) 
DO L I * LyNCA 
1 Jr) I 


JCIRD = JR 
JCJRY = IR 
PERMUTE COLS OF A. 
CALL PERCOL (AgMyANeMNe Je AT MT pNAyNM,NP) 
RETURN 
END 
EERE REE EEE EEE EEA REESE ERE EEK ESET EEE AES ERED EERE REE EE EE 
SUBROUTINE ERSPMX(AgMyANeMNy IReJRe dy NAeNMeNP) 
ERIK KEES EER EEE REE ARERR EER EES EERE KEE EERE ES 


EXCHANGE ROWS IRsJR OF SPARSE MATRIX IN AeSTORING RESULT 
IN AN. Ms MN CONTAIN COLUMN INDICES OF A, ANe : 

NA IS DIMENSION OF A,AN. NM 1S DIMENSION OF MyMN. 

NP IS OIMENSION OF JsWHICH IS USED INTERNALLY. 


REAL Ay,AN 

INTEGER MeMNe IR, JRyNRAGNAgNMe Je NP 

DIMENSICN ANA) sMONM) sANOCNA) » MNONM) » JONP ) 
SET UP PERMUTATION ARRAY WETH IReJR INTERCHANGED AND ALL 
OTHER INTEGERS IN NATURAL ORDER. 
NRA TS NUMBER OF ROWS IN Aw 


NRA = IND(My1 NM) 
DO 1 FE = tyNRA 
1 Jtt) = 1 
JCIRY = JR 
JCJRI = IR 


PERMUTE ROWS OF Ae 
CALL PERROW{AyMy AN yMNy Jp NAgNMyNP D 
RETURN 


EN 
dckentacedubasuadesteawesseeeastaketesssesiacarceee ees 
SUBROUTINE OTSPMX(AyMyNyNAgNM) ; 
Se OR 
WRITE SPARSE MATRIX IN AeM ON FORTRAN UNIT N (MASS STORAGE 
OEVICE). 
NAeNM ARE DIMENSIONS OF AgM. 


REAL A 

INTEGER MsNyNEA,yNRA,NEM,NAy NM 

OIMENSICN ACNA),MEINM) 

NEA [S NUMBER OF ELEMENTS IN Aj 

NEA = M(2) 

NRA IS NUMBER OF ROWS IN A. 

NRA = IND(Myl_NM) 

NEM IS NUMBER OF WORDS IN Me 

NEM = (54+NRA+NEA)/2 

WRITE (N) NEM 

WRITE CN) (MCT) s T#l2NEM) 

WRITE (ND (ACT) oe T=1leNEA) 

REWIND A 

RETURN 

END 
SERRE EE TE ON EE EER EEE EE EEE EEE EEE REE EE OE ER 

SUBROUTINE ENSPMXTAyMeNg NAeNM) 

SREREE EEE REE EE EOE EEE EE EEE ES EEE . 
READ SPARSE MATRIX FROM FORTRAN UNIT N (MASS STORAGE 
DEVICE).} STORE IN Ay WITH COLUMN INDEX ARRAY IN Me 
NAsNM ARE DIMENSIONS OF AsM.e 


REAL A 
INTEGER MyNeNEMyNEA 
DIMENSICN A(NA) »M(NM) 


NEM IS NUMBER OF WORDS IN Me 
READ (N) NEM 
READ (ND (MCT), 1212NEM) 


COLLECTED ALGORITHMS (cont.) 


oO 


aAanaan 


Aonan o 


NEA IS NUMBER OF ELEMENTS IN Aw 
NEA = M(2) 
READ (N) CACT)»y TeleNEA) 
REWIND A 
RETURN 
END 
RE EE RTE RE ERE ERE E EERE ERR AR EERE ER ERER RARER ESE KEES ES 
SUBROUTINE MCSPMXCAgMeANeMNy Re ITC eNAg NM) 
SIGIR OR Sot Rot BIO RR Ir OR BOR ROR IO AO ROR 


MULTEPLIES COL [C OF SPARSE MATRIX IN A BY Re STORING 
RESULT IN AN. MyMN CONTAIN COLUMN INDICES OF AgANw 
NA IS DIMENSION OF AyAN. NM IS DIMENSION OF MeMNo 


INTEGER MyMNo IC eNRAgNCAgNEAg I lol eJe JFeNIRAyJ2e Jl eK eI Ts 
* NAeNMyl» NC ‘ 
DIMENSICN ACNA) »MONM) sANCNA) ¢ MN(NM) 
REAL AsANSR 
LP TS UNIT NUMBER OF LINE PRINTER. 
LP = 6 
CLEAR MN. 
00 1 I = 1yNM 
t MN(T) = 0 
CHECK THAT CUTPUT DOES NOT OVER-WRITE INPUT. 
IF (M(1L)2EQ.0) GO TO 9 
UNPACK AND TRANSFER CONTROL DATA. NRA NCAyNEA ARE NUMBERS 
OF ROWS» CCLS», ELEMENTS IN A. 


NRA = INO(My1leNM) 
NCA = IND (My2_NM) 
NEA = M(2) 
DO 2 T = 1le2 


2 sMNUT) = MCD) 
CHECK IC WITHIN RANGE. 
IF CIC.GTsNCA-OReICeLE-0) GO TO 7 
J COUNTS ELEMENTS TO END OF ROW (I-1) OF Aw . 


J = 0 
JF = 44+NRA 
L COUNTS ELEMENTS OF NEW MATRIX. 


L =l 
I COUNTS ROW OF A. 
0O 5 I= 1,NRA 
NIRA IS NUMBER OF ELEMENTS IN ROW I OF Aw 
NIRA = IND(M,44¢I_9NM) 
NIRAN IS NUMBER OF ELEMENTS IN ROW OF AN. 
NEIRAN = NIRA 
IF (NIRAJEQ.0) GO TO 5 
J2 COUNTS ELEMENTS TO END OF ROW I OF A. 


J2 = J#NIRA 
Jl COUNTS ELEMENTS UP TO FERST ONE IN ROW I OF Ae 
JL = J+l 


PROCESS ROW I OF Aw 
p04 K = JieJ2 
IT = IND(Me SFtKyNM) 
IF (ET.EQ.IC} GO TO 3 
TRANSFER COLUMNS OTHER THAN IC. 
AN({L) = A(K) 
Il = IND(M,JF+K,NM) 
CALL IPKUIL»MNe JF tL NM) 
L = Ll 
GO TO 4 
MULTIPLY COL IC BY Re 
3 IF (ReEQe02.0) NIRAN= NERA-1 
TF ({R.EQ.0.0) GO TO 4 
AN(L) = R*A(K) 
tl = IND(M, JF+K NM) 
CALL EPKCELsMNg JF el NM) 
L = L+l 
4 CONTINLE 
END OF ROW I. 
J = J2 
CALL IPKINIRANs MN, 441 _NM) 
5 CONTINUE 
END OF LAST ROW. 
TF (R.NE.0.0) GO TO 6 
M(2) = L-1 
6 RETURN 
ERROR MESSAGES. 
7 WRITE (LP,8) 
8 FORMAT(26H IN MCSPMX IC OUT OF RANGE) 
CALL EXIT 
9 WRITE (LP,10) 
10 FORMAT(35H IN MCSPMX OUTPUT OVER-WRITES INPUT, 
* 34H OR INPUT HAS NO ROWS AND COLUMNS.) 
CALL EXIT . 
END 
FERRARA EER EAEEREREREEEREREES AEE EERE EEE EERE ERE ERE 
SUBROUTINE MRSPMX(AeMsANe MN Re LR eNAgNM) 
PTET EDEL ELE LE LEE L ETS RT EST TTT ete tt eS 


MULTIPLIES ROW IR OF SPARSE MATRIX A BY Re STORING 
RESULT IN ANe MeMN CONTAIN COLUMN INDICES OF A,AN. 
NA 1S DIMENSION OF AygANe NM IS'DIMENSTON OF MeMNe 
REAL AgANgR ; 
INTEGER MeMNoIReNMyNRAgNEA,I lols 
* SeNTRAsI2 dL eK eNAgl o JF 
DIMENSTCN ACNA) ¢MOENM) pANCNA) ¢ MNONM) 
LP IS UNIT NUMBER OF LINE PRINTER. 
LP = 6 
CLEAR MNe 
00 1 k = LyNM 
1 MNCIE 30 
CHECK THAT CUTPUT OOES NOT OVER-WRITE INPUT. 
TF (M(1)2EQ.0) GO TO 9 


OAOOAAAAAAAN 4 


a a a 


a 
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UNPACK AND TRANSFER CONTROL DATA. 
~ NRA = IND(M,1i,NM) 
NEA = M(2) 
DO 2 2 192 
2 MNCI)D = MOT) 
CHECK THAT IR IS WITHIN RANGE. 
TF CIRLGT.NRAJORERLT.1) GO TO 1} 
J COUNTS ELEMENTS TO END GF ROW (1-1) OF Aw 
J = 0 
& COUNTS ELEMENTS OF NEW MATRIX. 
t = 1 
JF = 4*tNRA 
I COUNTS ROWS OF Aw 
0G 7 I = LyNRA 
NIRA IS NUMBER IN ROW IT OF Aw 
NIRA = IND(M,4+I_NM) 
CALL LTPKINIRAgMN,4419NM) 
IF (NIRA.EQ.0) GO TO 7 
J2 COUNTS ELEMENTS TO END OF ROW I OF Ae 


J2 = J#NIRA 
Jl COUNTS ELEMENTS £0 FIRST ONE EN ROW I OF Aw 
Jl = J#l 


IF (F.EQ.IR) GO TO 4 
TRANSFER ROWS OTHER THAN Te 
00 3 K = JleJ2 
1) = IND(M,JF+K,NM) 
CALL IPK(T1LeMN, JF tL 9NM) 
ANIL) A(K) 
3 L L+1 
GO TO 6 
MULTIPLY ROW IR BY Re 
4 005 K = JleJ2 
IF (R-EQ.0.0) GO TO 5 
11 = IND(M,JF+K,NM) 
CALL IFPK(TLeMN, JF el NM) 
AN(L) = R¥A(K) 
L = Le) 
5 CONTINUE 
6 6J = J2 
END OF ROW I. 
7 CONTINUE 
END OF LAST ROW. 
IF (R.NE.0.0) GO TO 8 


wou 


M2) = .t-l 
CALL IPK(O,MN»4¢IRy NM) 
8 RETURN 


ERROR MESSAGES. 
9 WRITE (LP,10) 
10 FORMAT(36H IN MRSPMX QUTPUT OVER-WRITES 
* 34H OR INPUT HAS NO ROWS AND COLUMNS.) 
CALL EXIT 
11 WRITE (LPsl2) 
12 FORMAT(26H IN MRSPMX IR OUT OF RANGE) 
CALL EXIT 
END 
PERRET R TR EEE EEE EEE EROS EERE EE EE EERE EER EERE 
SUBROUTINE MUSPMX(AsMAeB MB eC oMCeNAg NM) 
CREAR REE EEE AEE SE EEE EE ESS EEE EE EEE 


INPUT ¢ 


MULTIPLY TWC SPARSE MATRICES. 
B MUST BE STORED BY COLUMNS, I-E. WE FORM C = 

A*(B TRANSPCSED). 

AeBeC CONTAIN FIRST,SECOND AND PRODUCT MATRICES RESPECT- 
IVELY. 

NAsMBeMC CONTAIN COLUMN INDICES OF FIRST SECOND AND 
PRODUCT MATRICES RESPECTIVELY. 

NA IS DEMENSION OF AsyBeCeo NM ES DIMENSION OF MA,MByMC. 


REAL AgBoCyS 
INTEGER MAsMByMC pNRAgNCAgNRBgNCBeL Ce KA KAFyKBF eKBeKAly 
* KBLeJBe JAM, JBM e JL gp I2oLCMeNEC pKa IT eto deNAgNMyLP 
DIMENSECN ACNA) sBENA) eCUNA) sMA(NM) @ MB UNM) oe MC(NM) 
LP [LS UNIT NUMBER OF LINE PRENTER. 


Lp = 6 
CLEAR MC. 
DO 1 I = LyNM 


1 MmMC(I) = 0 
CHECK THAT C DOES NOT OVER-WRITE A OR Be 
TF (MA(1)0EQ.0.0R.MB(1)~-EQ.0) GO TO 16 
UNPACK CONTROL INFORMATION. NRA IS NUMBER OF ROWS IN A. 


NRA = IND(MAs1,_,NM) 
NCA 1S NUMBER OF COLS IN Ae 
NCA = END(MAs29NM) 
NRB»NCB ARE NUMBER OF ROWS ANC COLUMNS IN B. 
NRB = INO(MBe1,NM) 
NCB = IND(MB,2~9NM) 


TEST FOR ‘COMPATIBILITY. 
IF (NCA-EQ.NRB) GO TO 3 
WRITE (LP_2) 
2 FORMAT(31H A ANO B INCOMPATIBLE IN MUSPMX) 
CALL EXIT 
LC IS NUMBER OF ELEMENTS IN C. 
3 LC zl 
KAFsKBF ARE NUMBERS OF CONTROL DATA IN MAsMB~ 
KAF = 4+NRA 
KBF = 4+NRB 
KA»KB ARE NUMBERS QF ELEMENTS IN 
KA = 0 
NEC IS NUMBER OF ELEMENTS IN Co 
NEC = 0 : ; 


FIRST I ROWS OF AvB. 


— 
o 
~- 
w 
~ 
ou 


NIRA IS NUMBER IN ROW I OF Aw 
NIRA = END (MAs4+1,NM) 


COLLECTED ALGORITHMS (cont.) 


C NIRC IS: NUMBER IN ROW IT OF C. 


c 


c 


NIRC = 0 

IF (NIRA.JEQ.O) GO TO 15 
KAl = KA+tl 

KA = KAtNIKA 


DO 14 J = 1,.NCB 


NIRB IS NUMBER IN ROW I OF Bo 
NIRB = INDOMB,44+J_NM) 
IF (NIRB.EQ.0)} GO TO 14 
KB1 = KB+lL 
KB = KBtNiRB 
C S WILL CONTAIN f-J ELEMENT OF Cy 
Ss = 0. 
JB COUNTS ELEMENTS IN 8B. 
JB = KBL 
DO 8 JA=KALeKA 
JAM = JAtKAF 
5 JBM = JB+KBF 
J1 = IND (MA, JAMyNM) 
J2 = IND (MB, JBMsNM) 
IF (41-J2) 89607 
6 s = StA(JA)*B( JB) 
[F (JB.€Q.K8) GO TO 9 
JB = JB41 
GO TC 8 
7 IF (JB.EQ.KB) GO TO 9 
: JB = JB+l 
GO TC 5 


c 


c 


oOo 


o 


oa 


AAanne 


o 


AMAMANMANOADA 


8 CONTINUE 
IF ELEMENT ZERO OO NOT STORE. 
9 IF (S.£Q.0.0) GO TO 14 
TF ({LC.LE.NA) GO TO 11 
WRITE (LP510) 
10 FORMAT(L7THOSIZE OF PRODUCT , 
* 25HMATRIX EXCEEDED IN MUSPMX) 


CALL EXIT 
STGRE ELEMENT AND INDEX IN CyMC. 
i circ) =S5 
LCM = LC+KAF 


IF (LCM.LE.2*NM) GO TO 13 
WRITE (LP,12) 
12 FORMAT(40OH SIZE OF INDEX MATRIX EXCEEDED IN MUSPMX) 
CALL EXIT 
13 CALL IPK(JSeMC,LCM,NM) 
tc = LC+l 
NERC = NIRC#L 
14 CONTINLE 
NEC = NEC+NZRC 
CALL TPKONERC MC e441 9NM) 
15 CONTINUE 
STORE CONTRCL DATA IN MC. 
CALL EPK(NRA,sMCe,1,NM) 
CALL FPK(NCBsMC,2,NM) 
MC(2) = NEC 
RETURN 
ERROR MESSAGE. 
16 WRITE (LP,17) 
17 FORMAT(36H IN MUSPMX PRODUCT QVER-WRITES INPUT, 
* 34H OR INPUT HAS NG ROWS AND COLUMNS.) 
CALL EXIT 
END 
FO EOE ROE $0 EO OR OR FOR EE OR EE 
SUBROUTINE MVSPMX(AsMsANgMNgNAgNM) 
tO OE Os ee EE CE 


MOVE SPARSE MATRIX IN A TO ANe 
My»MN CONTAIN COLUMN INOICES FOR AgAN. 
NA IS DIMENSION OF AsgANe NM IS DIMENSION OF My MNe 


REAL Ay AN 

INTEGER MeMNeNEAgI sNRAgNeNAgNM 

DIMENSEICN ACNA) »MONM) eAN(NA) ¢ MN(NM) 
NEA 1S NUMBER OF ELEMENTS IN A. 


NEA = M(2) 
MOVE A. 
OO 1 I = 1l,»NEA 
1 ANCE) = ACT) 
NRA ES WUMBER OF ROWS IN Ae 
NRA = IND(M,1,_,NM) 
N IS NUMBER OF WORDS IN Me 
N = (S+NRA+NEA)/2 
MOVE Me 
00 2 1 = 1eN 
2 MN(T) = M(T) 
RETURN 
END 


ERED ERE EEE EERE ERE EEE EERE EE 
SUBROUTINE PERCOL(A Me AP eMPy IP, AT» MT pNAgNMy NP) 
EO EH EE REE ED EK EE ERE EET EE EEK 
PERMUTE COLUMNS OF A SPARSE MATRIX STORED IN Aw 
M CONTAINS COLUMN INOICES OF A. 


APeMP WILL CONTAIN ELEMENTS ANDO COLUMN INDICES OF RESULT. 


TP CONTAINS PERMUTATION--THAT IS OLO COLUMN I BECOMES NEW 
COLUMN IPCI). 

AT SMT WILL CONTAIN ROW OF AgM.e 

NA IS DIMENSION OF AyAP. NM IS DIMENSION OF MyMP. 

NP IS OIMENSTON OF AT ¢MTyIP. 


REAL AgAPsATe Al 

INTEGER MeoMPoIPyNReNColotleNIReKe 
FLeNy Jo Jl ol JoNly TFL eMLeNAyNMyNPyLlP 
DIMENSTICN ACNA) »MONM} pAPINAD» 

* MP(NM) o EPONP) SATINP) gS MT INP) 


LP IS UNIT NUMBER OF LINE PRINTER. 
up = 6 


OHAOGTAGAANHA 


oO 


c 
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CLEAR MP. 
DO 1 iT = LeNM 
1 MP(T) = 0 
CHECK THAT CUTPUT DOES NOT OVER-WRITE INPUT. 
IF (M(1).2€Q.0) GO TO 15 
UNPACK CONTROL INFORMATION. 
NR = IND(My1yNM) 
NC = IND(M»2—9NM) 
CHECK THAT EP(K) IS WITHIN RANGE. 
00 2 K = 1,NC 
IF CIP(K)LEOeOR-IP(K).GT.NC) GO TO 13 
2 CONTINUE 
TRANSFER COATROL DATA TO MP. 
12 = 44NR 
00 3 I = 1,12 
K = I[NO(M,I NM) 
3 CALL IPK(KyMPy i eNM) 
L COUNTS ELEMENTS ALREADY PERMUTED. 
L = 0 
1 IS ROW COUNTER. 
DO 12 I = LlyNR 
N = IND(M,4+1,NM) 
IF (NJEQ-0) GO TO 82 
STORE ROW f IN AT WITH COLUMN INDICES IN MT. 


DO 4 Jo = lyN 
Jl = 44#NR4LeJd 
K = IND(MyJL,NM) 
MT(J) = IP(K) 
LJ = LtJ 


4 AT{(J) = ACLJI) 
IF (N.EQ~1) GO TO 10 


Nl = N-1l 
IFL WILL REMAIN O WHEN SORTING OF ROW I COMPLETE. 
5 TFL = 0 
SORT ELEMENTS OF ROW [ IN ORDER OF INCREASING COLUMN INDEX 
DO 9 Jo = lyNl 


IF (MTC JSI-MTC(JS+1)) 99608 
ERROR MESSAGE. 

6 WRITE (LPy7) 

7 FORMAT(26H IN PERCOL 2 INDICES EQUAL) 
CALL EXIT 

8 ML = MT(J) 
MT(J) = MT(J+1) 
MT(J+1)= Al 


Al = AT(S) 
AT(J) = AT(J+L) 
AT(J+#L)= AL 
TFL =u 

9 CONTINUE 


IF (TFL~EQ.1) GO TO 5 
TRANSFER ROW I. 
to O00 11 J = lyN 


LJ = L+J 
AP(LJ) = ATU) 
Jl = LJU+44+NR 
K = MT(J) 
ll CALL IPK(KeMPs Jl eNM) 
2c = LtN 
RETURN 


ERROR MESSAGES. 
13 WRITE (tPe14) 
14 FORMAT(43H IN PERCOL PERM CONTAINS INDEX OUT OF RANGE) 
CALL EXIT 
15 WRITE (LP,16) 
16 FORMAT(35H EN PERCOL OUTPUT OVER-WRITES INPUT, 
* 34H CR INPUT HAS NO ROWS AND CCLUMNS.) 
CALL EXIT 
END 
SEEKER EEE ERE KEE KEK AERA SE EEEE EERE TED EERE EERE REE RHEE EEE EREE 
SUBROUTINE PERROW(AsMsAP eMPe LP ye NAgNMyNP ) 
FEREEEKHEREKEKE EERE EERE EE ERE ED TESS EEE 
PERMUTE ROWS OF A SPARSE MATRIX STORED IN Ao 
M CONTAINS COLUMN INDICES OF ENPUT MATRIX Ae 
AP CONTAINS ELEMENTS OF OUTPUT MATRIX « 
MP CONTAINS COLUMN INDICES OF OUTPUT MATRIX. 
IP CONTAINS PERMUTAFION--1t-E. OLD ROW IPCI) BECOMES NEW 
ROW Te 
NA IS OIFMENSTON OF AsAP. NM IS DIMENSION OF MeMP. 
NP IS DIMENSION OF IP. 


REAL AyAP 

INTEGER MeMPeEPeNReNCol ell eNIRe I 2y 
*KeLAgl MeNigJedIL eo T3yN2—eNMpNAeNPyLP 
DIMENSICN ACNA) »MONM) AP(NA) o MPOUNM) s IPONP) 


LP LS UNIT NUMBER OF LINE PRINTER. 
LP = 6 
CLEAR MP. 
OO L I = LeNM 
1 MP(E) = 9 
CHECK THAT CUTPUT DOES NOT OVER-WRITE INPUT. 
{F (M(1L).EQ.0) GO TO 10 
UNPACK CONTROL INFORMATION. 
NR = IND(Msl_)NM) 
NC = INO(M,2_yNM) 
RECORD NUMBERS OF ROWS »COLUMNS ANO ELEMENTS IN MP. 
00 2 Tf = Le2 
2 MP(T) = MOT) 
LAeLM COUNT ELEMENTS IN AP,MP. 
LA al 
LM = 5StNR 
PERMUTE ROWS. 
DO 9 I= 1eNR 
NI z= 0 


COLLECTED ALGORITHMS (cont.) 


C J IS OLD NUPBER OF NEW ROW I. 
J = IP(1) 
K = IND(Ms44J_NM) 
CALL IPK (Ke MP, 44+I,NM) 
IF (JeGTeNReOReJeLE.0) GO TO 3 
Go TO § 
C J OUT OF RANGE--GIVE ERROR MESSAGE. 
3 WRITE (LPo4) I 
4 FORMAT(38H IN PERROW PERM CONTAINS INDEX OUT OF 
* LTHRANGE IN POSITIONS 13) 
CALL EXIT 
C PICK QUT START AND END OF ROW J. 
5 IF (J-EQ.1) GO TO 7 


Jl =z J-1 
00 6 13] LyJl 
6 Nl =NL+INO(M,44+13—NM) 


C NIRJ 1S NUMBER IN ROW J OF Aw 
T NIRS = IND(Ms4+J_9NM) 
IF (NIRJ2EQ.0) GO TO 9 
N2 = NL4tNEIRJ 
NL = NL¢l 
C TRANSFER ROW J OF AeM TO ROW I OF APeMP. 
DO 8 132 NL,N2 
APC(LA)= AC(T3) 
K = IND(Ms44NRt13_,NM) 
CALL IPK(K »MP yl MyNM) 
LA = LAtL 


8 uM = LM+l 
C END OF LOOP ON [(ROW NUMBER). 
9 CONTINUE 
RETURN 


C ERROR MESSAGE. 
10 WRITE (LP,11) 
LL FORMAT(35H IN PERROW OUTPUT OVER-WRITES INPUT, 
* 34H OR INPUT HAS NO ROWS AND COLUMNS.) 
CALL EXIT 
ENO 
RHR REE ERK KEE EKER AERA ARH ERKEERERERERERE EKER KE REESE 
SUBROUTINE ROSPMX(AyMyNAyNM) 
RRA EEE EEE EERE SEK KEREK 


READS A SPARSE MATRIX FROM CARDS INTO ARRAY AsSTORING 
COLUMN INDICES ANO CONTROL DATA IN Me 
NA IS DIMENSION OF Ay NM IS DIMENSION OF M. 


GSaAaAHaAAR a 


REAL A 
INTEGER NReNCeNEeJE eo IRe IF eK ae MIN My 
* TeNTRe So db eo J2eb lel 2sNAgNMelLIMFe INyLP 
OIMENSICN ACNA) »MONM) SAIN(4) 9 MIN(4) 
C IN IS UNIT NUMBER OF CARD READER. 


IN = 5 
C LP IS UNIT NUMBER OF LINE PRINTER. 
Le = 6 


C NReNCeNE ARE NUMBERS OF ROWS,COLS,AND ELEMENTS IN Aw 
READ CEN»1) NRe NC NE 
lL FORMAT(315) 
C JE,JR COUNT NUMBER GF ELEMENTS,ROWS. 


JE =] 
JR = 1 
DO 2 I = 1s,NM 
2 Mil) = 0 
IERR = 0 
LIMF = 0 
C JF IS NUMBER OF CONTROL DATA. 
JF = 44NR 
C K COUNTS ELEMENTS WITHIN ROW. 
K = 0 


C AIN,sMEIN ARE ELEMENTS ANO ENDICES AS REAC FROM CARD. 
3 READ CIN|4) CAINCED SMENGI) » L=154) 
4 FORMAT(4(E15.8915)) 
00 10 [I = 144 
C CHECK FOR RCW-SENTINEL. 
TF (MIN(T).GE.90000) GO TO 9 
C CHECK VALIDITY OF COLUMN-INDEX. 
IF (MINCT).LE.NC) GO TO 6 
WRITE (LP,5) MINCT JE 
5 FORMAT(LSH COLUMN INDEX (515e20H)GREATER THAN NUMBER: 
* 24H OF COL IN ELEMENT NUM. 915) 
TERR = 1 
C STORE ELEMENT. 
6 IF (JELESNASANDe ( JE+JF).LE22*NM) GO TO 8 
IF (LUMF.LEQ.1) GO TO 10 
LIMF = 1 
WRITE (LP,7) 
7 FORMAT(21HOARRAY FULL IN RDSPMX) 
Go TO 10 
8 AC(JEP = AIN(T) 
CALL TPKIMINCT )oMy JE+JFyNM) 


JE = JE#l 
K = K+tl 
GO TO 10 


C CHECK FOR END-OF—-MATRIX SENTINEL. 
9 IF (MINCE) 2EQ.99999) GO TO 11 
C RECORD NUMBER OF ELEMENTS IN ROW JR OF A. 
CALL TPK(KyMe4tJUR_ NM) 
« = 0 
JR = JR+1 
10 CONTINUE : 
C ikEAQ NEW CARD. 


GO TO 3 
C AT END OF MATRIX CHECK NUMBER OF ROWS [S AS STATED. 
LL SR. a JR-1 


IF (JREQ.NR) GO TO 13 
WRITE (LPo12) JReNR 


408-P 8. 


12 FORMAT(1L7H NUMBER OF ROWS (¢15,17H) DOES NOT EQUAL -» 
* LSHSTATED NUMBER (,05e1H)) 


TERR = 1 
C CHECK NUMBER OF ELEMENTS. 
13 JE = JE-1 


IF (JE.EQ.NE) GO TO 15 
WRITE (LP 14) JENE 
14 FORMAT(21H NUMBER QF ELEMENTS (,15e911LH) DOES NOT » 
* 21HEQUAL STATEO NUMBER (71591H)) 


TERR el 
C CHECK ASCENCING ORDER OF INDICES. 
15 J = JF 


0O 19 T= 1leJR 
C NER £S NUMBER IN ROW IT OF Aw 


NIR = IND(M,44+I,NM) 
J2 = NER+J 
Jl = J+2 
TF (NIR.JLE.L) GO TO 18 
DO 17 K = JleJ2 
ul = IND(MyK-1L»NM) 
2 = IND(MeKeNM) 
IF (LLetT.L2) GO TO 17 
Kl = K-J1L+2 


WRITE (LP,16) Kil 
16 FORMAT( 9H ELEMENT 91598H IN ROW »f5e11H HAS WRONG 
* L2HCCLUMN INUVEX) 
(ERR = 
17) CONTINUE 
18 og = J2 
19 CONTINUE 
C STORE CONTRCL DATA. 
CALL IPKOUNRyMel ye NM) 
CALL IPKINC»My2eNM) 
M(2) = NE 
IF (TERR-GE.L) CALL EXIT 


RETURN 

END 

EERE EEE EEE REE 

SUBROUTINE RVSPMX(AyMeIReVeNeNAgNM) 

SAH ER RES ER EE EEE EE EEE 
EXTRACTS ROhw IR OF SPARSE MATRIX IN AsSTORING RESULT IN 
VECTOR V IN EXTENDED FORM, I.E. INCLUDING ZERO ELEMENTS. 
M CONTAINS COLUMN INDICES OF A. 
NyNAsNM ARE OIMENSIONS OF VoAeM. 


OHaAG0OaAA o 


REAL AsV 
INTEGER MeIRpNeNRAgITsNIRSy IRE g IM Ke J oNIRAPNANM GLP 
DIMENSICN AUNA) ¢M(NM) SVEN) 

C€ LP IS UNIT NUMBER OF LINE PRINTER. 


LP = 6 
C NRA 1S NUMBER OF ROWS IN Aw 
NRA = {ND(M,lyNM) 


TF (CIRGT.NRA) GO TO 2 
C N IS NUMBER OF COLS IN A (EQUALS NUMBER OF ELEMENTS IN V). 


N = IND(My2_9NM) 
C CLEAR Ve. 
DO lL I = LeN 
1 v(t} = 0.0 
C NIRS WILL BE NUMBER OF ELEMENTS IN ROWS PRIOR TO IR. 
NERS = 0 


IF (1R-1L) 29694 
2 WRETE (LP 23) 
3 FORMAT(34H EN RVSPMX ROW NUMBER OUT OF RANGE) 


CALL EXIT 
4 IRL = ER-L 
DO 5 f = 1,ER1 
5 NIRS = NIRS*tINO(M,4+E,NM) 
6 JM = 4+NRAtNEIRS 
C NERA IS NUMBER IN ROW [Re 
NERA = END(M,44+IR,NM) 


IF (NIRA.JLE.O) GO TO 8 
C TRANSFER ELEMENTS OF ROW IR. 
oe 7 1 L»NIRA 
K IND (M, JM+1,NM) 
J NIRS+f 
7 VIK) A(J) 
8 RETURN 
END 
FR AOR ROR RIOR Rf OR AOI ICE RR KE ARO OE 
SUBROUTINE SMSPMX(AyMyANpMNySyNAgNM) 
eRe a OR OR OR OR 


uu ow 


oO 


ANe 
MyMN CONTAIN COLUMN INDICES FOR AyAN] 
NA [LS DIMENSLON OF AgAN. NM IS DIMENSION OF My MNw 


AaAaNAAN 


REAL AyANyS 
INTEGER MyMNyNEAg I eNRAgNeNAgNMglLP 
DIMENSICN ACNA} e MONM) sAN(NA) » MN(NM) 


c 
C LP IS UNIT NUMBEt. F LINE PRINTER. 


LP = 6 
C CHECK THAT CUTPUT ODES NOT OVER-WRETE ENPUT. 
MN(L) = 0 


IF (M(1)2EQ.0) GO TO 5 
IF (S.EC.0.0) GO TO 3 
C NEA IS NUMBER OF ELEMENTS IN Ao 
NEA = M(2) 
C MULTIPLY A BY Se 
00 1 I = 1lsNEA 
L ANCI) = ACT)*S 
C NRA TS NUMBER OF ROWS IN Aw 
NRA = INO(My1l_9NM) 
N TS NUMBER OF WORDS IN Mo 
N = (5+NRA+NEA)/2 


o 


MULTIPLY SPARSE MATRIX IN A BY SCALAR Sy, STORING RESULT IN 


0 


COLLECTED ALGORITHMS (cont.) 


C MOVE Me: 
pO 2 1 = 1yN 
2 MNCL) = MET) 


RETURN 

3 MNCELPO = MOD) 
NRA: = IND(M,1,NM) 
K =(S+#NRAD/2 


004 f = 2ek 
4 MNUT) = 0 
RETURN 
C ERROR MESSAGE. 
5 WRITE (LP.6) 
6 FORMAT(35H IN SMSPMX GUTPUT OVER-WRITES INPUT, 
* 34H OR INPUT HAS NO ROWS ANO COLUMNS.?} 
CALL EXIT : 
END: 
CREEKS KEE EERE EEE KEES EKER EKAEEEEEAEREHE EEE KEES EK 
SUBROUTINE TRSPMX(AeMeAT oMT yNAgNMy IP 9 NP) 
SOE EREE EEA EER EAE ERE EHEESE EEE E 


a 


TRANSPOSE A SPARSE MATRIX IN Ay STORING THE RESULT IN AT. 
MeMT CONTAIN COLUMN INDICES QF AsAT. 

NA IS DIMENSION OF AsAT. NM IS DIMENSION OF MyMT. 

TP(f) WILL @& NUMBER OF ELEMENTS IN COLUMN . I OF Ay ALSO 
CP(L) WEILL BE POINTER TO FIRST ELEMENT IN ROW f OF AT. 

NP {S DIMENSION OF IP. 


OOAAaaaoain 


INTEGER TetCefFCeIFRelTe dle IP 

REAL A,AT 

DIMENSICN ACNA)  MO(NM) pATI(NA) » MTUNM), LOC NP) 
C LP IS UNIT AUMBER OF LINE PRINTERS 


up = 
C CLEAR MT. 
DO 1 I = 1yNM 


au 


1 MTU) = 0 
C CHECK THAT AT DOES NOT OVER-WRITE Aq 
IF €M(1).E0.0) GO TO 8 
C UNPACK CONTROL INFORMATION. NRyNCoNE ARE NUMBERS OF ROWS, 
C COLUMNS ANC ELEMENTS IN Aq 


NR = IND(My1,—NM) 
NC = INO(M,y2_NM) 
NE = M(2) : 

C CHECK FOR PCSSIBLE OVERFLOW OF MT. 
L = 4+NC#NE 


IF (L.GT.2*NM) GO TO 10 

C PACK NUMBER OF ROWS(NC)}, COLUMNS(NR) AND ELEMENTS OF AT. 
CALL TPKINC MT, 1 ¢NM) 
CALL IPKINR,»MT,2_,NM) 


MT({2) = M(2) 

C IFR»IFC ARE NUMBER OF CONTROL DATA IN A,AT. 
IFR = 4+NR 
TFC = 44NC 

C CLEAR IP. 


DO 2 I = IsNC 
2 rPtt) = 0 
C COUNT NUMBER OF ELEMENTS IN EACH COLUMN OF ACROW OF AT). 
pO 3 T = 1,NE 
K = IND(M, [FR+I,NM) 
3 IPCK) = IPCK)4L 
C PACK NUMBERS OF ELEMENTS IN ROWS OF AT. 
dO 4 J = L»NC 
K = FP) 
4 CALL IPK(KeMT:4%J39NM) 
C SET UP POINTER TO FARST ELEMENT IN ROW I OF AT. 
NICAL = [P(¢1) 
1 


re(t) = 
DO 5 IT = 2,NC 
NICA = TPCT) 
IPCI) = [TPCI-L)*NICAL 
5 NICAL = NICA 


C PROCESS ROWS OF Aw JL IS POSITION OF FIRST ELEMENT OF 
C CURRENT ROW OF A. 
J1 = 1 
C I COUNTS ROWS OF A. 
DO 7 I = LeNR 
C NIRA 1S NUMBER OF ELEMENTS IN CURRENT ROW OF A. 
NIRA = IND (Me 44+I,NM) 
IF (NIRA.EQ.0) GO TO 7 
J2 = JLtNIRA~1 
DO 6 J = JleJ2 
C K IS COLUMN NUMBER OF J*TH ELEMENT IN Ay I.£.ROW NUMBER 
C IN AT. 


K = IND(M,IFR4+J,NM) 
C IT tS POSITION OF CURRENT ELEMENT IN AT. 
If = IP(K) 


AT(IT) = AG) 
CALL LPKOLeMTo TT+LE Ce NMI 
6 IP{K) = IP(K)4l 
7 i = J2+1 
RETURN 
C ERROR MESSAGES. 
B WRITE (LP,9) 
9 FORMAT(27H IN TRSPMX AT GVER-WRITES Ay 
* 34H OR INPUT HAS NC ROWS AND COLUMNS.) 
CALL. EXIT 
10 WRITE (LPellL) 
11 FORMAT(27H IN TRSPMX NT WILL OVERFLOW) 
CALL. EXIT 
END - 
C EROS HEARERS SESE AEEE KEKE EES EEK ERE EH ET 
SUBROUTINE WRSPMX(AsMyTI Te NAyNM) 
c ERE EE EKER AES OREREEEHESEEEREKERE 
C WRITE SPARSE MATRIX A. 
C M CONTAINS COLUMN INDICES OF A. 
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C TIT CONTAINS DESCRIPTION OF Aj 
C NAsNM ARE DIMENSIONS OF AgM. 
c 


REAL A,AOUT,TIT 
INTEGER MePelTel eNRAsJFeNIRAg Jy J29K2—KoK I eMOUT yNAgNMoL P 
DIMENSICN ACNA) sMONM) oTIET(10) pAGUT(S) »MOUTCES) 


c 
C LP IS UNIT NUMBER OF LINE PRINTER. 
Le = 6 
C P IS PAGE CCUNTER. 
P =1 


C HEADING AND OESCRIPTION. 
WRITE (LPL) (CTLITCD)s 121,100,P 
1 FORMAT(23HLPRINTOUT OF SPARSE MAT» 
* GHRIXe re LIX, lLOAS es LOXeSHPAGE ¥15//) 
C L IS LINE CCUNTER. 


L = 2 
C NRA IS NUMBER OF ROWS IN Aw 
NRA = IND(Ms1l»NM) 
JF = 44NRA 
C NIRS IS NUMBER IN ROWS ALREADY WRITTEN. 
NIRS = 0 


Cf £S ROW COUNTER. 
DO 8 I = LeNRA 
C NIRA IS NUMBER IN ROW OF Aw 
NIRA = INO(M,4+T,NM) 
IF (NIRA.EQ.0) GO TO 8 
C J COUNTS ELEMENTS WRITTEN. 
J = 0 
IF (LeLT.51) GU TO 3 
AT END OF PAGE WRITE NEW HEADING ON NEXT PAGEsUPDATING 
PAGE NUMBER. 
2 Pp = P+] 
WRITE (LPohd CTITCK) pK21910) 9P 
L = 2 
3 WRITE (LPe4) I 
4 FORMAT(12HOROW NUMBER 4 15// 
* 1Xe5(14X%— THELEMENT » 5X_ 3HCOL S 5X) ) 
L = L+4 
C EXTRACT NEXT LINE OF OUTPUT. 
5 J2 = MINO(NERA, J+5) 


an 


K2 = J2-J 
00 6 K A 19K2 
KJ a K4+J¢NEIRS 


MOUT(K)= IND(M,KJ+JF, NM) 
6 AQUT(K)2 AIKJ) 
WRITE (LPe7) (AQUT(K)sMOUT(K)K=19K2) 
7 FORMATULXe 5 (EL5 279 1504X)) 
L = Ll 
J = U5 
IF (J.GE.NIRA) GO TO 8 
IF (1-55) 55202 
B NIRS = NIRS#NIRA 
C LAST ROW WRITTEN. 
RETURN 
END 
C BREE ESE EEE EKAAESE SES EE REESE EERE EEK ECE KEES ERE EE 
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Basser Computing Department, University of 
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CR Categories: 514 


There are a number of minor flaws in the presentation of Al- 
gorithm 408. The first concerns the liberal use of a subroutine 
EXIT not described in the Algorithm; nor to be found in the For- 
tran standard as an intrinsic procedure. Probably the use of this 
particular routine is self-evident (especially to IBM users), but 
it is difficult to justify using it when the STOP statement is available. 
The safest way to correct this flaw is to write a short program 
that scans the algorithm text replacing occurrences of CALL 
EXIT by STOP; by my count there are 25 of these. The alternative, 
of supplying a subroutine named EXIT has a trap: a subprogram 


COLLECTED ALGORITHMS (cont.) 


must contain a RETURN (see [1, Sec. 8.4.1.1(5) of the standard]),: 
so the routine must be (a) in nonstandard Fortran or machine 
code, or (b) something like: 


SUBROUTINE EXIT 
J=0 

IF (J.EQ.0) STOP 
RETURN 

END 


The other flaw occurs in the very last line of the algorithm: 
an END statement delimits a program (see [1, Sec. 3.2.2]), so 
that the comment following it must belong to another program 
segment (which does not have an END and is in error). The cure 
is simple: remove the comment. 

There is also a minor criticism one might make of the effi- 
ciency of the subprograms IND and IPK, which are frequently 
called. In practice the advantage of using available Fortran ver- 
sions will often outweigh the gain in speed possible by lansing 
into assembly language, and therefore the following versions are 
offered as probably compiling to a more efficient code. They 
utilize the intrinsic function MOD (often compiled in-line) and 
remove needless computations and assignments. 


FUNCTION IND(M, I, NM) 
DIMENSION M(NM) 
J = (I+1)/2 
IF (MODUI, 2))1, 1, 2 

1 IND = MOD(M(J), 10000) 
RETURN 

2 IND = M(J) / 10000 
RETURN 
END 


SUBROUTINE IPK(K, M, I, NM) 
DIMENSION M(NM) 
J = (I4+1)/2 

IF (MOD(I, 2)) 1, 1, 2 

IMJ) = MJ)+K 
RETURN 

2 M(J) = K*10000 + M(J) 
RETURN 
END 


References 
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E.E. Lawrence [Recd. 1 February 1972, 12 March 1973] 
Central Application Laboratory, Mullard Limited, 


New Road, Mitcham, Surrey CR4 4XY, England 


The subroutines constituting Algorithm 408 were, with the 
exception of MVSPMX and WRSPMxX, tested on an IBM 360/65 
using CALL/360-0S. The author’s alteration (iii) was introduced, 
i.e. declaration of the M-array to be half length. Other changes 
were introduced in order: (a) to make the algorithm more con- 
versational in a time shared environment; and (b) to improve the 
speed of the sorting procedure in PERCOL. 


The following deficiencies in the algorithm were noted 
1. The dimensional parameters of ACSPMX, ADSPMX, and 


MUSPM<X are incomplete. As an illustration of this consider the 
two matrices 


CO WS Oi ca oS 
bho 
LH 


| 

To tal 

oo 
ONOCOO CONO 


each of which has four nonzero elements. 

Then the sum matrix has eight such elements, and in general, 
for two matrices with , and m2 nonzero elements, the number of 
nonzero elements, #3 , in the sum matrix is in the range 0 < n; < 
Ay + Ale. 

However in ADSPMX the condition used is nm; = ng = 3. 

Similar arguments apply to ACSPMX and MUSPMX. 

To correct this requires extensions to the parameter lists and 
dimension statements, and also it changes the conditional state- 
ments within the subroutines concerned. 

This shows up with the CALL/360-0S system since the com- 
piler performs subscript checking. It would not be evident on most 
compilers including the IBM Fortran IV G compiler. It is, how- 


ever, bad practice to rely on default effects of compilers. 


2. There are three, probably copying, errors in;. MUSPMX (page 
270). d 


(i) Line 33 should be: 
IF(NCA.EQ.NCB) GO TO 3 


(ii) Line 55 should be: 
DO 14J = 1, NRB 


(iii) Line 102 should be: 
CALL IPK(NRB,MC,2,NM) 
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A Sparse Matrix Package (Part I) [F4] 


[J.M. McNamee, Comm. ACM 14, 4 (1971), 265-273]. 


Paolo Sipala [Recd 10 October 1976] 


Istituto di Elettrotecnica, University of Trieste, Trieste, Italy 


The subroutines RDSPMX, ADSPMX, MUSPMX, TRSPMX, MVSPMX, and 
WRSPMX of ACM Algorithm 408 were tested after conversion to Basic Fortran, 
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and the following corrections appear to be needed: 
(1) In ADSPMX, the line after statement number 9 should be changed to 


IF (T.EQ.0.0) GO TO 911 
and before statement number 11 the following line should be inserted: 
911 JB=JB+1 
(2) In TRSPMX, after statement number 5 the following line should be inserted: 
J2 =0 


The error in ADSPMX showed up when adding two matrices containing ele- 
ments with opposite values in corresponding positions, which should cancel; the 
error in TRSPMX was noted when transposing a matrix having a null first line. 
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REMARK ON ALGORITHM 408 


A Sparse Matrix Package (Part 1) [F4] 
[J.M. McNamee, Comm. ACM 14, 4 (April 1971), 265-273] 


Fred Gustavson [Recd 25 January 1978] 
T.J. Watson Research Center, IBM, Yorktown Heights, NY 10598 


The subroutine TRSPMX of ACM Algorithm 408 was compared with Algorithm 
HALFP [1] and the following corrections appear to be needed: 
(1) Before statement DO 5... , insert the line 
IF (NC.LE.1) GO TO 100 


(2) After label 5 insert the line 
100 J2=0 


The need for correction (1) is required when the matrix is a column vector 
(NC = 1). The need for correction (2) was noted in [2] as TRSPMX fails when 
transposing a matrix with an empty first row. 


REFERENCES 

1. Gustavson, F.G. Two fast algorithms for sparse matrices: Multiplication and permuted transpo- 
sition. ACM Trans. Math. Software 4, 3 (Sept. 1978), 250-269. 

2. SrpaLa, P., Remark on Algorithm 408. ACM Trans. Math. Software 3, 3 (Sept. 1977), 303. 


COLLECTED ALGORITHMS (cont.) 408-P11- R3 
and the following corrections appear to be needed: 
(1) In ADSPMX, the line after statement number 9 should be changed to 
IF (T.EQ.0.0) GO TO 911 
and before statement number 11 the following line should be inserted: 
911 JB = JB+1 
(2) In TRSPMX, after statement number 5 the following lineshould be inserted: 
J2=0 


The error in ADSPMX showed up when adding two matrices containing cle- 
ments with opposite values in corresponding positions, which should cancel; the 
error in TRSPMX was noted when transposing a matrix having a null first line. 
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A Sparse Matrix Package (Part 1) [F4] 
[J.M. McNamee, Comm. ACM 14, 4 (April 1971), 265-273] 


Fred Gustavson [Recd 25 January 1978] 
T.J. Watson Research Center, IBM, Yorktown Heights, NY 10598 


The subroutine TRSPMX of ACM Algorithm 408 was compared with Algorithm 
HALFP [1] and the following corrections appear to be needed: 


(1) Before statement DO 5..., insert the line 
IF (NC.LE.1) GO TO 100 
(2) After label 5 insert the line 
100 J2=0 


The need for correction (1) is required when the matrix is a column vector 
(NC = 1). The need for correction (2) was noted in [2] as TRSPMX fails when 
transposing a matrix with an empty first row. 
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sition. ACM Trans. Math. Software 4, 3 (Sept. 1978), 250-269. 
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A Sparse Matrix Package (Part 1) [F4] 
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U. Harms, H. Kollakowski, and G. Moller [Received 15 May 1978 and 15 August 
1978; accepted 12 December 1979] 

Regionales Rechenzentrum fur Niedersachsen, Technische Universitat Han- 
nover, Wunstorfer Strafe, D-3000 Hannover, West Germany 


When implementing Algorithm 408 on a CDC Cyber 76-12 and a Cyber 73-16, the 
errors noted by Lawrence [2] are corrected. In ARSPMX the dimensional 
parameters were incomplete and have been completed. Thus it is possible to add, 
for example, two sparse matrices having different numbers of nonzero elements. 
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There is another severe error in ADSPMX, as pointed out by Sipala [3]: when 
adding two elements whose sum is zero, ADSPMX gives an incorrect result. For 4 


exampie, when 
t 2 -—i1 0 2 
A=/]2 8 0], B=|-2 3 0], 


0 1 Of 0 1 Of 
then the result of A + B by ADSPMX is 


-i 0 4 
C=j]-2 6 O|, 
0 2 0 
not the correct sum. 


The necessary changes in ADSPMX are 


(1) line 73: IF (T.EQ.0.0) GO TO 30 
(2) before line 86: 30 CONTINUE 
JIB=JB+1 


Then the zero sum is ignored. For better definition, all elements of C are set to 
zero. If all elements of C are zero, then a message is printed later in the code. 

There are some minor changes in some other subroutines; for example, in 
ARSPMX, MCSPMX, and MRSPMX the variable NEA is set but never used. 
The same thing happens to the variable NC in PERROW. For this reason NEA 
and NC are eliminated in the subroutines presented here. 

In MCSPMX the statement M(2) = L — 1 (1 line before statement 6) should 
be changed to MN(2) = L — 1, and in MRSPMX the statement M(2) = L — 1 
(2 lines before statement 8) should be changed to MN(2) = L — 1. 

Moreover, some errors in TRSPMX, pointed out by Sipala [3] and Gustafson om 
[1], have been corrected. 
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ALGORITHM 


[Code for Algorithm 408 with all the corrections given here is available from the 
ACM Algorithms Distribution Service (see inside back cover for order form) or 
may be found in microfiche form in “Collected Algorithms from ACM.” } 


NAME(n): indicates a Fortran module with n records 

NAME'(n): indicates “NAME” is included for testing purposes 

Contents: IND(18), IPK(14), ANTIP(39), RDSPMX(121), WRSPMX(62), 
ANDPMX(152), SUSPMX(148), MVSPMX(24), SMSPMX(52), 
CVSPMX(30), ERSPMX(29), ECSPMX(28), INSPMX(19), 
OTSPMX(21), PERCOL(101), PERROW(84), RVSPMX(50), 
MUSPMX(134), TRSPMX(95), MCSPMX(89), MRSPMX(83), 
TEST" (536), CHECK"(37), MAIN™(56), RANDO"(13), 
RANDU’'(8), ACSPMX(142), ARSPMX(163) 


COLLECTED ALGORITHMS FROM CACM 


te PN i EE = 


ALGORITHM 409 polynomial should have an error function with well charac- 


terized extrema of equal height. 
° Three emergency exits are provided for extraordinary events. 
Discr ete Chebychev exparameter is an exit to be used when entry data are entered 
7 incorrectly. exmaxit is used when the best fit is not found within 
Curve Fit [E2] 


the maximum number of exchange steps allowed. In this case, 
the parameter ref denotes a new reference which may be used as 


H. Schmitt [Recd. 23 June 1970 and 12 Oct 1970] entry data for a further call of approx. The exit exsign is used 


. ‘ when the approximation errors at the points of reference do not 
Rechenzentrum der Technischen Hochschule alternate in sign. In this case, accuracy of the computer is insuffi- 
Darmstadt, West Germany cient to generate an approximation polynomial of the required 
degree. 
Acknowledgment. The author wishes to express his apprecia- 
tion to Prof. Dr. W. Barth for many valuable discussions on the 
subject of Chebychev approximation. 


Reference 
1. Stiefel, E. L. Numerical methods of Chebychev approximation. 
Key Words and Phrases: approximation, polynomial In On Numerical Approximation, R. Langer, (Ed.), U. Wis- 
approximation, exchange algorithm, Chebychev approximation, consin Press, 1958, pp. 217-232; 
Remez algorithm begin 


CR Categories: 5.11, 5.13 


procedure approx (m, n, k, x, y, epsh) transients: (maxit, ref) 
results: (hmax, h, a) exits: (exparameter, exmaxit, exsign); 
value m,n, k, epsh; integer m, n, k, maxit; real epsh, hmax; 
array x, y, 4, a; integer array ref; 
label exparameter, exmaxit, exsign; 

comment This procedure computes the best approximation poly- 
nomial in the sense of Chebychev of required degree m to a set 
of n distinct points given by their abscissas and ordinates (array 
x, y [l:a]). The abscissas must be arranged in increasing order 
x[1] < x[2] < +--+ < x{n]. The desired polynomial is even, odd, 
or mixed for k = 2,k = 1, ork = 0, respectively. It is expected 
that x[1| > 0 in case of kK = 2 and x[(1] > O in case of k = 1. 
Leveling according to the exchange method described by Stiefel 
[1] is done up to a tolerance of abs(epsh). The sign of epsh 
decides whether ref is expected to supply entry data (cf. param- 
eter ref). 

maxit enters an upper limit for the number of exchange steps 
allowed and returns the number of steps actually performed. 
The parameter ref is used to carry entry data only if epsh < 0. It 
is an integer array containing the subscripts of the points to be 
used as initial reference. The lower array bound is 1, the upper 
bound (say p) is m + 2 in the case of mixed (k = 0) polynomials, 
entier ((m+3)/2) in the case of odd (k = 1), and entier 
((m+4) /2) in the case of even (kK = 2) polynomials. It is expected 
that 1 < ref[l] < ref[2] < +--+ < ref[p] < n. Unless an initial 
reference is not explicitly given by means of the array ref and 
indicated by a value epsh < 0, the points lying next to the so- 
called Chebychev abscissas (with regard to the interval [x{1], 
x[n]]) are determined to start off the algorithm. As output, this 
parameter returns the reference belonging to the approximation 
polynomial. 

The output parameters are hmax to return the maximum devia- 
tion, an array A{1:n] to return the approximation errors at all 
given points, and an array a[0:m} to carry the polynomial 
coefficients. The array A containing the approximation errors is 
introduced as a formal parameter to allow a drawing of the error 
function to be made outside the procedure. This provides a means 
to look at the quality of the computed approximation and is 
recommended to the user. A totally leveled approximation 


integer i, j, p, g1, q2, r; Boolean KO, k1; 
kO:=k=0; kl:=k=1; 
qi := if k1 then 1 else 0; 
q2 := if kO then 1 else 2; 
for i := O step 1 until m do a[i] := 0; 
if — kO then m := entier ((m—q1) X 0.5 + 0.1); 
p:=m-+ 2; 
comment Check for properly given parameters; 
ifn<pVm<0V AkKOA (AAI V x[1] < 9) 
/\ (k + 2 V x[1] < 0) then go to exparameter; 
for i := 2 step | until 1 do 
if x[i] S x[i—1] then go to exparameter; 
begin 
procedure exchange (n, p, h, epsh, z, equal); 
value n, p, epsh; 
real epsh, integer n, p; label equal; 
array fh; integer array Z; 
comment This procedure performs the exchange technique. 
The number of points and the number of reference points 
are entered by 1 and p. The approximation errors at different 
points are compared relative to epsh. The subscripts of the 
points of reference are carried by z[1] --- z{p] of the integer 
array z[0:p+1], a parameter which serves to enter the 
former and return the new reference. z[0] and z[p+1] are 
for internal use only and are expected to have the values 0 
and n + 1. If both the old and new references are equal to 
each other, a jump to the label equal occurs. No global 
quantities are contained within this procedure; 
begin 
integer i, j, 1, index, indl, indr, sig, ze; 
real hz1, hzp, max, maxl, maxr; 
1:=0; sig := —sign (h[z(1])); 
if sig = O then sig := 1; 
for i := 1 step 1 until p do 


begin 
max :=0; sig := —sig; ze:= 2{i+1] — 1; 
for j := z[i—1] + 1 step 1 until ze do 


if (A[j]-max) X sig > O then 

begin max := h{j]; index := j end; 

if abs (max-h{z|i]]) > abs(max) XK epsh then 
begin z[i] := index; I := 1 end 
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Shr: 


SAl: 


end: 


end; 
maxl := maxr := 0; 
for j := z[p] + 1 step 1 until 1 do 
if (maxr —h{j]) * sig > 0 then 
begin maxr := h{j]; indr := j end; 
hzl := Alzi{l)]; sig := sign(hzl); 
for j := 1 step 1 until z{i] — 1 do 
if (maxl—hl{j]) X sig > O then 
begin max! := A{j]; indl := j end; 
maxl := abs(maxl); maxr := abs(maxr); 
hz\ := abs(hz1); Azp := abs(h{z[p}}); 
if / = O then 
begin 
if maxl — hzp S maxl X epsh / 
maxr — hzl S maxr X epsh then go to equal 
end; 
if maxl = 0 A maxr = 0 then go to end; 
if maxl > maxr then 
begin 
if maxl > hzp then to go shi 
else if maxr = hz\ then to go shr 
end 
else 
begin 
if maxr > hz1 then go to shr 
else if max! = hAzp then go to sh/ 
end; 
go to end; 


index := z{1]}; 
for 7 := 1 step 1 until p — 1 do z{i] := z[i+1]; 
z[p] := indr; 
if maxl > 0 then 
for i := 1 step 1 until p — 1 do 
begin 
if abs (h{indl]|) = abs (A{z{i]}) then 
begin j := z[i]; z{i] := indl; indl := index; 
index := j end 
else go to end 
end; 
go to end; 


index := z[p]; 
for i := p step — 1 until 2 do z[i] := z{i—1]; 
z{1] := indl; 
if maxr > O then 
for i := p step —1 until 2 do 
begin 
if abs (h{indr|) = abs(h{z{ij]) then 
begin j := z(i]; 2[i] := indr,; indr := index, 
index := j end 
else go to end 
end; 


~ 


end procedure exchange; 
real arg, max, pi, q, s, t, dt, x1, xa, xe; Boolean 51, 52; 
array xx[1:7], aa, daa[0O:m], c, d (1:p]; , 
integer array z(0:p+1]; 
comment Set up of initial reference; 
z(0] := 0; z[p+i):=n+1; 
if epsh < 0 then 
begin 
j:= 0; 
for i := 1 step 1 until p do 
begin 
r:= 2[i] := refi); 
if j < rthen/ := r else go to exparameter 
end; 


m0: 


mi: 
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if j > n then go to exparameier; 
epsh := abs (epsh); goto ml 
end; 
pi := 3.14159265; xl := x[1]; xe := x[n]; 
if kO then 
begin xa := xe + x1; xe:= xe — xl; 
arg := pi/(m+1) end 
else 
begin xa := 0; xe := xe + xe; 
arg := pi/(2X(m+1)+q]1) end; 
for 7 := p step —1 until 1 do 
begin 
xl := xa + xe X cos (arg X (p—j)); r:= 2[j+1]; 


for i := r — 1 step —1 until 2 do 
if x({/] + x[i—1] s x1 then go to m0; 
i:= 1; 


z[/] := ifr > ithenielser — 1 
end; 
if z{1] = 1 then go to m1; 
for j := 1,7 + 1 while z[j] < j do z[j] :=/; 


for i := 0 step 1 until m do aa[i] := 0; 
for i := 1 step 1 until 7 do 
begin Ali] := yli]; ¢@:= xl; 
xx|[i] := if kO then g else gq X @ 
end; 
b1 := 52 := false; r:= —1; ¢:=0; 


iterat: 


r:=r+1; s:= 1.0; 
comment Computation of the divided difference schemes; 
if ki then 


begin 
for i := 1 step 1 until p do 
begin 
s:= —s; j:= [i]; @:= xf]; 
cli] := (AU + 5X D/q; di] := s/q 
end 
end 
else 
for i := 1 step 1 until p do 
begins := —s; cfi] := A(z{i]] +5 Xt; afi] := send; 


for i := 2 step 1 until p do 
for 7 := p step —1 until ido 
begin 
q := xx{[z[j]] — xx[z[1+j—il; 
elf] := (eli) — eli-1))/a; 
d{j] := (dalj} — dli—1))/9 
end; 
dt := —c[p)/d[p]; t:= t+ dt; 
comment Computation of the polynomial coefficients; 
for i := m step —1 until 0 do 
begin 
daa{i] := c{i+1] + dt * dfi+1]; ¢@:= xx[z[i+1]]; 
for j := istep 1 until m — 1 do 
daa{j] := daa{j| — q X daa{j+1] 
end; 
for i := O step 1 until m do aa{i] := aa{i] + daali); 
comment Evaluation of the polynomial to get the approxima- 
tion errors; 


max := Q; 
for i := 1 step 1 until 7 do 
begin 


s:=aa(m]; q:= xxi]; 

for j := m — 1 step —1 until Odo s := s X q + aa{j; 
if ki then s := s X x[i]; 

q:= hfi] := yli] — s; 

if abs (q) > max then max := abs(q) 
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end; 
comment Test for alternating signs; 
j := —sign (A{z(1))); 
for i := 2 step 1 until p do. 
if sign (A{z[i]]) = j then j := —/j else 


begin 51 := true; go to m2 end; 
comment Search for another reference; 
exchange (n, p, h, epsh, z, m2); 
if r << maxit then go to iterat else b2 := true; 
comment Results to output parameters; _ 
m2: 
for i: 
for i := 1 step 1 until p do ref[i] := z[i]; 
hmax := max; maxit := r; 
if 51 then go to exsign; — 
if 52 then go to exmaxit 
end 
end procedure approx 


0 step 1 until m do a(q1+ixXq2] := aalfi]; 
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ALGORITHM 410 
Partial Sorting |M1| 


J. M. Chambers [Recd. 15 July 1970] 
Bell Telephone Laboratories, Murray Hill, NJ 07974 


Key Words and Phrases: sorting, partial sorting, order 
Statistics 
CR Categories: 5.31 


Description 

We introduce the notion of partial sorting as follows. Given an 
array A of N elements the result of sorting the array (in place) is 
to arrange the elements of A so that 


A(l) < AQ) < --- < ACN). 

An equivalent statement is that, for J = 1, 2,---, N, A(J) isa 
value such that for] <I1<J< K<WN 

All) < AV) < A(K) (1) 


This property is also equivalent to the statement that A(J) is the 
Jth order statistic [4] of A, for all J. 

Partial sorting is a procedure which rearranges A so that (1) 
holds for some selected values of J, but not necessarily for all J. 
The advantage of using partial sorting, where possible, is that the 
cost is substantially less than for sorting, when the number of order 
statistics required is small compared to N. 

Such will frequently be the case, for example, in statistical applic- 
ations, when the sample is to be summarized using some of the 
order statistics. For large N only a portion of the sample would be 
needed, even for displays such as the empirical distribution function. 

Specifically, in the algorithm PSORT below, the user supplies the 
array A of size N and a set of indices IND of size NIJ. On return, 
A will have been rearranged so that relation (1) holds, i.e. A(J) has 
the value it would have if A were sorted, for J = IND(1), IND(2), 
..., IND(ND). 

For example, suppose A is the vector (10., 8., 3., 5., 7., 2.) and 
IND is the vector (2, 5). Then after a partial sort of A with IND, 
A(2) = 3..and A(5) = 8.. 
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The method used is based on Hoare’s method [1, 2] as de- 
veloped by Singleton [3]. Hoare’s method consists of choosing an 
element A(m) and splitting the array into three portions which are 
respectively smaller than, equal to, and larger than this element. 
The method is then applied recursively to the first and third por- 
tions, until the data is completely sorted. Successive versions 
leading to [2] alter the method in four important respects. (i) in- 
stead of choosing A(m) arbitrarily, the median of the first, last and 
middle element are chosen; (ii) the recursion is simulated, rather 
than explicit; (iii) short sequences (less than 10 in [3]) are sorted 
by a “‘sinking” sort; (iv) a different treatment of “tied” observations 
is introduced. 

Hoare’s method is very well suited to handle the partial sorting 
problem. The algorithm is modified simply by passing over the 
portion of A in which none of the indices in IND are found. Once 
we have established a segment of 4 which is known not to contain 
any of the desired order statistics, there is no need to sort it further. 
The special case of NJ = 1 was treated in procedure FIND of [1]. 

For a fixed number of indices, the cost of applying PSORT is 
very nearly proportional to N, as opposed to the full sort, with 
cost of the order of Nlog(N). Because of the simplicity of the 
modified algorithm, the cost of PSORT will almost always be sig- 
nificantly less than the cost of the full sort, providing NJ is sub- 
stantially less than N. Notice, however, that a full sort will be carried 
out unless some adjacent elements of IND differ by more than 10. 

The following restrictions are to be noted: it is assumed that 
IND is initially sorted into ascending order; A is of type REAL; 
if N is the dimension of the A array then the arrays INDU, INDL, 
1U, IL must have dimension K where N < 2*+1, (see [3]); 

Examples. Table I gives some examples of the performance of 
PSORT on various size arrays with various initial orderings. The 
examples were constructed as follows. Samples of N were simulated 
with a standard normal marginal distribution, and a correlation p 
with an ordered normal sample. (Specifically we generated a, 6; for 
i = j,---, N as independent standard normal variates, then 
formed y; = pa: + (1 — p*)4d; and sorted the y;, carrying along 
the a; . The resulting a; are the desired input to PSORT.) 

Computations were carried out in two ways. By replacing the 
comparisons of elements in A by special functions, the number of 
comparisons required was counted, and is shown in the columns of 
Table I headed C. This gives a machine independent result, but 
does not include the costs of transposition, logic, etc. Therefore, 
we also give timings for the original algorithm, on a GE 635 com- 
puter, in the columns headed 7. The unit of time is one millisecond. 

The results of Table I suggest, as one would expect, that the 
most expensive case, for given value of NJ, is for the desired order 
statistics to be evenly spaced; i.e. jN/(NI+1) for j = 1, +--+, NI. 
For this worst case, the cost does grow proportionately to N (a 
little less than that, in the table). 

A comparison with the full sort, using Singleton’s algorithm, 
is included for sample size 500. 


Table I. Examples of PSORT. C = number of comparisons, 7 = time in 1073 sec. 


=1.0 

N NI IND Cc T Cc 
100 2 33 67 303 11.0 384 
100 3 25 50 fis 323 11.8 468 
500 2 33 67 1122 36.0 1356 
500 3 25 50 75 1182 37.9 1414 
500 3 125 250 375 1628 49 .3 2213 
500 Call to SORT 151.3 

1000 3 250 500 750 3258 96.9 4870 


Correlation with ordered data 


“ 0.0 +0.5 +1.0 

T Cc T Cc T C T 
14.2 392 14.2 386 13.6 291 9.0 
17.5 429 15.9 470 16.7 329 10.2 
43.7 1169 36.7 1362 41.3 1121 27.8 
46.3 1307 41.5 1406 43.1 1181 29.9 
70.1 2184 71.3 2205 67.2 1748 43.9 
15152 150.2 150.4 150.9 
145.1 4438 137.27 4725 140.4 3503 85.6 
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Algorithm 


SUBROUTENE PSORT (Ay NgteteNI) 


PARAMETERS TO PSORT HAVE.THE FOLLOWING MEANING 
A ARRAY TO BE SORTEC 
N NUMBER OF ELEMENTS IN A 


INO ARRAY CF INOICES IN ASCENDING ORDER 
NI NUMBER OF ELEMENTS IN INO 
CEUMENSICN AUN), INCENIT) 

DIMENSICN INOUC16)»sINDL(16) 

DIMENSICN 1U(16)e1L 016) 
INTEGER P 
JLeL 
JUANI 
INOL (1)=1 
INDUCL)=NI 
C ARRAYS INDL», INCU K&EEP ACCOUNT OF THE PCRTEON OF IND RELATED TO THE 
C CURRENT SEGMENT OF OATA BEING ORDEREC. 
1=} 
J=N 
M= I 

S IFCI.GE.J} GO TU 70 

C FIRST ORDER ACT) eAC ID eACET4+5)/2)¢ ANE USE MEDIAN TO SPLIT THE DATA 

10 K=f 

[JH(leJsd/2 
T=ACEJ) 
IFCACT).LE.T) GU TO 20 
A(TJ)=ACT) 
AU) =T 
T=A(1J) 
20 t=J 
TFCA(J).GE.T) GO TO 40 
ALI JI=AJ) 
A(J)=T 
T=A(EJ) 
[FCACT).LE.T) GU TO 40 
AUESID=ACT) 
AUK)=T 
T=A(T JS) 
GO TO 4¢ 
30 A(LI=A(K) 
A(K)=TT 
40 L=t-1 
TF(A(L).GT.T) GG TO 40 
TT=A(L) 
C SPLIT THE DATA INTC ACI TO L)oLT.T., ACK TO J).GT.T 
50 K=Ke1 
IFCACK) LTT) GO TO 50 
IF(K.LE.L) GO TC 30 
INDL OM) =JL 
INDUCM)=JU 
P=M 
M=Me} 

C SPLIT THE LARGER OF THE SEGMENTS 
IF(L-1.LE.J-K) GC TC 60 
IL(P)=1 
(UCP d=L 
1=* 

SKIP ALL SEGMENTS NUT CCRRESPCNDEING TO AN ENTRY IN IND 

> IF(JL2GT.JU) GO TOA 70 
IFCIND(JL).GE.1)} GO TO 58 
JL=JIL4L 
GO TO 55 

du INDUCP}=JL—1 

Go TO ac 
60 IL(P)=K 
IU(P)=J 
Jat 
6% IFC JL2GT.JU) GC TO 70 
IFUINO(JU).LE.J)? GO TO 68 
JU=JU-1 
GO TO 65 
68 INQL (P)=JUFL 
GO TO ac 
1G M=M—1 
TF(MSEQ.0) RETURN 
T=TL(M) 
J=FUCM) 
JL=INDL (IM) 
JU=INOU(M) 
IF(JL.GT.JU) GC TO 70 
&C TF(J-10.2€7210) GO TO 10 
IFCT~E€Q.1) GO TO 5 
I=h-1 
oC I=I¢1 
TFCT2EQ.J) GO TC 70 
T=A(T41) 
IFCA(T).LE.T) GU TO 90 
KzI 

buc A( Keb) =A(K) 

K=K-1L 

IFUTSLT.ACK)) GEO TO 100 

A(K+l)=7 

Go fo 9¢ 

END 


aaoran 


ea 
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Algorithm 411 


Three Procedures for the Stable 
Marriage Problem [H] 


D.G. McVitie* and L.B. Wilson (Recd. 12 Aug. 1968 
and 15 July 1969) 

Computing Laboratory, University of Newcastle 
upon Tyne, Newcastle upon Tyne, NEI 7RU, 
England 


Key Words and Phrases: assignment problems, assignment pro- 
cedures, combinatorics, discrete mathematics, operations research, 
stable marriage problem, university entrance 

CR Categories: 5.30 


Part 1 
procedure GS (malechoice, femalechoice, marriage, count, n); 
value x; integer count, n; 
integer array malechoice, femalechoice, marriage; 
comment This procedure finds the male optimal stable marriage 
solution using the Gale and Shapley algorithm. The result is 
left in the integer array marriage. Thus marriage [i] is the man 
whom the ith woman marries. n is the size of the problem, 
count is the number of proposals made before the stable marriage 
is found. malechoice and femalechoice are the choice matrices 
for the men and women respectively, i.e. femalechoiceli, j] 
is the jth choice of the ith woman. The femalechoice array is 
changed.to the integer array fc, where /c[i, j] is the choice num- 
ber (first, second, third,...) of the jth man to woman i. This 
new arrangement is adopted for convenience when the women 
compare proposals. All the women keep a dummy man 0 in sus- 
pense initially. This dummy man is given a choice number n + 1 
so that he will be given up as soon as any other offer is made; 
begin 
integer i, m, 7; Boolean array refuse [0:n]; 
integer array fc [1:n, 0:nl, proposal, malecounter (1:n); 
for i := 1 step 1 until n do 
begin 
for j := 1 step 1 until 7 do 
fe li, femalechoice [i, f\] := j; 
comment The femalechoice array is rearranged for conven- 
ience in the marriage part of the procedure; 


refuse |i] := true; marriage [i] := 0; 
malecounter |i} := 1; j/celi,0]):=n+1 
end; 
count := OQ; 
PROPOSE: 
m := OQ; 


comment Now the rejected men propose to the next woman in 
their choice lists. Initially all the men propose to their first 
choices; 


*Now at Software Science, Ltd., Wilmslow, Cheshire, England. 
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for i := 1 step 1 until ” do 
if refuse [i] then 


begin 
proposal [i] := malechoice |i, malecounter (i]\; 
malecounter [i] := malecounter [i] + 1; 
m:= m+ 1; refuse li] := false 

end 

else proposal [i] := —1; 


if m = O then go to FINISH; 
comment The procedure terminates if at any stage no proposals 
are made by the men; 
count := count + m; 
comment In the next part of the procedure all the on who 
have had a proposal decide whether to reject it .. the one 
they are keeping in suspense; 
for i := 1 step 1 until x do 
if proposal [i] > O then 
begin 
j := proposal (il; 
if fc [j, i] > fc Lj, marriage [j]j then refuse {i] := true 
else 
begin refuse [marriage [j\] := true; marriage [j] := i end 
end; 
go to PROPOSE; 
FINISH: 
end of procedure GS 


Part 2 
procedure MW(malechoice, femalechoice, 
valuen; integer count, n; 
integer array malechoice, femalechoice, marriage; 
comment The heading is the same as for the GS procedure and 
the formal parameters have the same meaning. Also the female- 
choice array has been rearranged in the array fc as before, and 
the women given initially a dummy man 0 with choice number 
n+; 
begin 
integer i, /; 
integer array fc[1:n, 0:n), malecounter [1:n]; 
procedure PROPOSAL (i); value i; integer 7; 
comment This procedure makes the next proposal for man i 
and calls the procedure REFUSAL to see what effect this 
proposal will have. The procedure does nothing if man i is 
the dummy man QO; 
if i + O then 
begin 
integer j; count := count + 1; 
j := malecounter|i]; malecounter[i] := j + 1; 
REFUSAL(i, malechoiceli, |) 
end; 
procedure REFUSAL(i,j); value i,j; integer i, j; 
comment This procedure decides whether woman j should keep 
the man she is holding in suspense in marriagelj] or man i 
who has just proposed to her. Whichever she rejects goes 
back to the procedure PROPOSAL to make his next proposal; 
if fej, marriage\j||>/cl/, i] then 
begin 
integer k; 
k := marriagelj); 
PROPOSAL(k) 
end 


marriage, count, ny); 


marriagelj| := i; 
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else PROPOSAL/(i); 
for i := 1 step 1 until 1 do 
begin 


for j := 1 step 1 until 7 do 
feli, femalechoiceli,j\) := j; 


marriageli] := 0; malecounter|i] := 1; fcli, 0) := 1 +1 
end; 
count := 0; 


for i := 1 step t until 7 do PROPOSAL(i); 
comment This for statement operates the algorithm and after 
the ith cycle a set of stable marriages exists for the men 1 toi 
and i of the women; 
end of procedure MW 


Part 3 

procedure ALL STABLE MARRIAGES (malechoice, femalechoice, 
n, STABLE MARRIAGE); 
value 77; 
integer array malechoice, femalechoice; 
integer 7; procedure STABLE MARRIAGE; 

comment malechoice and femalechoice are the same arrays as were 
used in GS and MW, nis size of problem. STABLE MARRIAGE 
(marriage, n, count) is a procedure (with three parameters) writ- 
ten by the user which is entered when a new stable marriage is 
formed after count proposals. The marriage is stored such that 


marriage|i] contains the number of the man married to woman i. 


The locally declared Boolean array unchanged is used to make 
sure Rule (2) is not violated; i.e. during a breakmarriage opera- 
tion started on man i only men > i may propose. The locally 
declared Boolean success is set true if breakmarriage to man i 
leads to a new stable marriage, otherwise it is set false; 
begin 
integer array marriage, malecounter (0: n), fe [1: n, 0: n); 
Boolean array unchanged (0: nj; 
integer i, 7, k; Boolean success; 
procedure breakmarriage(malecounter,marriage,i,n,count) ; 
value malecounter, marriage, i, n, count; 
integer i, n, count; integer array malecounter, marriage; 
comment This procedure breaks the marriage of man i; 
begin 
integer j; 
marriage |malechoice [i, malecounter [i]—1}] := —i; 
proposal (i,malecounter ,marriage,count) ; 
if — success then go to EXIT; 
STABLE MARRIAGE (marriage,n,count); 
for 7 := istep 1 until x — 1 do 
breakmarriage (malecounter,marriage,j,n,count) ; 
comment The lower limit i in the above for statement is the 
application of Rule(1) which after a successful break- 
marriage operation on man i restricts further breakmarriages 
to men > i; 
for j := i+ 1 step 1 until 7» — 1 do 
unchanged |j] := true; 
EXIT: 
unchanged |i] := false; 
end of breakmarriage; 
procedure proposal (i, malec, marriage, c); 
value i; 
integer i,c; integer array malec, marriage; 
comment In this procedure man i proposes to the next woman 
in his choice list, and calls the procedure refusal for this 
woman. If i is negative on entry then a successful break- 
marriage operation has been completed and a new stable 
marriage found. If the Boolean success is made false during 
a breakmarriage operation then it means that this break- 
marriage has failed; 
if i < O then success := true 
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else if i = 0 VV malec [i] = n +1 VV -= unchanged (i| 
then success := false 
else . 
begin 
c:=c+1;j7:= malec [i]; malec li) := 7 + 1; 
refusal (i,malechoiceli,j|,malec,marriage,c) 
end of proposal; 
procedure refusal (i,j,malec,marriage,c) ; 
value i,/; 
integer i,j,c; integer array malec, marriage; 
comment This procedure decides whether woman j prefers man 
i or the man in marriage |j]. Whichever she rejects goes back 
to the procedure proposal to make his next choice; 
if fc [j, abs (marriage [j])|>fclj,i] then 


begin 
k := marriage |j|;_ marriage [j] := i; 
proposal (k,malec,marriage,c) 

end 

else proposal (i,malec,marriage,c) ; 

for i := 1 step 1 until 7 do 

begin 


for 7 := 1 step 1 until ” do 
feli,femalechoice \i,j]| := j; 


marriage |i| := 0; malecounter [i] := 1; 
fcli,0]) := n + 1; unchanged [i] := true; 
end; 
count := 0; 
for ¢ := 1 step 1 until 7 do 


proposal (i,malecounter,marriage,count) ; 
comment Male optimal stable solution found; 
STABLE MARRIAGE (marriage,n,count) ; 
for i := 1 step 1 until x» — 1 do 
breakmarriage (malecounter, marriage, i,n,count) ; 
end of procedure ALL STABLE MARRIAGES 
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1 if x[i] < xmin then xmin := x{i] 
Algor ithm 4 | 2 end of hunt for maximum and minimum values of x; 
if = 1 then goto N1A; 


Graph Plotter [J6] Hg ag He 


Josef Cermak (Recd. 19 Mar. 1970 and 12 Nov. 1970) a step 1 until 7 do 
; : : egi 
Department of Physics, University of Chemical ESI ih Saw athed wae = ele 
Technology, Pardubice, CSSR. evi < ymin then yin ait 
end of hunt for maximum and minimum values of y; 
: ei escape: 
Key Words and Phrases: plot, graph, lineprinter plot if abs(xmax-xmin) < xerror \/ abs(ymax-ymin) <_yerror 
CR Categories: 4.41 then go to /abel; 


p := (L—1)/(xmax—xmin); q := (S—1)/(ymax— ymin); 
procedure graphplotter (N, x, y, m, n, xerror, yerror, g, L, S, EM? for i := 1 step 1 until Z do plot L{i] := 1; 


C0, Cl, C2, C3, C4, label); for i := m step—1 until 1 do 
value N, m, #, xerror, yerror, g, L, S; begin 
array x, y; integer N, g, m,n, L,S; real xerror, yerror; integer r; 
string EM, CO, Cl, C2, C3, C4; label /abel; r:= 1 + entier ((x[i]—xmin) X p+4-0.5); 
comment This procedure is functionally identical with Algorithm plot L{r] := 0; ind{r] := i 
278. It needs, however, a significantly smaller array than Al- end; 
gorithm 278 for storage of the graph before it is printed. The NEWLINE (N,1); SPACE (N, g); 
procedure is intended to be used to give an approximate graph- comment NEWLINE and SPACE must be declared globally to 
ical display of a multivalued function yf[i, j] of x[i], on a line graphplotter, NEWLINE (N, p) outputs p carriage returns 
printer. Output channel N is used for all output. The graph is and p line feeds on channel N, SPACE (N, p) outputs p 
plotted for those points such that 1 <i<mand1<j<n blank character positions on channel N; 
where 2 <n < 4. If nm = 1, then y must be a one-dimensional for j := 1 step 1 until S do outstring ‘N, EM); 
array y{i] and the graph is plotted for x[i] and y[i] for! <i < for i := 1 step 1 until Z do 
m. The format of the output is arranged so that a margin of g begin 
spaces appears on the left-hand edge of the graph. ZL and S plot S(1] := plot S [S] := 1; 
specify the number of lines down the page and the number of for j := 2 step 1 until S — 1 do plot S[j] := 2; 
spaces across the page which the graph is to occupy, respec- if plot L{i] = 0 then 
tively. The graph is printed so that lines 1 and Z correspond te begin 
the minimum and maximum values of x, and character posi- ifn = 1 then 
tions 1 and S correspond to the minimum and maximum values plot S (1+entier (0.5+-q¢X (y[ind{i]] — ymin))] := 3 
of y. That is to say, y is plotted across the page and x is plotted else 
down the page. After the entire graph has been plotted, the for j := n step — 1 until 1 do 
minimum and maximum values for x and y are printed out in plot S (1+entier (0.5+ 4X (ylind [i], j]-—ymin))] := j + 2 
order xmin, xmax, ymin, ymax. The argument EM represents end; 
the character which is printed on the perimeter of the display. NEWLINE (N, 1); SPACE (N, 8); 
The argument CO represents the character printed at empty for j := 1 step 1 until S do 
positions. The arguments, Cl, C2, C3, C4, represent the charac- begin 
ters printed for y{i, 1], y[i, 2], yli, 3], and yli, 4], respectively. switch SW := SW1, SW2, SW3, SW4, SW5, SW6; 
At those points at which more than one character would ap- go to SW [plot s{j]]; 
pear, the order of preference is C1, C2, C3, C4. Control is SWI: 
passed from graph-plotter to the point whose label appears as outstring (N, EM); go to fin; 
the parameter /abel if the range of x[i] is less than xerror, or SW2: 
if the range of y[i, /] is less than yerror, for all j. If the values of outstring (N, CO); go to fin; 
x{i] occur at equal intervals, choosing L = m will make one SW3: 
line of printout equivalent to one interval of x. The graph may outstring (N, C1); go to fin; 
look somewhat out of true proportion since this algorithm as- SW: 
sumes that spacing along both axes is the same, but most line outstring (N, C2); goto fin; 
printers do not have the same spacing between adjacent lines SWS: 
as between adjacent characters on a line; outstring [N, C3); go to fin; 
begin SW6: 
real p, q, xmax, xmin, ymax, ymin; outstring (N, C4); 
integer i, /; fin: 
integer array plot L, ind [1:L], plot S [1:S]; end; 
xmax := xmin := x{1); end; 
for i := 2 step 1 until ™ do NEWLINE (N, 2); SPACE (N, g); 
begin for j := 1 step 1 until S do outstring (N, EM); 


if x[{] > xmax then xmax := x{[i] else NEWLINE (N, 2); SPACE (N, g); outreal (N, xmin); 
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outreal (N, xmax); 
outreal (N, ymin); 
go to end; 
N1A: 
ymax := ymin := y[1]; 
for i := 2 step 1 until m do 
begin 
if y[i] > ymax then ymax := y[i] else 
if y[i] < ymin then ymin := y[i] 
end of hunt for maximum and minimum values of y when 
n=1; 
go to escape; 
end: 
end of graphplotter 


outreal (N, ymax); 


Remark on Algorithm 412 [J6]| 
Graph Plotter [Joseph Cermak, Comm. ACM 14 (July 
1971), 492-493] 


Richard P. Watkins [Recd. 31 Jan. 1972], Mathematics 
Department, Royal Melbourne Institute of Technology, 
Melbourne, Australia 3000 


This algorithm is not functionally identical to Algorithm 278 
as Claimed. If the x{/] values are not uniformly spaced or if m > L, 
it is possible for two or more of them to correspond to the same 
printer line. In this case, the array id will contain only the largest 
of the values of i and only one set of y{i, /] values, corresponding to 
that value of 7, will be plotted. 

The array ind is redundant. The following changes enable 
plotL to take over the functions of ind (where all line numbers refer 
to lines relative to the label escape): 


a. Line 4. Replace 


for i := 1 step 1 until Z do p/orL[i) := 1 
by 
for i := 1 step 1 until L do ploiL{i] := 0 


b. Line 9. Replace 
plotL{r) := 0; ind[r] := 7 
by 

plotL\r| := i 

c. Line 21. Replace 

if plorL{i] = O then 

by 

if plotL\i| > 0 then 

d. Line 24. Replace 
plotS {1 + entier(0.5 + ¢ X Qlindli|} —ymin))| := 3 
by 


plotS |1 + entier(0.5 + 4 X (lplotL{ij] —ymin))| := 3 
e. Line 27. Replace 
plotS (1 + eatier(0.5 + ¢ X (vlindtil,j) — ymin))| := 7 4+ 2 


by 
plotS\\ + entier(0.5 + gq X (yiplotL\il,j) — ymin))| := 7 + 2 
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(The referee has noted that there is a typographical error on the 
fifth line before the line labeled escape. Replace 


for / := step | until 1 do 
by 
for j := 1 step | until do 


He has also noted that the array declaration for ind should be 
deleted if the above changes are made.—L.D.F.) 


COLLECTED ALGORITHMS FROM CACM 


Editor’s note: The algorithm described here is available on magnetic 
tape. The text plus the listing of the algorithm will be printed in the 
Collected Algorithms from CACM. The charge for the taped algo- 
rithm is $16.00 (U.S. and Canada) or $18.00 (elsewhere). If the user 
sends us a small tape (wt. less than I |b.) we will copy the algorithm 
on it and return it to him at a charge of $10.00 (U.S. only). All orders 
are to be prepaid with checks payable to “ACM Algorithms.” The 
algorithm is recorded as one file of BCD 80-character card images at 
556 B.P.I., even parity, on seven-track tape. If requested, the algo- 
rithm is supplied at a density of 800 B.P.I. The cards for the algorithm 
are sequenced starting at 10 and incremented by 10. The sequence 
number is right-justified in column 80. Although every attempt is made 
to insure that the algorithm conforms to the printed description, no 
guarantee is made, nor is there a guarantee that the algorithm is 
correct.—L.D.F. 


* Work performed under the auspices of the U.S. Atomic 
Energy Commission. {This research was supported in part by 
National Science Foundation research grant number NSF-GP-16071 
from the Division of Mathematical and Physical Sciences. 

Copyright © 1971, Association for Computing Machinery, Inc. 

General permission to republish, but not for profit, an algo- 
rithm is granted, provided that reference is made to this publica- 
tion, to its date of issue, and to the fact that reprinting 
privileges were granted by permission of the Association for 
Computing Machinery. 
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Algorithm 413 


ENTCAF and ENTCRE: 
Evaluation of Normalized 
Taylor Coefficients of an 
Analytic Function [C5| 


J.N. Lyness,* Argonne National Laboratory, Argonne, 
IL 60439, and G. Sande,t Department of Statistics, 
The University of Chicago, Chicago, IL 60637 (Recd. 
17 June 1968, 12 Feb 1970, and 20 July 1970) 


Key Words and Phrases: Taylor ceefficients, Taylor series, 
Cauchy integral, numerical integration, numerical differentiation, 
interpolation, complex variable, complex arithmetic, fast Fourier 
transform 

CR Categories: 5.12, 5.13, 5.16 


Description 

Introduction. Two subroutines, ENTCAF and ENTCRE, 
coded in ANSI FORTRAN are described here. ENTCAF may be 
used to calculate approximations rta{” to a set of normalized 
Taylor coefficients 


ra, = rf@(g)/s! 


The values of r and ¢, a complex number, are provided by the user 
together with a function subprogram that represents f(z) as a 
complex-valued function of a complex variable. The user also pro- 
vides a value of ¢,e, , the required absolute accuracy. The routine 
returns an accuracy estimate e,,; together with approximations 


8 _(m) 


ra, and anumber m. These are supposed to satisfy 


SS. OD 2 yee 2s (1.1) 


ras” — r'a,| < es: 5 = 0,1,2,...,m— 1, 
rae < eee S=m,m-+1,.... 


(1.2) 


A result status indicator NCODE is output. If ees: > €reg this gives 
a brief indication of why the required accuracy was not achieved. 
ENTCRE carries out the same task as ENTCAF in the case that 
¢ is real and also that f(z) is real when z is real. In this special and 
common case, ENTCRE is about twice as economic as ENTCAF. 
Outline of method. The Taylor coefficients a, occur in the Taylor 
series 


fe = = a(z—-s)', |2z-f| <Re, (2.1) 


where R, is the radius of convergence of the Taylor series. Cauchy’s 
theorem provides a set of integral representations. One of these is 
re (z 

f(z) d 


Omi (z oe Zz r<oRe, (2.2) 
Choe 


ra, = 


where C, is the circle |z — ¢| = r. The approximation r’a{” is 
obtained by replacing the integral in (2.2) by an approximation 
based on an m-point trapezoidal rule approximation. Specifically, 


m-1 


=m » exp (— 2zijs/m) f(g + rexp (2rij/m)), (2.3) 


s=0,1,...,m— 1. 


8 8 (m) 
ra; ~Tras 
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The calculation is in two parts. The first part (stages 1, 2, and 3) 
is iterative in nature. Using (2.3) the approximations as” with 
m = 1,2,4,8, --- are calculated. The function values are retained. 
The convergence criterion is based on the circumstance that the 
true value 


ao = f(t) | | (2.4) 


of one of the approximations a{” may be determined by a single 


function evaluation. A rather involved convergence criterion based 
on the orderly approach of the sequence a;”, m = 1,2,4, ..., to its 
limiting value ao is used. This is described in some detail by Lyness 
[8]. 
When the convergence of as” to ay has been achieved the routine 
carries out the second part (stage 4). This consists of evaluating 
ras” from (2.3) for s = 0,1, --» ,  — 1 using the function values 
calculated and retained during the first part. A fast Fourier trans- 
form technique is used for this calculation. This is particularly 
appropriate since m is a power of two. The derivation and imple- 
‘ mentation of this technique is described in Gentleman and Sande 
(5, pp. 566-7]. The specialized version used in ENTCRE is 
described in Sande [9]. 
Restrictions: theoretical. There are two restrictions of a theo- 
retical nature. 
1. The value of r must be less than the radius of convergence, 
R., of the Taylor series. So long as this condition is satisfied, it 
can be shown (see [5] and [8]) that 


| r’a, | < Kp’, 
[ ras” — ras| < Ke™**/(1 — p™), 


(3.1) 


where p is any number greater than r/R, and K depends on p. Thus 
the approximations approach their limiting values and there are 
only a finite number. of normalized Taylor coefficients whose 
magnitude exceeds ¢,e,. If this restriction is violated, that is, a 
value of ry > R, is chosen, then in general the sequence r’a{” 
converges, but not to rsa, . Instead it converges to the integral on 
the right m (2.2), but (2.2) is not generally vailid if r > R,. Thus 
the routine itself fails to converge since aj”” does not approach 
F(¢) in the limit of increasing m. — 

2. The function f(z) must not be an odd function of (z — &). 
While the convergence criterion based on (2.4) has much to recom- 
mend it, it does have one serious drawback. If it happens (as it does 
in the case f(z) = sin (z); ¢ = 0) that 


(EX) = fe He), (3.2) 


then every approximation a§” is zero, as is the true value a). The 


routine then finds that it converges immediately. In this case the 
problem should be reformulated. One defines g(z) = f(z)/(z — £) 
or g(z) = (z — $)f(z). The Taylor coefficients A, of g(z) = ¢ are 
then calculated using ENTCAF. A, is the same as a4; OF @s-1 aS 
the case may be. 

Restrictions: practical. There are two principal practical re- 
strictions. These arise because (1) the computer uses finite length 
floating-point arithmetic; (2) execution cannot be allowed to 
continue indefinitely; at some stage it has to terminate whether 
or not the calculation is complete. 

An output status parameter NCODE indicates to the user 
whether the results have been significantly affected by either of 
these restrictions. 

1. Roundoff error. The routine requires as an input parameter 
the machine accuracy parameter ey. The approximations r°as” 
giyen by (2.3) are of such a form that an estimate of the roundoff 
error level is 


en) = eM max [fe + rexp Qnij/m)) |. (3.3) 
Dae! (1 Soe 


If, at any stage it appears that 


erig < 10k"), (3.4) 


the routine internally replaces «eq by 10 «{” and either terminates 
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(input NCODE negative) or continues with the calculation (input 
NCODE nonnegative). 

2. Physical upper limit. This is defined by an input parameter 
NMAXxX. Iterations inthe first part to calculate af”, m = 1,2,4,8,..., 
with m < NMAX are possible. If convergence has not been achieved 
by this stage, the calculation is completed. 

The output status parameter NCODE is +1 if all went well. In 
general NCODE = 0 if the calculation was terminated; is positive 
if it converged and negative if it did not converge; has magnitude 
1 if roundoff error. was not observed; and has magnitude 2 if 
roundoff error was observed. 

If NCODE # 0, the returned value es, corresponds to the 
estimated accuracy of TCOF(J) whether or not convergence or 
roundoff error occurred. If NCODE = 0, the quantity 10 «:”) is 
returned in place of es, . 

Comments. The algorithms described here deliver approxima- 
tions to a.set of normalized Taylor coefficients ra, . It is natural 
to ask why this choice of output was made, rather than perhaps a 
set of Taylor coefficients a, or a set of derivatives f@(¢). The most 
immediate reason is that the algorithm naturally provides a set or 
normalized Taylor coefficients to a uniform absolute accuracy. 
The user specifies r and ¢,e, only. If, for example, one is interested 
in a set of derivatives, the specification of the accuracy requirements 
becomes very much more complicated. However, if one looks 
ahead to the use to which the Taylor coefficients are to be put, one 
finds in many cases that uniform accuracy in normalized Taylor 
coefficients corresponds to the sort of accuracy requirement which 
is most convenient. 

As an illustration we conside~ a very trivial problem. We wish 
to represent f”(x) as a polynomial in the interval (—/,/) to an 
accuracy E£. Clearly 


oO 


fx) = ¥ sls — 1a. xt? = > s(s — l)asr' ex (4.1) 
am? 


amd 


A very crude approach might be to take r = / and e = r°E/6. In 
this case the error in the sth term is less than s(s — 1)E(x/l)*-?/6. 
One cannot be assured that for x ~~ / these errors may not cooperate 
in such a way as to lose the required accuracy. However, if r is 
chosen to be greater than / and e = r°(1.— //r)3E/2 then it follows 
at.once that if the allowed error ina,, r* is less thah e, the error in 
f”(x) is less than E. These two approaches represent extremes. 
Neither take into account that the sequence a,r* itself approaches 
zero and for high values of s it is unnecessary to bound the error 
in omitting such a term by e. A more complicated formula based on 
(3.1) is derived by Lyness and Delves [5], eq. (2.9). But the under- 
lying feature of any of these approaches to approximating (4.1) 
is that a uniform absolute accuracy for a,r*,s = 0,1,2,..., is very 
convenient for this problem. If the algorithm instead calculated 
f(0) to a specified relative accuracy, the determination of the 
accuracy to use in this problem would be very much more involved. 

Possible modifications. The general approach to a numerical 
calculation by means of the numerical evaluation of contour 
integrals is at present an open field for investigation. The algo- 
rithms described here may be used in several problems known to 
the authors. These are: (a) determination of zeros of analytic func- 
tion [7, 1, and 5]; (b) numerical differentiation [7, 6]; (c) numerical 
quadrature [8]. 

In particular applications, modifications of ENTCAF or 
ENTCRE can lead to more efficient calculations. Possible modifi- 
cations include: (a) Provision for calculation of only some of the 
Taylor coefficients, for example, s even or s < 12; (b) Provision 
for a ‘‘subsequent return option” which allows the same calculation 
to be taken up at a later stage if it is found subsequently that higher 
accuracy is required; (c) Provision for an “early exit.” Used in 
conjunction with (b) this would enable the program to consider 
intermediate results to determine whether to continue with the 
current values of r and e, before a high investment of computer 
time has been made. 

In fact, ENTCRE is a special modification of ENTCAF de- 
signed for a particular application, ¢ real, f(x) real. The output 
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status parameter NCODE is of particular use in these applications 
since it allows appropriate remedial action to be taken under pro- 
gram control. 

Algorithms which include modifications (b) and (c) above have 
been used by the first author. However, these involve complicated 
logic and are strongly connected with the particular application. 
The algorithms listed here may be modified by the user in par- 
ticular applications for any large scale use. However, in pilot runs 
or small scale calculations they are adequate as they stand. 

Comparisons and examples. \n [6] and [8], several numerical 
examples are given, and comparisons with other methods are made. 
So far as the determination of zeros of an analytic function is con- 
cerned, the method described in [6] has some advantages in a 
global situation, but should not be used locally. For numerical 
quadrature, the method described [8] is definitely superior to 
standard methods if there is a nearby pole or singularity of a special 
type. In these cases a proper evaluation depends on the details of 
the problem under consideration. 

It is in problems involving numerical differentiation that the 
method on which these algorithms are based show up to great 
advantage. This is.simply because, once the use of complex function 
values is allowed, the numerical instability associated with numeri- 
cal differentiation may be avoided. 

In [6],:a different but related method for numerical differentia- 
tion is described. The remarks about the roundoff error given there 
apply to these routines also. There as an example, the calcula- 
tion of f(0) was considered for 


f(x) =. e*/(sin?(x) + cos%(x)). 
The actual value of this derivative is an integer, namely 
fO) = —164. 


In order to provide some sort of comparison, a special algo- 
rithm for numerical differentiation based on polynomial interpola- 
tion was written using only function values at real abscissas. A set 
of several’ dozen numerical experiments were carried out on a 
machine for which ey = 3 X 107". The closest result was in error 
by 10-; the worst result had the wrong sign. 

ENTCRE was then used for the same problem in an attempt to 
obtain seven-digit accuracy, i.e. an absolute accuracy of E = 107. 
A sequence of values of r was used, with in each case éreg = 7° X 
10~4/5! and input parameter NCODE = —1 to secure immediate 
termination if roundoff error prevented a sufficiently accurate 
result from being attained. With r = 0.1 and r = 0.2, execution 
terminated using in each case one complex and three real function 
values. With r = 0.4, the result 


f©(O) = —164.00000013 


was obtained at a cost of 15 complex and three real function values 
(m = 32); the accuracy estimate given by the algorithm was 


Eest = €est S!/r? = 6 X 107%. 


Incidentally, an absolute accuracy of less than 10-4 was estimated 
and a better accuracy obtained for r = 0.3, 0.4, 0.5, 0.6, 0.7 
with m =: 32, 32, 64, 64, 128, respectively. For r = 0.8 andr = 0.9 
the routine failed to converge with m = 128 giving absurd results 
and estimates. These latter values of r are greater than the radius 
of convergence R, = 7/4. 

The role played by the output status parameter NCODE is 
illustrated in this example. With r = 0.1 and r = 0.2, the value of 
NCODE indicated immediately that the results were not to be taken 
seriously because of roundoff error. With r = 0.8 and r = 0.9, 
the value of NCODE indicated that the results were not to be taken 
seriously because of lack of convergence. Thus the calculation 
could have been carried out completely under program control, 
with a driver program finding for itself an appropriate value of r. 
An efficient program for this application would require modifi- 
cations (a), (b), and (c) of the previous section. 

The testing of the algorithm included the calculation of high- 
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order derivatives. In general, it frequently happens that even when 
analytic closed expressions are known for such derivatives, these 
expressions are difficult to evaluate because of excessive subtraction 
error. Cases in point include the functions e*/x and sin(x)/x. 
Programs were written to evaluate the first 80 derivatives of these 
functions at x = 5, 10, 20, 40, and 80. It turned out that meaningful 
results could be obtained. For example, for f(x) = e*/x, using 
r = 32 and ere, = 10-”, ENTCRE gives 


£)(40) = 3.6560469 x 10% 


with an estimated relative accuracy of 2.5 X 10. These results 
were compared with those obtained using an algorithm due to 
Gautschi and Klein {2, 3]. In all cases examined corresponding 
results agreed to within the calculated error estimate. 
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Algorithm 


SUAROUTENE ENTCRE ¢€ CFUNe ZETAs RCERC»s EPREQe FRMACHs NMAXe NCODEs 
e FPEST® NTCOFs TCOFe WORK, NTAHs SINTAK ) 


eo EVALUATION OF NORMALIZED TAYLOR CORFFICIENTS ## 
on OF & REAL ANALYTIC FUNCTION oe 


so GENERAL PURPOSE aH 

THIS RQUTINE FVALUATES A SET OF NORMALIZED TAYLOR COEFFICIENTS 
TCOF(C je1) = (ROIRC#HI) % (STH DERTVATIVE GF CFUMN(7) AT Z=ZETA) 
DIVEDED BY FACTOWIAL (J) see J = Oele2edeasNMAXale 

TO A tIMNTFORM ABSOLUTE ACCURACY *##FPEST#4 USING FUNCTION 

VALUFS OF CFUN(Z) AT POTNTS ITN THE COMPLEX PLANE LYING ON 

THE CIRCLE UF RAUIUS ##RCIRC#*® WITH CENTER AT Z = ZETA. 

THIS ROUTINE IS A SPECIAL VEKSTON OF ENTCAR FOR USE WHEN 

7JETA 1S REAL AND ALSO CFUN(Z) 15 RFAL WEKEN Z IS REAL « 


ae THEORETICAL RESTRICTIONS ## 
RCIRC MUST HF SMALLEX THAN THE RANIUS OF CONVERGENCE OF 
THE TAYLOR SERTES. THE PROBLFM HAS TO He REFORMULATED 
SHOULD CFUN(Z) HAPPEN TO HE AN OUD FUNCTION 
OF (7 = ZETA) «6 THAT [IS IF THE RELATIGN 
8H =CFUN(H(Z-ZETA))=CFUN(Z=ZF 14) HH TS AN IDENTITY. 


ta REQUIREMENTS FOR CALLING PROGRAM #4 
CALLING PROGRAM MUST CONTAIN CONTROL STATEMENTS DESCRIBED 
NOTES (4) AND (4) BELOW. {7 MUST ALSO ASSIGN VALUFS TO 
INPUT PARAMETERS. THE ROUTINE REQUTRES TWO SUBPROGRAMS > 
HF COF (LISTED AFTER ENTCRE) ANU CFlIN (SEE NOTE (4) BELOW). 


#aTNPUT PARAMETERS 4 
(1) CFUN NAME QF COMPLEX FUNCTION SUBPROURAMs 
(2) 7FTA RFAL POINT AAOUT wHICH TAYLOR EXPANSION IS REQUIRED 
(3) RCIRC RADTUS (REAL) 
(4) EPREQ THE ABSOLUTE ACCURACY (REAL) TO WHICH THE 

NORMAL EZED TAYLOR COEFFICENTSs VCOF(J)« ARE REQUIRED 

(5) FORACH THE MACHINE ACCURACY PARAMETER (REAL) 
(OR AN UPPER BOUND ON THE RELATIVE ACCURACY OF 
NUANTITES LIKELY JO HE ENCOUNTERED) « 
PrYSITCAL UPPFR LIMIT ON THE STZE AND LENGTH 
OF THE CALCULATION, 
COEFFICIENTS CALCULATED wILt BE THAT POWER OF TwO 
LESS THAN OR FQUAL TO NHAX. NMAX ITS ASSIIMED TO 
HE AT LEAST 4. (SEE NOTE (3) HELOW). 
eGE.0 THE KOUTENF WILL DO AS WELL AS TF CAN, 
ef TL) THE ROUTINE WILL ABOPT AT AN EARLY STAGE 
LF TRE REQUIRED ACCURACY CANNOT BE ATTATHED 
HECAUSE OF ROUrI OFF ERROR. 


(6) NMAX 
THE MAXIMUM NUMBER OF 


(7) NCONE 


ANAAGAANANRANAAANANANAANANAANNHANGDADAANANNANANANANNAANANAANAANN 
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C (12) NIAR LN NURMAL RUNNINGs NTAB SHOULD BE SET TO ZERO C (3) USES NMAX TO CHEZK PHYSICAL LIMIT. IF THIS HAS BEEN 
c HELFORE THE FIRST CALL TO ENTCREs BUT LEFT ALONE C REACHED. Il GOES TO STAGE FOUR(A) SETTING NCODE = <1 OR -2. 
c AFTER THAT. (FOR MORE SOPHISTICATED USFe SEE C (4) OTHERWISE CONTINUES NEXT ITERATION BY GOING TO STAGE THREE 
c OUTPUT PARAMETERS (12) AND (13) AND NOTE (2) BELOW) c 

Cc C # CALCULATION OF FIRST NTCOF TAYILOR COEFFICIENTS IN 

Cc fe QUEPUT PAKAMETERS it C #% STAGE FUURTA) 

Co (1) (Ade (310 (409 (5) 006) TINENTICAL WITH INPUT VALUES. C A VERSION OF THE FAST FOURTER TRANSFORM USING A WORK ARRAY 
C (7) MNCODE RESULT STAIUS INDICATOR. C IS USFN, THE ARRAY ##WORK## 1S USED ONLY DURING THIS STAGE. 
c TAKES OWE OF FIVE VALUES AS FOLLOWS» C THF wWoRK ARRAY ALLOWS THE PERMUTING OF TINUICES ASSOCIATED 
c = ¢)e CONVERGER NORMALLY. C WITH [N=PLACE FFTS TO SE SUPPRESSED. THE FFT CALCHLATES 
c =e). DLO NOT CONVERGE. NO ROUND OFF ERROR C THE NFECCESSARY SUMMATIONS EXCEPT FOR DIVIDING BY NTCOFe 
c TROUBLE « Cc 

Cc = +2. CONVERGFOs BUT AIETH A HIGHER TOLFRANCE C #® SETTING OF REMAINING TAYLOR CUFFFICIENTS IN STAGE FOUR(B) ## 
Cc SET mY THE ROUMD OFF LEVEL. (EPEST.GP.FPREQ) C THE CANVERGENCE CRITERION ALLOWS US TO INFER THAT THE 

Cc = <2. OH) NOT CONVERGE IN SPITE OF HIGHER C NORMALTZEN FAYIUR COEFFICIENTS OF URDER GREATER THAN NTCOF 
c TOLERANCE SET KY ROUND OFF LEVEL. C ARE ZERO TO ACCURACY PEST. THEY ARE EVALUATED AS BEING 
Cc = 0. PUN WaS ABORTED RECAUSE EPREQ IS C EXACTLY ZERO. 

Cc UNATTATNARLE DUE TO ROUNTD OFF LEVEL AND ENPUT COMPLEX CFUN 

C NCObDE LS NEGATIVE. REAL ZF LAeRCIRCeEPRFQeEPMACH®SEPEST 

C (RB) FREST ESTIMATE OF ACTUAL UNTFOKM AHSOLUTE ACCURACY TNTEGER N4AXeNCODE sNTCOF sNTAB 

c IN ALL TCOF. FACEPTs IF NCODE*EQ.0 ESTIMATE REAL TCOF (1)5 WORK (1)¢ SINTAB (1) 

Cc OF kOUND OFF LEVEL. TNTEGER NAHORT s+ NCONVeNUTSP »NOOL IMs NPREVsNROUND ,NSPACE 
C (9) NTCOF NUMBER OF AONTRIVTAL VALUES OF TCOF ACTUALLY INTEGER Je JCONJSs JCOS 9 JF ROMs JRCONS 9 JREFL eISING STO 

Cc CALCULATEN. THEY ARE BASED ON NTCOF/2+2 CALLS RFAL APPROX sCUSDIF sERCOF sEPMINSEPROVEPI29EP42 

Cc QF CFUN (THREE CALLS WERE FOR PURELY REAL ERROR] sERROR2Z ERROR 3 sERRORG 9 EXACT oF MAX OF VAL IM 

Cc REAL ARGUMENT). REAL FVALRE eRCOS eRSINe SAFETY # SCALE e SUPPER» TWOPT 

C (10) TCO HEAL DIMENSTON (PIM). APPROXIMATIONS TO THE COMPLEX FVALeZVAL 

Cc NORMALIZED TAYLOR COEFF ICTENTS» EXCEPT whEN COMPLEX CMPLX 

Cc OUTPUT NCODE = Us (SEE NOTE (3) BELOW) C tae STAGE ONE aon 

C (12) vaRK INFERNAL WORKING AREA OF REAL UIMENSION (DIM) C ween wn cnsc rarer eseese 

c (SER NOTE (3) SELOW.) CONTENTS IS TDENTICAL WITH C INITIALISE BOOKKEFPING PARAMETERS AND EXACT VALUE OF TCOF (1). 
Cc THAT OF TCOF es NROUND = 1 

C (be) 1 AR NUMHF RP OF VALUES OF STNTAB® AVATLAALE NaRORT = 9 

c (SEE NOTE (2) HKELOw). TF (NCOVDE SL T.9) NAHORT = 1 

C (23) SINTAK REAL OTMENSION (D1M/4). (SEE NOTES (2) AND (3) EPCOF = ERREQ 

c HELOW.) SENTOA(G+1) = SIN(PE#S/2#NTAB) SAFETY = 10.0 

c J = OelbateeeetiiAH-1e ZVAL = CMPL X(ZE 6 A020) 

c (A QUARTER CYCLE) OTHER LOCATIONS ARE EMPTY. FyaAl = CFUN(ZVAL) 

c FYALRE = REAL (F VAL) 

Cc aes NOTES ON [NPUT/OUTPUT PARAMETERS itt EXACT = FVALRE 

C NOTE (1) ®# CODE 1S USF BOTH AS TNPUT AND OUTPUT PARAMETER. C ae STAGE Two te that 

C NORMALLY I} RETAINS THR VALUE «1 ANI) WEED NOT BE RESET C tre ec nse rrert tert eee- 

C RETWEEN NORMAL RUNS, C FIRST THREE ITERATLONS € THOSE WITH NTCOF = 1le2e4 >, 

C NOTE (?) HH THF APPEARANCE OF NTAH AND SINTAK IN THE ZVAL = CMPLACZETAtRCIRC90.0) 7. 
C CALLING SEQUFNCE ALLOWS THE USER FO MAKE USE OF = OR TO FyAL = CFINGZVAL) 

C PRECOMPUTF = THESE NUMHERS IN ANOTHER PART OF THE PROGRAM FyALRE = REAL (F VAL) 

C SHOU HE SU DESIRE. STAR MUST BE & POWER OF TWO OR Oe APPROX = FVALRE 

C NOTE CAD8® THE APPEARANCE OF MMAXeTCOF ¢WORK AND STNTAB IN FAX = ABS(F VALE) 

C THE CALLING SEQUENCE ALLOWS THE SCOPE OF THE SUBPROGRAM AND TCOF (1) = FVALRE 

C THE AaQUNT OF STOPAGF TQ RE ASSIGNED KY THE CALLING FRRORZ = AAS CAPPRKOX-EXACT) 

C PROGRAMse WHICH SHOULD CONTAIN A CONTROL STATEMENT TO THE 2VAL = CMPLX(CZETASRCIRC 90.0) 

C FOLLOVING FREFECT = FyAL = CFUN(ZVAL) 

C REAL TCOF (UTM) « WORK(UIM) 6 SINTABCIIMs/4) FVALRE = REAL(FVAL) 

C WHERE SIM [5 NORMALLY A POWER OF TwO. NMAX IS NORMALLY APPROX = 0.5% (APPROX+FVALRE) 

C EQualL TO DIMs BUT MAY HE LESS THAN HIM, FMAX = AMAXI(FMAXsARS(FVALRE)) 

C NOTE (4)*# CFUN(Z) IS A USER PROVIDED COMPLEX VAHUED TCOF (3) = FVALRE 

C FUNCTION SHHPROGRAM WIth A COMPLEX VALUFD ARGUMENT, THE FRRORA = ARS CAPPROX-<E XACT) 

C CALLING PROGRAM MUST CONTAIN CONTROL STATEMENTS aS FOLLOWS - ZVAL = CMPLXCZETASRCERC) 

CC EXTEONAL CrUN COMPLEX CFUN FVAL = CFEUN(2VAL) 

c FVSOLRE = REAL (FVAL) 

Cc te BOOKKEEPING PARAMFTFRS FOR STAGE ONE #4 FyALIM = AIMAG(FVAL) 

Cc NCONY 1 CONVERGENCE ACHIEVED. APPROX = 0,57 (APPROX*FVAL RE) 

Cc -1 NO COMVERGENCF 4CHTevel. FiAAX = AMAXI(FMAX s+ CAHS (FVAI_)} 

Cc NROUNIY 1 MO ROUN) OFF TROURLE OSSERVED. TCOF (2) = FVALRE 

c 2 ROUND OFF TROUALE ORSERVED. TCOF (4) = FVALIM 

C NABOWY 0 UPDATE TOLERANCE AND CONTINUE ON APPEARANCE OF FRROR] = ABS (APPROX-EXACT) 

Cc OF POUND OFF TROUBLE. NICOF = 4 

Cc } TERMINATE WHEN ROUND OFF TROUBLE OBSERVED. FPRO = FMAX#SAFETY#EPMACH 

c Exact THe EXACT VALUE OF TCOF (LE) WHICH LS CFUN( ZETA). IF (EPRO.LTeEPCOF) GO TO 300 

C SAFETY THIS [IS a SAFETY FACTOR RY WRICH THE ROUTINE AVOIDS EPCOF = EPRO 

c THe Rou OFF CFvEL. IT Is SET To 1020 ANN NROUND = 2 

c APPEARS ONLY IN THE COMBINATION (SAFETT#EPMACH)*. TF {NABORT.EQ.0) GO TO 300 

Cc TO ALTER THIS FACTOR. OR TO REMOVE THE ROUND OFF MCODE = 0 

Cc ERWOR GUARD COMPLETELY« THE USKR NEEO ONLY ANJUST FPEST = FPRU 

Cc THe INPUT PARAMETFR FPMACH APPROPRIATELY. 60 TO 470 

Cc C tithe STAGE THREE ae at 

c aes OUANTETIES CALCULATED IN STAGE THREE (A) ## C eweeeen- ere ete n cee cen- 

C THIS TS THe FIRST PART OF ALTERATION NUMBER NTICOF. PRESENTLY C COMMENCE ITERATION NUMBER NICOF. 

C AVATE ARLE ARE = 300 CONTINUE 

C SINTAR(J#l) = SIN(PTH#U/28NTAR) 9 J = OsleedeeeeNTAH=1e NEREV = NTCOF 

C WE RFOUIRE THE SEQUENCE SIN (PI eu72# (NTCOFZ4) ) 9 NTCOF = 2#NTCOF 

C JS = Le deSeaee (NICOFS4$1) 6 C the STAGE THREE (A) bated 

C OIF CNTCOF ebb eG4#NTAR) THESF NUMHERS ARE ALREADY AVATLABLE IN (eee ee weewecen 

C THE STNTAR TABLE SPACED AT AN INTERVAL 2#NSPACE = A#NFAB/NTCOF. C UPDATE SINTAB TABLE IF NECESSARY. 

C OTHERWISE+ NTCOF = S#NTAB AND THE SINTABR TAHLE IS UPDATED. IF (4*NTAB.OE.NTCOF) GO TO 346 

C THIS TMVOLVES REAPRANGING THE NTAH VALUES AVAILABLE + IF (NTAKeGEe2) GO TO 310 

C CALCULATING AND STORING NTA SEW VALUES AND UPDATING SINTAW(1L) = 0.20 

C NTAS TO @*NTAB. SINTAH(2) = SQRT(0.5) 

Cc NTAB = 2 

Cc te QUANTITIES CALCULATED IN STAGE THREE (B) ## 50 TA 340 

C  LTERATIONS ARE NUMBERED Sel6932eeeeAT THE END OF ITERATION 310 CamTINut 

C NTCOF. THE NICOF/2 + 1 COMPLEX FUNCTION VALUES AT NDOLIM = NTAH=1 

C ABSCISSAS REGULARLY SPACED ON UPPFR HALF OF CIRCLE ARE NO 320 J = LeNDOLIM 

C STOR IN THE TCOF VECTOR AS FOLLOWS « IFROM = NTARSJ 

C TCOF(J#1) = REAL PART OF CFUN(Z(J)) J=0019290eeNTCOF/2. ITO = et JF ROM 

C  TCOF(NTCOF*J41) = IMAGIWAPY PART OF CFUN(Z(J)) SINTAHK (UTO#}) = SENTAB(UFROM4)]) 

Cc J=lLetreee (NTCOKS2=1) 6 E 320 CONTINUE 

C WHERE NTAB = e#NTAK 

C ZtS) = ZETA * RCIRCPCE XP (e#P TSE YE # I/NTCOF) TWORPT = KeUHATAV(1.9) 

C THIS IMVOLVES A REARRANGEMENT OF THE NTCOF/4 + 1 FUNCTION COSDIF = COS(TH#OPT/FLOAT (4#NTAB)) 

C VALUFS AVATLARLE AT THE START OF THE ITERATION AND THE NOOLIM = NTAB=3 

C CALCULATION OF A FURTHER NTCOF/4 FUNCTION VALUES. IN NN 330 9 = LeNDOLIMe2 

C ADDITION FMAX ANU APPROX APE CALCULATED. THESE ARF SUNTAB(J#1) = COeS#STNTAB (SD) © 0058S INTAH (U42)) /COSDIF 
Cc FMAX MAXIMUM MODULUS OF THE FUNCTION VALUES SO FAR 339 CONTINUE 

Cc ENCOUNTERED « SINTAB(NTAB) = COSDIF 

C APPROX AN APPROXIMATION TO TCOF (1) 340 CONTINUE 

Cc RASED ON THESE FUNCTION VALUES. Cae STAGF THREE (13) iad 

re C se---- lta ttatetetatatel erreee -- 

c ts QUANTITIES CALCULATED AT STAGE THREE(C) ## C UPDATE LIST OF FUNCTION VALUFS IN [COF+ 

C FRRORI CURRENT VALUE OF THE FRROR = ANS (APPROX-EXACT). C CALCIN ATE FMAX AND APPROX. 

CC ERRORAsERRURIeERRORS VALUES OF ERROR AT END OF THREE NUOLIM = NPPEV~1 

c PREVIOUS LTFRATIONS. Ny 356 4 = LeNDOLIM 

C FRPMACH MACHINE ACCURACY PARAMETER. (INPUT PARAMETER) AFROM = NPRFV=J 

C EPREQ = RFQUTRED ACCURACY. (INPUT PARAMETER) JTO = Ot JE ROM 

Cc EPROA HIGHEST ACCURACY REASONABLY ATTAINABLE IN VIEW OF TCOF (gTO*L) = TCOF (UFROM4)) 

c THE SIZE OF THE FUNCTION VALUES SO FAR ENCOUNTERED. 359 CONTINUE 

Cc (=10.0*EPMACHHF MAX) SIPP#tR = 0.0 

CC EPCOF CURRENTLY REQUIREQ ACCURACY (=AMAX1(EPREQ+¢EPRO)) NoOLIM = (NPREV/2)-1 

Cc EPFST ESFEIMATE OF CURRENT ACCURACY. (THE MAXIMUM OF EPRO NSPACE = (44NTAR) /NTCOF 

c AND A FUNCTION OF ERRORS teée3 AND 4) {OUTPUT PARAMETER) Do 360 WF PeNDOL TM ee 

Cc ISIN = J#NSPACE 

C #* CONVERGENCE ANY) TERMINATION CHECKS IN STAGE THREE(C) ## JCOS = MNTAB=JSIN 

C (1) USES FMAX TO RAISE EPCOF ABUVE ROUND OFF LEVEL. IF RSIN = RCIRC#SINTAB(USINGL 

C THIS 1S NECESSARY AND THE [INPUT VALUE OF NCODE IS NEGATIVE®s RCOS = RCIRC#SIMTAG(UCOSel 

C IT TROMINATES SETTING NCONE = 0. JCONJ = NICOF-J 

C (2) USES APPROX TO EVALUATE CONVERGENCE OF TCOF (1) TOWARDS ZVAL = CMPLX(ZETA*RCOOSeRSIN)D 

C EXACT. IT MAY ASSIGN CONVERGENCE AND GO TO STAGE FOUR(A) FVAL = CFUN(ZVAL) 

C SETTING NCODF = #1 OR #2, FVALRE = REAL (FVAL) 


COLLECTED ALGORITHMS (cont.) 


Aaa 


Aaannnnaannnannaaanna 


360 


Pao 


FVALIM AIMAG(F VAL} 
SUPPER SUPPER +FVALRE 
FMAX = AMAX] (FMAXeCABS(F VAL) ) 
TCOF (Jel) = FVALRE 
TCOF CUCONJ*+1) = FVALIM 
JREFL = NPREV-J 
JRCONJ = NTCOF~JREFL 
7VAL = CMPLX(ZETA&=RCOSoRSIN) 
FVAL = CFUN(CZVAL) 
FVALRE = HEAL (FVAL > 
FVALIM = AIMAG(FVAL) 
SUPPER = SUPPER+FVALRE 
FMAX = AMAX] (FMAXsCABS (FVAL}) 
TCOF (JREFL +L) = FVALRE 
TCOF (JRCONJ+1) = FVALIM 
CONTINUE 
APPROK = 0.5#APPROX+SUPPER/FLOAT (NPREV) 
STAGE THREE (C) La datiad 


CONVF AGENCE AND TERMINATION CHECK. 


370 


380 


CALCULATION OF FIRST NICOF 


400 


410 


420 


430 


440 


450 
ae 


FRRORS = ERRORG 
FRPORZ = ERROR 
FRRORe = ERROR] 
FRROR] = ARS (APPROX=<EXACT) 
EPRO = FMAX#SAFETY#EPMACH 
TF (EPROeLT-EPCOF) GO TO 370 
FPCOF = EPRO 
NROUNU = 2 
TF (NARORT.EQL9) GO TO 370 
NCODE = 0 
EPEST = EPRO 
Gg TQ 470 
CONTINUE 
FRRORS = AM4OX) (ERFRORG cEPRO) 
FRRORZ = AMAXL(ERROR3sEPRO) 
Fe4? = ERROR2* ( (ERRURAJERRORG) #7 (4,.0/320)) 
EPa2 = ERRAR?# ( (ERROR2/ERRORI) #H2) 
FEMIN = AMIN (ERRORZsEP329F P42) 
FRPFST = AMAXT CERRORLEPMINeEPRO) 
TF (RE PFST.GTLEPCOF) GO TO 3450 
MNCONV = | 
ag TO 400 
CONTINUE 
TF (2®NTCOF «LE eNMAX) GO TO 300 
NCONV = -1 
STAGE FOUR(A) i at tt 


CONT PNUE. 
NCODE = NCONV#NRCUND 
NHTSP = NTCOF 
COMT ENUE 
NNISP = NOTSP/2 
CALL HFCOF (NFICOF sNDTSPs TCOF oWORK »NTABsSINTAB) 
IF (NDISP.Gf.1) GO TO 430 
NN 420 J = beNTCOF 
TCOF (9) = WORK (YU) 
CONTINUE 
GN TO 4460 
CONTINUE 
NDESP = NDTSP72 
CALL RFCOF (NTCOF sNDISPeWORKs TCOF sNFABsSINFAB) 
TF (NDEISP.GTe1) GO TO 410 
CONTINUE 
SCALE = 1.Q/FLOATINTCOF) 
DO 450 J = LeNTCOF 


TCOF (J) = TCOF (9) #SCALE 
WORK (J) = FCOF () 
CONT PNUE 


STAGF FOURTA?) that 


SETTING OF REMAINING TAYLOR COFFFICIENTSe 


460 
470 


END 


IF (NTCOF.GEeNMAK) GO 10 470 
NOOLIM = NTCOF +1 
DO 460 J = NUOL TM eNMAX 
TCOF (I) = 020 
“MORK (I) = 0.0 
CONTINUE 
CONT TNE 
RETURN 
OF ENTCRE 
Faw 


SURROUTLINE HFCOF € NICOFs NOISPs TCOFs WORK: NTABe 


to HERMITIAN FOURTER CORFFICIENTS ## 


THIS ROUTINE DOS ONE PASS OF 


THE 


OPDER aT Tht ENT) OF THE LAST PASS. 
THE USF OF SEPARATE ARRAYS FOR INPUT ANI) OUTPUT OF THE 
PARTIAL RESULTS. THIS ROUTINE IS CALLFON ONCE FOR FACH PASSe 


)) 
{?) 
(3) 
(5) 
(A) 


wey GENERAL PHRPOSE ae, 


to INPUT PARAMETERS ## 


NTCOF NUMHER OF COEFFICIENTS TO HE PROCESSED. 
MOTSP MAXIMUM VALUE OF DISPLACEMENT TNDEX. 
TCOF {REAL) INPUT ARRAY. 

NTAH NUMBER OF ENTRIES IN SINTARS 

SUNT An (KF AL) TOHLE OF VALUES OF STNE. 


SINTARCS+ 1) =SIN(PLESZ24NTAB) » S200 be 2eeeNTAH@] 


FAYLOR COEFFICIENTS USING FoF eTe 


A FAST FOURIER TRANSFORM. 
INDEXING £S ARRANGED SO THAT THE COEFFICIENTS ARE IN 
THIS INOEXING REQUIRES 


Cc 


G 


Cc 


c 


ANMNAANDAANANIAIAANANANANANAANNNANANAAAN AANAANAANNANAAAAANAANAAN 


413-P 5- 


## QUTPUT PARAMETERS #4 
(4) WORK (REAL) OUTPUT ARRAY. 


aa INDEXING OF ARRAYS ## 
THE TwO POINT FUURIER TRANSFORM IS APPLIED TO THE POINTS 
OF TCoF WITH INDICES 
JO ESP#NPREV*JREPL AND JOTSP#NPREV+ JREPL ¢NHALF 
THE RESULTS ARE MODIFIED AY THE APPROPRIATE TWIDOLE FACTOR 
AND STORED IN WORK WITH INOICES 
JNISPHNNEXT*+JREPL AND JDISP#NNEXT*¢JREPL *NPREV 
WHERE 


NOTSP PRODUCT OF REMAINING FACTORS» 

NPREV PRODUCT OF PREVIOUS FACTORS. 

NME AT PPODUCT OF PREVIOUS AND CURRENT FACTORSs« 
NHALF PROVUCT OF PREVIOUS ANO REMAINING FACTORS. 
JREPL REPLICATION INDEX = Le2eeueNPREVe 

ANTSP HERMITIAN SYMMETRY IN THES INDEX RESULTS IN 


THYEE CASES. 

1) INITIAL POINT = JOISP#0. INPUT POINTS 

ARE PURELY REAL. AND OUTPUT POINTS ARE 

PURELY REAL. 

2) MIDULE POINT = JUISPSNDISP/2 - NOT 

ALWAYS PRESENT. INPUT POINTS ARE COMPLEX AND 
OUTPUT POINTS ARE PURELY REAL. 


3) INTERMEDIATE POINTS ~ JUISP=le2906(NDISP/2-1) 


- NOT ALWAYS PRESENT. INPUT POINTS ARE 
COMPLEX AND OUTPUT POINTS ARE COMPLE Xe 


ON INPUTs THE HERMITIAN SYMMETRY IS IN A BLOCK OF LENGTH 
2*NDISPs Tete THE POINT CONJUGATE TO JOISP IS 2#NDISP-JOISP. 
ON OUTPUTe FHE HERMITIAN SYMMETRY IS IN A BLOCK OF LENGTH 
NDISP, Tefe THE POINT CONJUGATE TO JOTSP 1S NDISP=JDISP. 

A HERMITIAN SYMMETRIC HLOCK HAS REAL PARTS AT THE FRONT 
IMAGINARY PARTS (WHEN THEY EXIST) AT TRE CONJUGATE 

POSITIONS AT THE BACK. 


THE THTODLF FACTOR CEXP(=PISFEYE#J/NDISP)» J=be2eee (NUISP/2-1) 
1S ORTAINED AS SEPARATE REAL AND IMAGINARY PARTS FROM 

THE SINTAB FARLF. THE IMAGINARY PART SIN(PI®J/NOISP) US 
FOUND AT A SPACING OF NSPACE=2*NTAA/NDISP IN SINTAR. 

THE RFAL PART 15 FOUND) AT A CONJUGATE POSITION IN THE TABLE. 


INTEGER NTCOF sNUISPeNT AB 

REAL TCOF (h)e #oRK CL)0e SINTAK (1) 

PFAL CSeTSoTUs (Oe Le RSerteRVeR] «SN 

TNTEGER JCONJSs JCOSs JDTSPe JREPL USING IT eUTCo Swe JWE OKT OOKTI 


0 


INTEGER KT20KT3 eK WO ok W1 eo KW2 ok M3 oS NHALE oNMIOL eNNEXT #NPREV eNSPACE 


NHALF = NICUF/? 

NeEREV = NTICOF/(2#NDISP) 
NNFXT = NTCOF/NUISP 
NaTDL = (NDISP=1) 72 


MS5PACE = (2*NTAG) /NDISP 
INTTEAL POINTS OF HLOCKS,. 
OH 100 JPEPL = LeNPREV 


KTO = JREPL 

KT] = KTO¢NHALF 
KWwO = JREPL 

KWl = KW+NPREYV 
20 = ICOF(KTO) 

Rl = TCOF(KTLD 
WORK (KWO) = RO*R) 
WORK (KW]) = ROR) 


100 CONTINUE 
INTERMEDIATE POINTS OF ALOCKS. 
TF (NMTOL.L Eel) GO TO 406 
Ny 300 JDISP = 1eNMTOL 
JCON) = NDISP-JDISP 
JSIN = JOITSP#NSPACE 
ACOS = NTAH-JSIN 
SN SINTAB(SSING 1D) 
cs SINTARCICOS+1) 
JT JOTSPeNPREV 
FC = GCONJHNPRE V 
Jw = JOTSPHNNEXT 
JwC = JCONJ#NNEXT 
40 200 JREPL = 1eNPREV 


KTOQ = JT *JREPL 

KT! = KTO+NHALE 

KT? = JTC+JREPL 

KTA = KI eenNHALr 

Kwe = JweJREPt 

Kwl = KWO+#NPREV 

Kw? = JWC+* J REPL 

KW3 = KW2+NPREV 

RO = TCOF(KTO) 

1Q = TCOF (KT 3) 

RP} = TCOR(KT2) 

Tl = =<TCOF(KTIL) 

RS = RutR] 

1S = [nelt 

RU = RO-|RY 

Iu = (0-11 

WOPK (KWO) = RS 

WORK (Kw2) = IS 

WORK (KWi) = RUHCS+ LUSH 
WORK (KwW3) = TU#CS@RUHSN 


200 CONTINUE 
300 CONTINUE 
400 CONTINUE 
MIODNLF POINTS OF “LOCKS. 
TIF (NDISP.tF el) GO TO 600 


JT = (NUDISP/Z2) #NPREV 
Ja = (NDISP/2) #NNEXT 
Oy SOM JRFPL = LeNPREV 
KTO = JT eJuREPL 
KT1 = KIQ*NHALF 
KWO = JweJREPL 
KWL = KWO+NPREV 
vO = ICOF(KTO) 
10 = YCOFCKTI) 
WORK (KWO) = 2.0#R0 
WORK (KW1) = 2.0410 


500 CONTINUE 

600 COMTINUE 
PE TURN 

FND OF HFCOF 
Fain 


SUBROUTINE ENTCAF (€ CFUN» ZET4s RCIRCs EPREGQs FPMACHs NMAXs 


« FPESTs+ NTICOFe TCOFs WORK» NTARs EXPTAB ) 


NCQDE>s 


COLLECTED ALGORITHMS (cont.) 


ANMAADADAADADADADDADANAAANADANIADADAGDANIAD A AND ANAND AANDDAAANADADDAANDNGBAADDBANAADANDADAAAADAANBDAAAABANANAADDAAANDAANDADAANNAANAAANDADAAANAAANAANAAAN 


4@ EVALUATION OF NORMALIZED TAYLOR COEFFICIENTS ## 
te OF AN ANALYTIC FUNCTION we 


ee GENERAL PURPOSE ## , 
THIS ROUTINE EVALUATES A SET OF NORMALIZED TAYLOR COEFFICIENTS 
TCOF (g+1) = (RCIRC#HY) & (J-TH DERIVATIVE OF CFUN(7) AT Z=ZETA) 
DIVIDEN BY FACTORIAL (J) woe J & Oels2edeeeNMAX—le 
TO A UNIFORM ABSOLUTE ACCURACY ##EPEST## USING FUNCTION 
VALUFS OF CFUN(Z) AT POINTS IN THE COMPLEX PLANE LYING ON 
THE CIRCLE OF RADIUS ##RCIkC## WITH CENTER AT 2 = ZETA. 


ae THEORETICAL RESTRICTIONS ## 
RCIRC MUST BE SMALLER THAN THE RADIUS OF CONVERGENCE OF 
THE 'TAYLOR SERTFES. THE PROBLEM HAS TO BE REFORMUL ATED 
SHOULN CFUN(Z) HAPPEN TO BE AN OOD FUNCTION OF (Z = ZETA)» 
THAT 1S IF THE RELATIUN ##=CFUN (= (2-ZETA)) =CFUN(Z=ZETA) #® 
1S AN IOENTITY. . 


ta REQUEREMENTS FOR CALLING PROGRAM ## 
CALLING PROGRAM MUST CONTAIN CONTROL STATEMENTS DESCRIBED 
IN NOTES (3) AND (4) HELOW. [TT MUST ALSO ASSIGN VALUES 10 
INPUT PARAMETERS. THE ROUTINE REQUIRES TWO SUBPROGRAMSs 
CFCOF (LISTED AFTER ENTCAF) AND CFUN (SEE NOTE(4) BELOW)» 


#eTNPUT PARAMETERS## 


(1) CFUN NAME OF COMPLEX FUNCTION SUBPROGRAM. u 

(2) 7ETA COMPLEX POINT ABOUT WHICH TAYLOR EXPANSION 
1S REQUIRED. 

(3) RCIRC PRANTUS (REAL) 

(4) FPREQ THE ABSOLUTE ACCURACY (REAL) TO WHICH THE 


NORMALIZED TAYLOR COEFFICIENTSs TCOF (J) ARE REQUIRED 
(S) FPMACH THE MACHINE ACCURACY PARAMETER (REAL? (OR AN 
UPPER BOUM) ON THE RELATIVE ACCURACY OF 
QUANTITIES LIKELY TO RE ENCOUNTERED) » 
PHYSICAL UPPER LIMIT ON fHE SIZE AND LENGTH OF 
THE CALCULATION. THE MAXIMUM NUMBER OF 
COEFFICIENTS CALCULATED WILL BE THAT POWER OF 
TwO LESs 1qAN OR FQUAL TO NMAXs. NMAX IS 
ASSUME) TO BE AT LEAST 46 (SEE NOTE(3) BELOW.) 
2GEe0 THE KOUTINE WILL DO AS WELL AS ITT CAN. 
eLTe0 THE ROUTINE WILL ABORT AT AN EARLY 
STAGE [F THE RFQUIRED ACCURACY CANNOT BE 
ATTAINEN BECAUSE OF ROUND OFF ERROR. 
IN NORMAL PUNNINGs NTAB SHOULD BE SET TO ZERO 
BEFORE THE FIRST CALL TO ENTCAF se AUT LEFT ALONE 
AFTER THAT. (FOR MORE SOPHISTICATED USFe SEE 
OUTPUT PARAMETERS (12) 4NN (13) AND NOTE (2) 
HELOW.) 


{6) NMAX 


(7) NCODF 


(12) NTAB 


te QUTPUT PARAMETERS a 

(1) 9t2)003)5 (40005) e(6) IDENTICAL WITH INPUT VALUES. 

(7) NCODE RESULT STATUS [NUTCATOR. TAKES ONE OF FIVE 

VALUES AS FOLLOWS 

fel. CONVERGED NORMALLY. 

=<le DID NOT CONVERGE. NO ROUND OFF ERROR TROUBLE 

=4+?. CONVENGEDs BUT WITH A HIGHER TOLERANCE SET 

WY THE ROUND OFF LEVEL. (EPEST «G1 .sEPREQ) 

=-2. DIN NOT CONVERGE IN SPETE OF HIGHER 

TOLFRANCE SET RY ROUND OFF LEVEL. 

= O% FUG WAS ABORTED RECAUSE EPREQ IS 

UNATTATNARLE DUE TO ROUND OFF LEVEL AND INPUT 

NCODE [5S MEGATIVE. 

FSTIMATE OF ACTUAL UNIFORM ABSOLUTE ACCURACY 

IN ALL TCOF. FXCEPT 1F NCODE.EQ.0 ESTIMATE OF 

ROUND OFF LEVEL. 

NUMBER OF NONTRIVIAL VALUES OF TCOF ACTUALLY 

CALCULATED, THEY ARE BASED ON NICOF+#1] CALLS 

OF CFUN. 

COMPLEX DIMENSION (DIM). APPROXIMATIONS [O 

THE NORMALIZED TAYLOR CORFFICIENTSs EXCEPT WHEN 

OUTPUT NCODE = Ue (SEF NOTE C3) BELOWs) 

INTERNAL WORKING AREA OF COMPLEX DIMENSION (OIM). 

(SEE NOTE (CS) RELOW). CONTENTS IS IDENTECAL 

WITH THAT OF TCOF, 

(12) €xXPTAB COMPLEX DIMENSION (UIM/2),. (SEE NOTES (2) AND 
(1) BELOW.) FXPTAR(J+1) = CEXP(PIHEYE#J/NTAS) 
J = teles2recesNIAR~]. (A HALF CYCLE) 
UTHER LOCATIONS ARE EMPTY. 


(8) FPFST 


(9) NTCOF 


(10) TCOF 


(11) wARK 


ae NOTES ON INPUTZOUTPUT PARAMETERS ## 

NOTE ¢1)@# NCODE IS USED BOTH: AS INPUT AND OUTPUT PARAMETER. 
NORMaLLY Ti RETAINS THE VALUE +1 AND NEED NOT BE RESET 
BETWEFN NORMAL RUNS. 

NOTE (2) #® THE APPEARANCE OF NTAB 4ND EX#TAK IN THE CALLING 
SFQUENCE aLLOwS THE USFR TO MAKE USE OF - UR TO PRFCOMPUTE =< 
THESF NUMBERS IN ANOTHER PAPT OF THE PROGRAM SHOULD KE 
SO OFSTRE. NT4H MUST BE A POWER OF TWO OR 0. 

NOTE (a) 4% THE APPEARANCE OF NMAXe TCOF, wORKe AND EXPTAB 
IN THE CALLING SEQUENCE ALLOWS THE SCOPE OF THE SUBPRUGRAM 
AND THE AMOUNT OF STORAGE TO HE ASSIGNED BY THE CALLING 
PROAGRAMs WHICH SHOULD CONTAIN A CONFROL STATEMENT TO THE 
FOLLOWING EFFECI 
COMPLEX TCOF (OTM) © WORK (UTM) «© EXPTAB(DIMZ2) 

WHERE OIM 1S NORMALLY A POWFR OF TWO. NMAX ITS NORMALLY 
EQUA) TO DiMe HUT MAY dE LESS THAN VIM. 

NOTE (4) 4 CFUN(Z) ITS A USER PROVIDED COMPLEX VALUED 
FUNCTION SUHPROGRAM WITH A COMPLEX VALUED ARGUMENT. THE 
CALLING PPOGRAM MUST CONTAIN CONTROL STATEMENTS AS FOLLOWS 
EXTERMAL CRUN 
COMP] FX CFIIN 


ae HOOKREFP TING PARAMETERS FOR STAGE ONE #4 
NCONY 1 CONVERGENCE ACHIEVED. 
“1 MO CONVERGENCE ACHIEVED. 


NROUND { NO ROUND OFF TROUBLE OHSERVED. 
P ROUND OFF TROUKLFE OHSFRVED. 
NABORT 0 UPDATE TOLERANCE AND CONTINUE ON APPEARANCE OF 
ROUND OFF TROQUHLE, 
1 TERMINATE wHEN ROUND OFF TROUBLE OBSERVED. 
FKACT Trt EXACT VALUE GF TCOF CL) WHICH IS CFUN(Z7ETA). 
SAFETY THIS TS A SAFETY FACTOR RY WHICH THE ROUTINE AVOIDS 


THE ROUND OFF LEVEL. IT FS SET TO 10.20 AND APPEARS 
ONLY IN THE COMBINATION (SAFETY#EPMACH). TO oLTER THIS 
FACTOR. OR TO REMOVE THE ROUND OFF ERROR GUARA 
COMPLETELYs THE USER WEFD ONLY ANYUST THE INPUT 
PARAMFIER EPMACH APPROPPTATELY. 


NADBAADOMADDADDADADADAADAANBDAADA NAMA AN ADD ADADADADANDAADAADAAABAANDAADAAAAAAAAADIAAAAAAA 
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ve QUANTITIES CALCULATED IN STAGE THREE(A) ## 
THIS IS THE FIRST PART OF ITERATION NUMBER NTCOF. PRESENTLY 
AVAILABLE ARE EXPTAB(J+1) = CEXP(PIFEYE*J/NTAB) « 
J = Nelo2eeveNTAK=1e 
WE RFQUIRE THE SEQUENCE 
J= Le 39Seeee (NICOF 2-1). 
IF (NTCOF «LE «?#NTAB) THESE NUMBERS ARE ALREADY AVAILABLE 
IN THE EXPTAB TARLE SPACED AT AN INTERVAL 2*NSPACE = 4*NTAB/NTCOF. 
OTHFRWISE« NTCOF = 4®NTAB AND THE EXPTAB TABLE IS UPDATED. 
THIS INVOLVES REARRANGING THE NTAB VALUES AVATILARLE» 
CALCULATING ANO STORING NTAB NEW VALUES AND UPDATING 
NTABR TO 2#NTAHe 


CEXP (PIHE YER I/NTCOF /2) 06 


we QUANFITIES CALCULATED IN STAGE THREE (B) #* 
ITERATIONS ARF NUMBERED 4eBsl6eeee AT THE ENN OF 
ITERATION NUMBER NICOFs THE NTCOF COMPLEX FUNCTION 
VALUES AT ABCISSAS REGULARLY SPACED ON CIRCLE ARE STORED 
IN THE TCOF VECTOR AS FOLLOWS ‘ 
TCOF (J#1) = CFUN(Z(S))  JEOeLe2eeeeeNTCUFM) 
WHERE, 
7(3) = ZEPA + RCIRC#HCFAP (Z#P TSE YE*I/NTCOF) 
THIS INVLOVES A REARRANGEMENT OF THE NTCOF/2 FUNCTION 
VALUFS AVAILABLE AT THE STAPT OF THE ITERATION AND THE 
CALCULATION OF A FURTHER NTCOF/2 FUNCTION VALUES. IN 
ADDITION FMAX AND APPROX ARE CALCULATED. THESE ARE 
FMAX MAXIMUM MODULUS QF THE FUNCTION VALUES SO FAR 
ENCOUNTERED. 
APPROX AN APPROXIMATION TO TCOF (1) BASED ON THESE 
FUNCTION VALUES. 


ee QUANTITIES CALCULATED AT STAGE THREE (C) ## 
FRROR] CURRENT VALUE OF THE ERROK = CABS (APPROX~EXACT ) « 
ERROR?. ERROR3+ ERRORS VALUES OF FRROR AT END OF THREE 
PREVIOUS ITFRATIONS. 
EPMACH MACHINE ACCURACY PARAMETER. (INPUT PARAMETER) 


EPREQ REQUIRED ACCURACY. (INPUT PARAMETER) 
EPRO HIGHEST ACCURACY REASONABLY ATTAINABLE IN VIEW OF 
THE SIZE OF THE FUNCTION VALUES SO. FAR ENCOUNTERED. 
(=10.0*EPMACHFPFMAX) 
EPCOF CURRENTLY REQUIRED ACCURACY (2AMAX1(EPREQsE PRO) )« 
EPEST ESTIMATE OF CURRENT ACCURACY. (TRE MAXIMUM OF EPRO AND 


A FUNCTION OF ERRORS 1293 AND 4. (OUTPUT PARAMETER) 


te CONVERGENCE AND) TERMINATION CHECKS IN STAGE FHREE(C) #% 
(1) USES FM4x TO RAISE EPCOF ABOVE ROUND OFF LEVEL. 
IF THIS NECESSARY ANDO THE INPUT VALUE OF NCODE IS NEGATIVEs 
IT TEYMINATES SETTING NCODE=06 
(2) USFS APPROX TO EVALUATF CONVERGENCE OF TCOF (1) TOWARDS 
EXACT. IT MAY ASSIGN CONVERGENCE AND GO TO STAGE FOUR (A) 
SETTING NCODE=+1 OR +2. (CONVERGENCE IS NOT CHECKED FOR 
FOUR OR FEWER POINTS). 
(3) USFS NMAX TO CHECK PHYSICAL LIMIT. IF THIS HAS BEEN 
REACHEDs IT GOES TO STAGE FOUR(A) SETTING NCODES-1 OR -e. 
(4) OTHERWISE CONTINUES NEXT [TERATION BY GOING TO STAGE 
THREE. 


#eCALCULATION OF FIRST NTCOF TAYLOR COEFFICIENTS IN STAGE FOUR(A) 
A VEpSION UF THE FAST FOURTER TRANSFORM USING A WORK ARRAY 

IS USED. THE ARRAY ##yORRe# TS USED ONLY DURING THIS STAGE. 

THE WORK ARWAY ALLOWS THE PERMUTING OF INDICES ASSOCIATED 

WITH TN=PLACE FFIS TO GE SUPPRESSED» TRE FFT CALCULATES 

THE NFECCESSARY SUMMATIONS EXCEPT FOR OIVIOLNG BY NTCOFe 


e8SETTING OF REMAINING TAYI.OK COEFFICIENTS IN STAGE FOUR(B) 
THE CONVERGENCE CRITERION ALLOWS US TO INFER THAT THE 
NORMALIZED TAYLOR COEFFICIENTS OF ORUER GREATER THAN NTCOF 
ARF 7F£RO TO ACCURACY FPEST. 
THEY ARE EVALUATED 4S BEING FXACTILY ZERO- 
COMPLEX CFUN 
COMPLEX ZETA 
REAL RCIRCeEPREQsEPMACHeEPEST 
INTEGER NMAX es NCONE sNTCOF eNTAS 
COMPLEX TCOF (1) s WORK (1) « EXPTAB (1) 
INTEGER NABORT eNCONV eND ISP eNDOL IM yNPREV *NROUND sNSPACE 
PFAL COSD LF sEPCOF tEPM INS FPHO sf P32sEP42¢ERRORI oFRROR2 
REAL FRROR3sERRORG 9 FMAX es SAFETY 9 SCALE se TWOPI 
COMPLEX APPROX e@E XACT eF VAL eREXP 9 SUM9ZVAL 
INTEGER Je JCONJe JFROMs STARS ITO 
COMPLEX CHPLXeCONIG 
biel STAGE ONE sab tt 
INITIALISE BOOKKEEPING PARAMETERS AND EXACT VALUE OF TCOF (1). 
NROUNG = 1 
NaRORT = 0 
IF (NCOUE«LT.0) NABORT = } 
FPCOF = EPREU 
SAFETY = 19.U 
7vAL = ZETA 
FyAL = CFUNCZVAL) 
FXACT = FVAL 
ae STAGF Twn a ete 
FIRST TWO TIERATIONS ( THOSE WITH NICOF = 1e2 De 
FRRORZ = 0.0 
7VAL = ZETA+CMPLX (RCIRCE0.0) 
FVAL = CFUN(ZVAL) 
APPROX = FVAL 
FMAX = CAKS(FVAL) 
TCGF(]) = FVay 
FRROR?Z = CARS (APPROXEXACT) 
7vAL = ZETA-CMPLX(RCIRC10.0) 
Fyal = CFUMN(ZVAL) 
APPROX = 0.5% (APPROX+FVAL ) 
FeaaxX = AMAX] (F MAX*+CAKS (CF VAL)? 
TOOF (2) = FVAL 
FRRORL = CAHS (APPROK=EXACT) 
NTCOF = 2 
ahah STAGE THREE oh te th 
COMMFNCE ITERATION NUMBER NTCOF. 
300 CONTINUE. 
NPREV = NTCOF 
NICOF = 2#NTCOF 
ths STAGE THREE (A) stab tt 


UPDATF EXPTAB TAHLE ITF NECKSSARY. 
TF (2#NTAR.GE.NICOF) OO TO 340 
IF (NT45.6F 22) GO 10 319 


FXPTAR(L) = (be099.0) 
FXPPAdG(2) = (0.001.0) 
NTAB = 2 


GO TO 344 
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Aaa 


Oo5 


HAADANDAAGD 


310 CONTINUE 


NOOLIM = NTAH~1 
No 320 J = LeNDOLIM 
JFROM = NPAAR#J 
JTO = 24JFROM 
FXPTAB(ITO*¢L) = FXPTAB(JFROM+1 
320 CONTINUE 
NTAB = 2#NTAK 
TWOP] = HeOFATAN(1 69) 
CASOIF = COS CI WOPEZFLOAT (2¥NTAH) ) 
NDOLIM = NTAB=3 
N4 330 J = LeNDOLIMs2 
PXPTAH(J+1) = (OOF XPTAB(S) ©0-S#EXPTAR (J*2)) /COSOIF 
330 CONTINUE 


FXETAHI(NTAB) = 


(0 eS#EXPTAR (NTAH=1)= (0659020) )/COSOIF 


340 CONTINUE 


+O 


STAGF THREE (H) 


att 


Pee 


UPDATF LIST OF FUNCTION 


VALUES IN TCOFs 


CALCHILATE FMAX ANI) APPROX. 


NOOQLIM = NPREV-1 

Nn 350 JS = LeNUOLIM 
JFROM = NPREV=J 
JTO = 24 FROM 


rCOF(JTO+1) = 


TCOF CUFROM+1) 


350 CONTINUE 


SuM = (0.00000) 
NSPACE = (2#NTAH) /NTCOF 
09 360 J = LeNDOLIMed 
JTAB = J#NSPACE 
PEXP = RCIRC#EXPPAK(ITABEL) 
7VAL = ZETA+REXP 
FVAL = CFUN(/VAL) 
SUM = SUM*F VAL 
FMAX = AMAX] (FMAXeCABS(FVAL)? 
TCOF (J+1) = FVAL 
JCONJ = NTCOF-J 
7ZVAL = ZETA CONJG(REXP) 
FVAL = CFUN(7VAL) 
SUM = SUM+FVAL 
FMAX = AMAXL(FMAX+CASBS(FVAL)) 


TCOF CUCONJS+1L) = 


FVAL 


369 CONTINUE 


APPROX 


STAGE THREE (C) 


= 0.54 APPROX*+SUM/ELOAT (NTCOF) 
eet 


C CONVERGENCE AND TERMINATION CHECK. 


ERRORS 
FRRORG 
FRROR2 
FRRORY 
FRRO = 


IF (EPRO.I TeEPCOF) 


FPCOF 
NROUND = 2 


TF (NABORT.EQ.0) 


CODE 
FPEST 
60 TO 470 


470 


FRRORG 
FREORS 
Fpa2 = 
Fe 32 = 
FoOMIN 
FPEST 


TE CRPEST.GT.EPCOF) GO 


NCOM 


v 


FRRORY 

ERROR? 

ERRORL 

CABS (APPROX-EXACT) 
FMAKHSAFETY#EPMACH 

60 TO 3706 


uuetu 


= FERRO 


GO TO 370 
= 9 
= FPR 


CONT TNUE 
TF (NTCOPF Lb 4) 


GQ TO 389 
= AYAXL CERRORG E PRO) 
= ASAXT (ERROR E PRO) 
ERRORZS ( (FRRORE/ERRORSG ) 24 (4.0/7320)) 
BRRORZ*® ((ERRORZ/ERROKI) #42) 
AMTIN] CERROR2Z EP 32eEP4P) 
AMAX1 (FRROR I sEPMINeEPRO) 
TO 300 
= 1 


50 TO 409 


NCONV 


CALCULATION OF FIRST NTCOF 


STAGE FOUR(A) 


CONT ENUE 
IF (2#NICOF eLESNMAX) GO 


To 306 
= <1 


abt 


TAYLOR COLFFICIENTS USING FoF ele 


400 CONTINUE 


NCODE 
NO}SP 
4.10 
NDTSP 


Carl CFCOF 
TF (NOISP.GT.L) 


NM 420 


rCOF (J) = 
CONTINGE 


4en 
aa TO 


NOTSP 


Catt CRCOF 


NCONV#NROUND 
“TCUOF 


CONTINUE 


NUTSP/2 

CNTCOF «ND ESP e TCOF eWORK eNTAHS EAP FAB) 
GO TO 430 

4 = LeNTCOF 

WORK (J) 


440 


Con TINguk 


NOTSP 7? 
(NT COF eND TSP e WORK» TCOF sNTAHSEAPT AB) 


TF (NDISP.6GI.1) GO TO 410 
440 CONT Tue 
SCALE = LeO/FLOAT CNTCOF) 
Ho 450 J = leNTCOF 
TCOF (J) = TCOF (J) #SCALE 
WORK (J) = TCOF CU) 
450 CONTINU 
init STAGE FOUR (rt) th te tt 
SETTIWG OF KEMAIN IG TAYLOe COEFFICIENTS. 
TE (NTCUF 666 oNHAKY GO TO 470 
NOOLIM = NICOF4) 
NO 460 J = NDOL TM ewMax 
TCOR (CJ) = (0.900029) 
HORK CID = (0400020) 
460 COMTINUE 
4/0 CONTINUE 
RF TURN 
END OF ENTC4F 
Fan 
SuUePROUTINE CRCOF ( MTCORe NO[SP 5 TCOFs WORK. NTABs EXPTAR 


ete COMPLEX FOWOTER 

se GE MER AL, 
THIS QUT ENE. 
THE TNDEXEIMG ES ARRANGE SO 
ORDER AT THe Fifi? GF THE ft 4ST PASS, 


THE 
PARTI A! 
FACH PASS. 


USF OF 


RESULTS. 


COEFFICIENTS ## 

PURPOSE % 

NOES ONE PASS OF A FAST FOURTER TRANSFORM. 
THAT (HE COFFFICIENTS 4PE IN 
THIS INDEAING RFQUIRES 
ARRAYS FOR INPUT ANY) OUTPUT OF TRE 

1S CALLFU ONCE FOR 


SF PAP ATE 
THIS ROUTINE 


) 


1) 
(2) 
(3) 
(5) 
(A) 


(4) 


AND 


aAAgAANNANNNAANANNnAANANANNANANAANANAAANNA 


100 
200 


Cc END 
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ws INPUT PARAMETERS tH 


NTCOF NUMBER OF COEFFICIFNTS TO Ht PROCESSED. 
MDISP MAXIMUM VALUE OF LISPLACEMENT INDEX. 
TCOF (COMPLEX) [INPUT ARRAY. 
NT AH MU“BERP OF ENTRIES IW EXPTABs 
FXPTAH (CUMPLEX) TABLE OF VALUES OF COMPLE EXPONENTIAL « 
tXPTARC HEL) = CeXbk (PTFE YE#I/SNTAB) © 
JS = DelebrecesNIAK~L, 
+¢ OUTPUT PARAAFTERS #4 
WORK (COMPLEX) QUTPUT ARRAY. 
tess INDEXING OF ARRAYS ## 


THE TwO POINT FUURTER TRANSFORM 

OF TCOF WITH LNWOICES 
(CUNT SP) #NPRE V+ JREPL 

THF SFSULTS ARE MODIFIE) BY THE APPROPRIATE TwIDDLE FACTOR 


STOREO 


(JD TSP = 1b) eNME XT + JREPL 
WHERE 


NNTSP 
NPQEV 
ae XT 
MHALE 
JNTISP 
JPEPL 


THE TvINOLE FACTOR CEXP(-PIFEVYERI/NDISP) 9 
TS ORTALINE!) RY TAKING 
EVERY NSPACLENTAKR/NDISP OF EXPTAt, 


INTEGER 
COMPLEX 
COMPLEX 
COMPLEX 
TNTEGER 
INTEGER 
NALF 


1S APPLIED TO THE POINTS 


AND (JO1SP=1) *NPREV*JREPL + NHALF 


(N WORK WITH INUICES 


AND (JO]SP-1) #NNEXT ¢JREPL +NPREV 


PRODUCT OF 
PRODUCT OF 


REMAINING FACTORS« 

PREVIOUS FACTORS. 

PRODUCT OF PREVIOUS AND CURRENT FACTORS. 
PRODUCT OF PREVIOUS AND REMAINING FACTORS. 
VISPLACEMENT INDEX = LeQeeeeNDISPe 
REPLICATION INDEX = Lle2aeeeNPREVe 


J=0e1e99ND1SP=1 
THE CONJUGATE OF ELEMENTS SPACED 


NTCOF sNOISPeNTAY 
TCOF (1) ¢ WORK (1) 0 EXPTAR (1) 
CONJG 


ROTs 20021 

Je JDIGP s+ JREPL eJTAR OUT eo Uw 

KTOsKT] eK WO oK Wi eNHALE oNNEXT oe NPREV os NSPACE 
NTCOF/2 


NPRPEV = NTICOF/(2#NDTSP) 
NeikXT = NTCOF/NDISP 
NSPACE = NTAK/NDISP 
Ng 200 JDTSP = LeNDTSP 
Jo = JUTSP-1 
JTAB = J#NSPACE 
POT = CONJG CF XPTAH (UTAREL)! 
To = JHNPREYV 
Jw = JHNNEAT 
NO 100 JREPL = 1eNPREV 
KfO = JIT+JREPI 
KT] = KT0+NHALF 
KWo = JweJREPL 
Kwil = KwU+#NPRE Vv 
ZO = TCOF (KTO) 
74 = TCOF (KT1) 
WORK (KWwO) = 70471 


woR 


K(Kwl) = (270-21) 4ROT 


CONT TINUE 
CONT ENUE. 


RE TURN 


OF CFCOF 


Emn 


COLLECTED ALGORITHMS FROM CACM 


Key Words and Phrases: approximation, Chebyshev 
approximation, Remez algorithm 
CR Categories: 5.13 


The second algorithm of Remez can be used to compute the 
minimax approximation to a function, f(x), by a linear combination 
of functions, {Q;(x)}¢, which form a Chebyshev system. The only 
restriction on the function to be approximated is that it be continuous 
on a finite interval [a,b]. An Algol 60 procedure is given, which will 
accomplish the approximation. This implementation of the second 
algorithm of Remez is quite general in that the continuity of {(x) is 
all that is required whereas previous implementations have required 
differentiability, that the end points of the interval be “critical 
points,”’ and that the number of “critical points’’ be exactly 7 -+ 2. 
Discussion of the method used and of its numerical properties is given 
as well as some computational examples of the use of the algorithm. 
The use of orthogonal polynomials (which change at each iteration) as 
the Chebyshev system is also discussed. 


Description 

1. Introduction. Given a Chebyshev system, go(x), ¢1(x), ..., 
n(x), we define the Chebyshev or minimax approximation to a 
continuous function f(x) over an interval [a, 6] to be the function 


Pr(x) = Copo(x) + +++ + Cnpn(x), (1.1) 

such that e is minimized, where 

e = max | f(x) — Pa(x) |. (1.2) 
a<ar<b 


If o:(x) = x*, we have the minimax polynomial approximation of 
degree n to f(x). If g:(x) = T; (x), where T;(x) denotes the Cheby- 
shev polynomial of the first kind of order i, we have the minimax 
approximation as a sum of Chebyshev polynomials. For the defini- 
tion of a Chebyshev system, see Achieser [3, p. 73]. 
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Algorithm 414 


Chebyshev Approximation of 
Continuous Functions by a 
Chebyshev System of 
Functions [E2] 


G.H. Golub and L.B. Smith* (Recd. Oct. 11, 1967, 
Jan. 27, 1969, and Apr. 11, 1970) Dept. of Computer 
Science, Stanford University, Stanford CA 94305 


The algorithm presented here computes the coefficients c, , 
i=0,1,..., 2, in (1.1) for any given Chebyshev system ¢,(x), 
i = 0,1, ..., 2. The algorithm is based on the second algorithm 
of Remez [1], and also makes use of the exchange method de- 
scribed by Stiefel [2]. 

The characterization of the error curve, given by 


(x) = Bi civi(x) — 02), (1.3) 


is the basis for the second algorithm of Remez. It is shown, for 
example, by Rice [11, p. 56] that p,.*(x) = Di iro cigs (x) is the 
Chebyshev approximation to f(x) on (a, b] if and only if there exists 
a set of points a< x9 < x1 << x2< +++ < Xny1 SB such that 


(a) €(x:41) = —e(x:), 
(b) | e(x:) | = &*, and 
(c) max | e(x)| = é. 

a<cz<b 
Thus, when the computed error curve attains this ‘‘equal ripple’ 
character with at least n + 1 sign changes in [a,b] we know we 
have the desired minimax approximation. 

The second algorithm of Remez, based on the characterization, 
can be outlined in three steps. 


(i) Choose an initial set of points, the reference set, a < xy < 
x< ree Ss Kasim b. 

(ii) Compute the discrete Chebyshev approximation to f(x) on 
the reference set. 

(iii) Adjust the points of the reference set to be the extrema of the 
error curve (1.3). 


Steps (ii) and (iii) are repeated until convergence is obtained. 

Proof of the existence of the minimax polynomial (given by 
(1.1) and (1.2) with {¢;}0, a Chebyshev system) is given by Achie- 
ser [3, p. 74]. 

Proof that the second algorithm of Remez converges for any 
starting values for the critical points is given by Novodvorskii and 
Pinsker [4]. If f(x) is differentiable, Veidinger [12] proves that the 
convergence is quadratic. That is 


e* — &) = O(e* — cD)? ask > ~, 


where ¢* is the maximum error for the Chebyshev approximation 
and e) is the maximum error at the Ath iteration. A survey article 
concerned with minimax approximations is given by Fraser [8}. 

2. Applicability. The algorithm presented herein has wide ap- 
plicability in that it can be used to approximate any continuous 
function given on an arbitrary closed interval. In addition, the 


COLLECTED ALGORITHMS (cont.) 


approximating function is not restricted to polynomials or Cheby- 
shev polynomials, but is allowed to be any linear Chebyshev system, 
to be supplied by the user. Three simplifying assumptions often 
made in an implementation of the second algorithm of Remez are: 


(a) Differentiability of f(x), the function to be approximated. 
(see [6], for example) 

(b) The end points of the interval are critical points (see [8, p. 
299}). 

(c) The existence of exactly n + 2 points of extreme value on the 
error curve (see [8, p. 299]). 


None of these three assumptions is made for this algorithm. 

3a. Formal parameter list: input to the procedure 

n integer degree of the Chebyshev system of functions to be 
used in the fit {yo(x), ei(x), +++ , @n(x)}. 

a lower end point of the interval of approximation, of type 
real. 

6 upper end point of the interval of approximation, of type 
real. 

kstart integer controlling the number of points 
(kstartX(n-+2)) used in the initial approximation. See (i) in Sec- 
tion 5. 

kmax integer allowing control of the number of times &k is 
increased above kstart. 

loops integer allowing control over the number of iterations 
taken by Remez’s second algorithm if convergence is not yet 
attained. 

f areal procedure to compute the function f(x) to be ap- 
proximated; procedure heading required: 


real procedure f(x); 
value x; 
real x; 


the argument is the untransformed variable x. f(x) must be con- 
tinuous in the interval [a, 5]. 

chebyshev a procedure to evaluate the Chebyshev system of 
functions being used at some point, x, in the interval [a, 5]; pro- 
cedure heading required: 


procedure chebyshev(n, x, 1); 
value n, x; 

integer 7; 

real x; 

real array f; 


n is the degree of the system, x is the point in {a, 5], and ¢ is an 
array that will contain the values ¢[i] = 9;(x), i = 0,1,..., 2. 

eps a real procedure to compute the error curve given by 
(5.1); procedure heading required: 


real procedure eps(x, c, 7); 
value x, 71; - 

real x; 

integer n; 

real array ¢; 


x is a point in [a, 5], n is the degree of the system, and c is an array 
containing the coefficients of the approximation, c[i] = c; in (5.1). 
exchange a procedure, [10] for example, to locate the n + 2 
subset of m + 1 given points which determine the minimax poly- 
nomial on those m+ 1 points; procedure heading required: 


procedure exchange (a,d,c,m,n,refset,emax,singular,r) ; 
value mn; integer m,n; real emax; 

real array a,d,c,r; 

integer array refset; 

label singular; 


aisarealm +1 byn + 1 array, d is a m + 1 component vector, 
c is an + 2 component vector, m+ 1 is the integer number of 
points (xo, ... , Xm), 2 is the degree of the system, refset isan + 2 
component integer vector, emax is a real number and singular is a 
label. r is a vector containing the m + 1 values of the residual 
at the m + 1 points under consideration. On entry the components 
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of a and d are 
ali,j] = 9j(x;) and 
di] = f(x:), i= 0()m, 


Upon exit from exchange, the array c contains the coefficients of 
the minimax function found, refset contains the subscripts identify- 
ing the points used to compute the minimax function, i.e. the refer- 
ence set, and emax contains the value of the maximum deviation 
of the minimax function from f(x) on the points x; , i = O(1)m. 

3b. Formal parameter list: output from the procedure 

c the array of coefficients c; of eq. (5.1). 

emax the maximum modulus of the error curve (5.1) for the 
final approximation function, of type real. 

trouble a label to which control is transferred if remez does 
not converge properly. 

why an integer whose value on exit will be set to one of the 
following: 


j = O(1)n. 


why = —1 if number of added points is greater than n. (See step 
(ii) in Section 5.) 

why = | if trouble occurs in procedure quadraticmax. 

why = 2 if trouble occurs in procedure exchange. 

why = 3 if no convergence after iterating /oops times. 

why = 4 converged according to the maximum and minimum 
residual comparison. 

why = 5 converged according to wh 
test. 

why = 6 converged according to why = 4 and the Coefficient test. 

why = 7 converged according to why = 4 and both the critical 
point and the coefficient tests. 

why = 8 converged according to critical point test only. 

why = 9 converged according to coefficient test only. 

why = 10 converged according to critical point and coe ficient 
tests. 


li 


== 4 and the critical point 


4. Organization and notational details. The algorithm calls 
for three procedures, in addition to the function f(x) to be approxi- 
mated, as indicated by the formal parameter list. 

exchange Based on Stiefel’s: Exchange algorithm, which finds 
the n + 2 subset of m + 1 given points which determine the mini- 
max polynomial. Use [10], for example. 

eps To be supplied by user: eps computes the error curve 


e(x) = 2a, ci (x) — f(x] (4.1) 
where the c; , i = 0,..., m, are parameters and the ¢;(x), i = 0, 
1,..., a, are the Chebyshev system of functions being used to 


fit the function f(x). For best results e(x) should be computed in 
double precision and then rounded to single precision accuracy. 
If f(x) cannot be calculated easily or efficiently in double precision 
at least the sum, ) > i=o c:g;(x), should be accumulated in double 
precision and rounded to single. 

chebyshev To be supplied by user: chebyshev evaluates the 
Chebyshev system ¢;(x), i = 0, 1,..., for a given argument x. 
chebyshey is called by eps. 

The functions e«(x) and y;(x) (computed by eps and chebyshev) 
can often be computed by simple recursive procedures. For ex- 
ample, if the Chebyshev system used is the set of Chebyshev poly- 
nomials, there is a well-known recurrence relation (y:41(x) = 
2xg:(x)—¢;-1) that can be used to efficiently evaluate the required 
functions. 

An outline of the organization of the algorithm is given in the 
following steps: 

(i) Let m =k X (n+2), take m+ 1 points in the interval 
{a,b] and use exchange to determine the ‘best’? polynomial (i.e. 
the 


nr 


c:; > max | >> evex(xj) — f(x;) | = minimum) 
0<j<n i=0 


on those points. Exchange will pick n + 2 of the original points as 
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critical points. The m + 1 points are chosen equally spaced or as 
the zeros of Tn—i(x) — Tn-3(x) with k > 1. 
(ii) Use the n + 2 points chosen by exchange in step (i) and v 
other local extrema (subject to the conditions discussed under 
Example 2, Section 6) as input to the procedure: guadraticmax 
(v>.0). 
(ili) Procedure quadraticmax adjusts the n + » + 2-critical points 
to be the abscissas of the extrema of the error curve given by (4.1). 
Section 5b gives a discussion of how the adjustments are com- 
puted. After adjustment the new points are tested for alternation 
of sign, and if the property has been lost, we increase k and go 
back to step (i). 
(iv) The adjusted critical points are then input to exchange which 
finds the new coefficients c; , i = 0, 1, ---+ , n for the “‘best’’ poly- 
nomial on the adjusted » + » + 2 points. 
(v) Now convergence tests can be applied to the coefficients c; , 
found in step (iv), to the critical points x;, i = 0, 1,---, and 
to the extreme values of (4.1). If not converged, go back to step 
(ili) since the previous critical points will not be the exact extreme 
points after the approximating polynomial is changed in step (iv). 
5a. Discussion of numerical properties and methods: accuracy 
and convergence. The accuracy of the approximations generated 
by this procedure is limited by the precision of the arithmetic used 
and the accuracy of the subsidiary procedures f, exchange, eps, 
and chebyshev. The use of double precision in eps, for example, 
can improve the results of remez since it will then have a‘‘ smoother” 
error curve to work on. This use of double precision in eps is 
strongly recommended by the authors. The maximum absolute 
error of the approximation is output from remez and depends, of 
course, on 1, the degree of approximation. 

The procedure is deemed to have converged when the coeffi- 
cients of the approximating function or the critical points have 
satisfied certain relative criteria between successive iterations. 
We use the notation c{”” to represent the ‘th coefficient at the mth 
iteration and similarly, x{”’ represents the ith critical point at the 
nth iteration. 


When 
max | cs = ok? | < epse| cS” | (5.1) 
¥% 
or 
max | xe = xe PLS epsx| x$” | (5.2) 
t 


. J 
we consider the procedure to have converged. If | on | or eal 


is very small the relative test is not appropriate. In that case we 
test for” — cf"? | and | x§ — x$""? | against allowed absolute 
errors, absepsc and absepsx. Typical values for the constants (for 
an 11-decimal place machine) could be 


epsc = 10-8 
epsx = 10-4 (5.3) 
absepsc =: 10-8 
“ib tenes == 10-4 


A third convergence criterion is the comparison of the maxi- 
mum and minimum magnitudes of the error curve at the critical 
points. Let 


maxr = max | e(x;”’) | 
t 


and 

minr = mix | e(xs””) | 
tu 

where {xy } are the critical points chosen at the nth iteration, and 

then make the following test. If maxr< rcompare & minr then 

claim convergence. A typical value for the constant rcompare 


could be 1.0000005. 
When the maximum absolute error approaches 10-*(f,), 
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where s is the number of places available in the machine, and /f, 
is MaXa<xr<b | F(x) |, we are approaching the limit of obtainable 
accuracy. We are working with 


(x) = P,,(x) od S(x) (5.4) 


so when e(x) is nearly equal to 107*f(x), we are losing about s places 
in the subtraction in (5.4). This is where judicious use of double 
precision can be made to increase accuracy if necessary. P,(x) 
can be computed in double precision and a single precision differ- 
ence formed, or for even further accuracy f(x), if possible, could be 
computed in double precision and the double precision difference 
taken. 

A comparison of the discrete approximation on a finite num- 
ber of points in an interval, and the continuous approximation 
which this algorithm finds, is studied by Rivlin and Cheney in 
[9]. Rice [11, pp. 66-70] discusses the question of convergence (and 
rate of convergence) of the discrete approximation to the continu- 
ous approximation. This relates to the question of how large to 
choose & in step (i), Section 4. We have found that for well-behaved 
functions like et on [—1,1] a value for & of about 3 gives good 
starting values. On the other hand a function like 1/(x--X) on 
[—1,1] with X > 1 and) near | requires k to be about 15 to obtain 
good starting values. The choice of & should be large enough so 
that the initial approximation chosen by the procedure exchange 
is close enough to the final approximation to insure that the ‘“‘alter- 
nation of sign” property is never lost during the iterations. There is 
no known method of choosing such a k a priori. This is why the 
algorithm tests for ‘‘alternation of signs’? at each iteration and in- 
creases k if the property is lost. 

5b. Discussion of numerical properties and methods: Locating 
the extrema of «(x). Most of the programming effort is involved 
in locating the extrema of the error function e(x). The programming 
is similar to that done by C.L. Lawson in a Fortran program to 
compute the best minimax approximation [7]. e(x) is given by 


(x) = 2, ces (2) — f(x). 


The procedure exchange then is used to compute the coefficie 1ts 
of the minimax function. That is, given n + v + 2 points, »> 0, 
exchange computes the coefficients of the function yo Ho Cie: (x) 
such that on the discrete set of points e(x;), 7 = 0,1,-::,n+vt+1 
has at. least » + 2 extreme values (at the given points) equal in 
magnitude and of alternating signs. The satisfaction of this condi- 
tion when the points are indeed the extrema of the continuous e(x) 
guarantees that }- j=. c.g;(x) is the unique minimax approximat- 
ing function that we seek. 

5b.1 Discussion of numerical properties and methods: Parabolic 
approximation to locate extremum. Given the initial guesses x; , 
i=0, 1,---,a+v+1 (at each iteration) for the abcissas 
of the extrema of. the error curve, we must locate these critical 
points more precisely. We consider two cases. First the interior 
points, and secondly the least and greatest of the initial guesses 
which may be equal to the respective end points of the interval on 
which the function is to be approximated. 

For interior points we do the following. Take 


u= x; 
v= Xi + a(Xi4i1—xi) (5.5) 
Ww = X; + a(xi_-1—x:) 


where a is a parameter 0< a < 1 (e.g. a=0.1). We then deter- 
mine the parabola through the three points e(u), e(v), and e(w). 
The abscissa, x*, corresponding to the vertex of this parabola is 
then taken as the next guess for the ith ‘critical point.” The point 
x* is given by 


« — LIW = weclw) + ( — wW)e(u) + (wt — W#)e(v)] 
2 [(u— v)e(w) + (v — whe(u) + (w — u)e(v)) 


(5.6) 


For computational purposes x* is not computed directly by (5.6) 
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since for u, v, and w very close, the denominator will be quite small. 
Therefore, the denominator of (5.6) is computed 


d= [(u—v)e(w) + (v—w)e(u) + (w—ue(v)], (5.7) 
and then by dividing out (5.6), we express x* as 

tu +yv) ifd=0 
soe (5.8) 


1(v — u)(u— w) [e(v) — e(w)] 


1 
= = ifd ~ 0. 
put +s 7 ifd # 


Once x* is computed, it is then tested to insure acceptability since 
for u, v, and w very close, machine roundoff may introduce spurious 
results. Also, the value of a or the nature of the function f(x) and 
therefore of e(x) may introduce an unacceptable value for x* in 
which case .u, v, or w, whichever has highest ordinate value, is used 
for x*. If x* is acceptable it can replace u, v, or w, whichever has the 
lowest (in abolute value) ordinate value on the error curve e(x), 
and a second x* is computed. This iteration will converge to the 
abcissa of the extremum near x; if roundoff is ignored and uy, v, 
and w are sufficiently close to that point. (Compare convergence 
to Muller’s method for solving algebraic equations [5].) However, 
this iteration need not be carried out excessively (2-4 iterations 
should be sufficient) since during each iteration of the overall 
process. we.recompute the approximating function and thereby ob- 
tain a new error curve whose extrema will not necessarily have the 
same abscissas. 

For the end points (5.5) cannot apply since x;,, and x;_, do 
not exist at the right and left ends respectively. Therefore we take, 
at the left end for example, 


u= Xi 
v= XX + alXi41—%X:) 

{xi + B(%i41—-X;) If x; =a 
. tee: + a(a—x,;) ifa< x;, 


(5.9) 


w 


with the requirement that a ¥ 8. The right end is handjed simi- 
larly. Again the parabola through the three points e(u), e(v), and 
e(w) is used to determine x*. The tests for acceptability and itera- 
tions are performed as they were for the interior points. 

5b.2 Discussion of numerical properties and methods: Crude 
search to locate extremum. In case approximation by parabola does 
not yield an acceptable value for the abscissa of an extremum, the 
following rather crude method works effectively. We simply divide 
the interval under consideration into / equal intervals (e.g. /=10) 
and examine the ordinate of the error curve at the end points of the 
intervals. The points to the left and right of the point with maxi- 
mum ordinate (in absolute value) then define a new interval upon 
which the process is repeated. This subdivision continues until the 
subintervals become smaller than some specified value (e.g. 10-5). 
The method causes the function to be evaluated more often than 
the parabolic approximation, but works successfully at a point 
where the error curve has a sharp cusp-like extremum. 

The choice of / = 10 in this crude search procedure is arbitrary. 
In fact, for an initial interval of length J, a smaller value, say / = 4, 
would reduce the subinterval size to 10-5-J with a minimum of 21 
function evaluations, whereas using / = 10 would require at least 
51 function evaluations. However, small values of / increase the 
chances of missing the true extremum. 

To decide whether to use this crude search or not we employ a 
relative test. Let the parabolic choice be x* and the three points 
used to compute x* be u, v, and w. Then one would expect (hope) 
that | «(x*) | > | e(u) |, | e(v) |, and | e(w) |, in which case x* has 
the desired properties. However, if ¢, = maxz=u,v,w | e(x) |, and 
| e(x*) | < en , then we must doubt the acceptability of x* and 
perhaps use the crude method to determine x*. We found a suc- 
cessful way to make this decision was to use the crude method if 
|| e(x*)| — em | > C-em, where C is an arbitrary constant (e.g. 
10~*). 


Fig. 1 


f(x) 
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Table I. Coefficients c; of ‘best’? polynomial 
P(x) = DE a0 c:T;(x) (to 6D) 


i Start 

0 1.266 063 
1 1.130 321 
2 0.271 495 
3 0.044 337 
4 0.005 523 


Iteration 1 
1.266 066 
1.130 318 


0.271 495 
0.044 336 


0.005 519 


Table II. Critical points, x;, of best polynomial (to 6D) 


Start 
—1.000 000 
—0.771 429 
—0.257 143 
0.314 286 
0.828 571 
1.000 000 
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Table III. Comparison of starting values x; for f(x) = e7,n = 4 


Iteration I 
—1.000 000 
—0.797 573 
—0.278 189 
0.339 805 
0.820 978 
1.000 000 
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11.0 
Iteration 2 Iteration 3 
1.266 066 1.266 066 
1.130 318 1.130 318 
0.271 495 0.271 495 
0.044 336 0.044 336 
0.005 519. 0.005 519 
Iteration 2 Iteration 3 
—1.000 000 —1.000 000 
—0.797 682 -—0.797 682 
—0.279 152 —0.279 152 
0.339 061 0.339 061 
0.820 536 0.820 536 
1.000 000 1.000 000 


(to 3D) 

j T3(x) — T3(x) exchange on exchangeon TRUE 
= Oor |7s(x)| 6(N+2) 201 points (computed) 
= 1 points equally 

equally spaced 
spaced 

0 — 1.000 — 1.000 -- 1.000 —1.000 

I — 0.809 —0.771 ~-0.800 —0.798 

2 —0.309 —0.257 -- 0.280 —0.279 

3 0.309 0.314 0.340 0.339 

4 0.809 0.829 0.820 0.821 

5 1.000 1.000 1.000 1.000 

Dmax 0.030 0.027 0.002 — 

Table IV. Critical points chosen at each iteration. 

Iteration The 2+2 points used (see Figure 3) 

Ist 1 3 4 7 8 9 10 I1 12 

2nd 1 3 6 7 8 9 10 it 12 

3rd 1 3 6 7 8 9 10 11 = «=12 
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6. Examples. The procedure was tested on the Burroughs B5500 
at the Stanford Computation Center using Burroughs Extended 
Algol. 

We have chosen two examples to illustrate the use of the al- 
gorithm. The first is the function 


fi(x) = e? on [—1,1] (6.1) 
and the second is 
A(x) =1L+x, -10<x< -05 
= —xX, -0.5<5 x< 0.0 (6.2) 
a 2 0.0< x < 1.0. 


The first example, fi(x), is an infinitely differentiable function so 
that the error curve (4.1) is also differentiable, whereas f2(x) (see 
Figure 1) is continuous, but its derivative, /2’(x), has discontinuities 
at x = --0.5 and at x = 0.0, which cause the error curve to have a 
discontinuous derivative. We examine f2(x) as it provides an in- 
teresting example of approximating a function which is only con- 
tinuous. In both cases we used Chebyshev polynomials as the 
Chebyshev system of functions. 

Example 1. [fi(x) =e*]. Tables I and II show how the critical 
points and the coefficients of the approximating polynomial con- 
verge as we approximate fi(x) = e? by a 4th-degree sum of Cheby- 
shev polynomials. Figures differing from the final result are under- 
lined at each step. 

Table I shows that the coefficients of the “best”? polynomial 
have converged to 6D after only one iteration; however, the critical 
points don’t converge until the second iteration as shown by Table 
II. In other words, the polynomial does not change coefficients 
very much with a small change in the critical points. The starting 
points shown in Table II are chosen by exchange from 6 X (n+2) = 
36 (for n=4) equally spaced points in the interval [—1,1]. 

Various methods for choosing the starting values for the 
critical points have been proposed. These include the zeros of 
Tr4i(x) -- Tr-1(x), which are also the extrema of 7,,1(x), and 
what we propose here is to let exchange choose n + 2 points from 
some original set of k(n+2) points where k > 1. The original 
k(n+2) points may be equally spaced, or they may be the zeros of 
Th(ng2y4t(%) — Teing2)-1(x). 

Table III compares various starting values for this example, 
Si(x) = et(n=4). Dnaz represents the maximum deviation from 
the “TRUE” values. 

Example 2. [f2(x)]. Approximation of f2(x) by an 8th degree 
sum of Chebyshev polynomials (7=8) poses the problem of hav- 
ing an error curve with more than N + 2 local extrema. This 
problem also arises when approximating an even or odd function 
(see [6]). We resolve the problem by including all the local extrema 
of the error function, e(x), which have the alternation of sign 
property, in the search for n + 2 critical points. That is, if the 
abcissas of the extrema are ordered algebraically, the signs of the 
corresponding ordinates must alternate. We obtain starting guesses 
for local extrema by having exchange pick n + 2 starting points 
from some original set of points, together with the corresponding 
first approximating polynomial, and then examining the resultant 
residuals. If the table of residuals indicates an extremum not al- 
ready chosen by exchange, which has the correct alternating sign, 
then the corresponding abcissa is included as a critical point for 
later iterations. & must be chosen greater than 1 in order for this 
method to work. 

Figure 2 shows the error curve, e(x), for the first and third 
iterations of approximating f2(x) by an 8th-degree linear combina- 
tion of Chebyshev polynomials. 

Table IV indicates how the choice of critical points can change 
from one iteration to the next. If we had not included the addi- 
tional extrema at points 5 and 6 at the first iteration, we would have 
arrived at the approximation whose error curve is illustrated by 
Figure 3. That is » + 2 extrema of the error curve have equal 
Magnitude and alternating signs, but another extremum exists 
with larger modulus. 
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Table V. Comparison of starting values x; for f(x) = f2(x), 
n = 8 (to 4D) 


J Ts(x) — T1(x) exchange on exchangeon TRUE 
= 0 33 points 201 points (computed) 
equally equally 
spaced spaced 
0 — 1.0000 — 1.0000 —1.00 — 1.0000 
1 —0.9397 —0.8750 —0.86 —0.8565 
2 —0.7660 —0.6250 —0.62 —0.6248 
3 —0.5000 —0.1250 —0.14 —0.1424 
4 ~—0.1736 0.0 0.0 0.0 
5 0.1736 0.1250 0.15 0.1456 
6 0.5000 0.4375 0.44 0.4413 
7 0.7660 0.7500 0.73 0.7290 
8 0.9397 0.9375 0.93 0.9289 
9 1.0000 1.0000 1.000 1.0000 
Dmax 0.3750 0.0210 0.0048 — 


AS an interesting comparison to Table III we give a similar ta- 
ble for f(x) = f2(x). Dmaz reptesents the maximum deviation from 
the “‘TRUE” values in Table V. 

7. Use of orthogonal polynomials. Consider the polynomials 
Po(x), pi(x), «++ , Pa(x) orthogonal on the set of points x. < x1. < 
+--+ <_Xm. Such polynomials are described by Forsythe [13], 
and they form a Chebyshev system. This is easily seen since any 
licear combination, 


P(x) = 2, ciPs(x), (7.1) 


is a polynomial of degree m which has exactly ” zeros. Hence on 
any interval, P(x) has no more than n zeros. This satisfies the defini- 
tion of a Chebyshev system. 

It is known, see Forsythe [13], that orthogonal polynomials 
have advantages over standard polynomials in least squares data- 
fitting. In the Remez algorithm, if a new set of polynomials, or- 
thogonal on the critical points, is computed each time the critical 
points are adjusted, convergence is assured. This can be proved by 
nothing that at each iteration the best orthogonal polynomial 
fit is equivalent to the best fit that would be obtained if the Cheby- 
shev system were held constant as standard polynomials. Perhaps 
this use of orthogonal polynomials will have computational ad- 
vantages over, say, standard polynomials on the interval [0,1]. 

The use of orthogonal polynomials for the Chebyshev system 
has been implemented and tried successfully on a Burroughs B5500 
but as yet we have no illustrations of any dramatic advantages over 
any other Chebyshev system. 
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Algorithm 
procedure remez (n, a, b, kstart, kmax, loops, f, chebyshev, eps, 
exchange, c, emax, trouble, why); 
value n, a, b, kstart, kmax, loops; 
real array c; reala, b, emax; label trouble; 
integer n, kstart, kmax, loops, why; 
real procedure /, eps; procedure chebyshev, exchange, 
comment Procedure remez finds the best fit (in the minimax sense) to 
a function / using a linear combination of functions which form a 
Chebyshev system. The exchange algorithm of E.L. Stiefel is used 
to obtain starting values for the critical points and the Remez 
algorithm is then used to find the best fit; 
begin 
procedure quadraticmax(n, x, niter, alfa, beta, ok, a, b, c, nadded, 
eps); 
value n, niter, alfa, beta, nadded; array x, c; 
integer n, niter, nadded; real alfa, beta, a, b; 
Boolean ok; real procedure eps; 
comment Procédure guadraticmax is called to adjust the values of 
the critical points in each iteration of the Remez algorithm. The 
points are adjusted by fitting a parabola to the error curve ina 
neighborhood, or if that proves unsatisfactory a brute force de- 
termination of the extrema is used; 
begin 
integer i, count, count2, nhalf, signepsxstar, signu, signy, signw, 
jmax, nerude, j, nn; 
real u, v, w, denom, epsu, epsv, epsw, xstar, epsxstar, xxx, misse, 
missx, dx, emax, etmp; 
integer array signepsx [(0:n + 1]; 
nn := n — nadded,; 
comment On arbitrary parameters... 
ncrude The number of divisions used in the brute force 
search for extrema. 
nhalf The parameter (alpha) which determines the size of 
interval to be examined for an extremum is reduced by 
half if a bad value for xstar is computed, however this 
reduction may occur only nhal/ times. 
misse If the value of the error curve at a new critical point 
differs from the previous value by a relative difference of 
more than misse then the brute force method is brought in, 
missx The brute force method keeps searching until it is 
within missx of an extremum; 
comment Set values of the constants; 
nerude := 10; nhalf := 4; misse 
1.010 ad 5; 
comment Compare missx with absepsx. They should be equal; 
for i := O step 1 until x + 1 do 


array epsx (0: + 1]; 


:= 1.019 —2; missx := 


begin 
epsx(i] := eps(x{i], c, nn); 
signepsx{i| := sign(epsx(i]); 
end; 


for i := step 1 until x + 1 do 
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begin 
comment If the starting values for the critical points do not 
alternate the sign of eps(x), then we go to the label trouble; 
if signepsx|i] X signepsx[i—1] * —1 then go to trouble; 
end; 
comment First find all the interior extrema. Then we will find 
the end extrema, which may occur at the ends of the interval; 
for i := 1 step 1 until 1 do 


begin 
count! := 0; cownt2 := 0; 
Ll: 
u:= x(t); 
v:= u+ alfa X (xfi+1] — w; w:= u + alfa X 
(x{i—1] — u); 
epsu := epsx{i]; signu := signepsx[i]; 
epsv := epS(v, c, nn); signv := sign(epsv); 
epsw := eps(Ww,c, nn); signw := sign(epsw); 
if — signu = signv \/ - signv = signw then go to L3; 
comment If the sign of eps(x) at the three points is not the 
same, we go to L3 where alfa is reduced to make the points 
closer together; 
epsu := abs(epsu); epsv := abs(epsv); epsw:= abs(epsw); 
L2: 
denom := 2.0 X ((epsv — epsu) X (w — u) + (epsw — 
epsu) X (u — ¥)); 
if denom = 0.0 then xstar := 0.5 X (v + w) else xstar := 
0.5 xX v+w) + v— Bw XK (u— w) X (epsv — epsw)/ 
denom; 
count) := count! + 1; 
comment Test xsfar to be sure it is what we want. Is it be- 
tween x{i—1] and x[i+-1]? Is eps(xstar) > eps(u, v, w)? If 
xstar is too bad, go to L3 and reduce alfa unless alfa has 
been reduced nhalf times. Otherwise if ok, go to savexstar; 
if xstar = u \/ xstar = v \/ xstar = w then 
begin 
epsxstar := eps(xStar, c, nn); signepsxstar := sign 
(epsxstar); 
epsxstar := abs(epsxstar); go to savexstar 
end; 
if xstar < x{i—1] \V xstar > x[i+1] then go to Z3; 
epsxstar := eps(xstar, c, nn); 
Signepsxstar := sign(epsxstar); 
epsxstar := abs(epsxstar); 
if signepsxstar # signu \/ epsxstar < epsu \/ epsxstar < 
epsv \/ epsxstar < epsw then 
begin 
if epsu > epsv /\ epsu > epsw then 
begin 
if abs(epsxstar — epsu) > misse * epsu then go to 
LBL2; 
xstar := u; epsxStar:= epsu; signepsxstar := signu: 
go to Savexstar, 
end; 
if epsv > epsu /\ epsv > epsw then 
begin 
if abs(epsxstar — epsv) > misse X epsv then go to 
LBL2; 
xstar := Vv; epsxsStar := epsv; signepsxstar := signy: 
go.to savexstar: 
end; - 
if abs(epsxstar — epsw) > misse X epsw then go to 
LBL2; 
xstar := w, epsxstar := epsw; SignepSxstar := signw; 
go to savexstar; 
LEBL2: 
jmax := 0; 
LBL: 
dx := (v—w)/ncrude; emax := 0.0; xxx := w — dx; 
for 7 := O step 1 until ncrude do 
begin 


XXX = xxx + dx; jmax := jmax + 1; 


LA4: 
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efmp := eps(xxx, ¢c, nn); 

if abs(etmp) > emax then 

begin 
emax := epsxstar := abs(etmp); 
signepsxstar := sign(etmp); 
ui= xSlar 1= XXX; 

u-+ dx; w= u— dx; 


< 
ll 


end; 
if dx > missx then go to LBL1; 
comment Make sure v and w are within bounds; 
if vy > x[i+1] then go to Z3; 
if w < x[i—1] then go to Z3; 
go to savexsta 
end; 
if count! > niter then go to savexstar; 
if epsu < epsw then 
begin 
if epsv < epsu then 


begin 
comment v is minimum; 
if xstar > u then 
begin . 
vi= xstar; epsv := epsxstar; go to L2; 
end; 
if xstar > w then 
begin 
epsv := epsu; Vi= U; 
epsu := epsxstar; u:= xstar; 
go to L2; 
end 
else 
begin 
Vi= U; epSV := epsu; 
u:= W; epsu := epsw; 
W i= xstar; epsw := epsxstar, 
go to L2; 
end; 
end 
else 
begin 
comment u is minimum; 
if xstar > v then 


begin 
U:= VY; epsu := epsy; 
v:= xstar; epsv := epSxstar,; 
go to L2; 
end; 
if xstar > w then 
begin 
u:= xstar; epsu := epsxstar; 
go to L2; 
end 
else 
begin 


u:= WwW; epsu:= epsw; 
Ww := xstar; epsw := epsxstar; 
go to L2; 
end; 
end; 
end 
else 
begin 
if epsv < epsw then 
begin 
comment v is minimum; go to Z4; 
end 
else 
begin 
comment w is minimum; if xstar > vy then 
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L3: 


I8: 


LS: 


begin 
Wi= Uy epsWw := epsu; 
u:= V5 epsu := epsv; 
v:= xSlar; epsv := epSxstar; 
go to L2; 

end; 

if xstar > u then 

begin 
W:= U5 epsw := epsu; 
u:= xStar; epsu := epsxstar; 

go to L2; 

end 

else 

begin 
Wi:= xSfar, epsw := epsxstar, 
go to L2; 

end; 

end; 
end; 


count2 := count2 + 1; 
if count2 > nhalf then go to trouble, 
alfa. := 0.5 X alfa; 
comment The factor 0.5 used in reducing alpha is arbitrarily 
chosen; 
go to LI; 
comment Replace x[i] by xstar after checking alternation of 
signs; 
savexstar: 
if i > 1 A signepsxstar X signepsx{i—1] + —1 then go to 
trouble; 
signepsx|i] := signepsxstar; 
xi]: := xstar; 
end; 
comment This is the end of the loop on i which finds all interior 
extrema. Now we proceed to locate the extrema at or near 
the two endpoints (left end, then right end); 
comment We assume beta > alfa; 
for i:= 0,n + 1 do 
begin 
count! := 0; count2 := 0; 


u:= x{ij; if i = Othen 

begin 
ifa <u then w := u + alfa X (a — wu) elsew: 
beta X (x{l] — w); 
v:=u-+alfaX (x[l] -— 4); 

end 

else 

begin 
ifb > u then w := u + alfa X (b — u) elsew := ut 
beta X% (x[n] — u); 
v:= u-+alfa X (x[n] — u); 

end; 

epsu := epsx{i]; Signu := signepsx{il; 

epsv := eps(v, c, nn); signv := sign(epsv); 

epsw := eps(w, c, nn); signw := sign(epsw); 

if signv ~ signu \/ signv ~ signw then go to L7; 

epsu := abs(epsu); epsv := abs(epsv); epsw := abs(epsw); 


u+ 


denom := 2.0 X (epsu X (v—w) + epsv K (w—u) + epsw X 
(u—v)); 

if denom = 0.0 then xstar := 0.5 X (w+yv) else xstar := 
0.5 * (vtw) + (v—u) X& (u-—w) X (epsyv — epsw)/ 


denom;, 
if i = 0 A («star <a \/ xstar > x[1]) then 
begin 

xstar := a; epsxstar := eps(a, c, nn); 


signepsxstar := sign(epsxstar); epsxstar := abs (epsxstar); 
end 


Li: 
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else 
ifi=n-+1 A (star > b A xstar < x{n}) then 
begin 
xstar := b; epsxstar := eps(b, c, nn); 
signepsxstar := sign(epsxstar); epsxstar := abs (epsxstar); 
end 


else 

begin 
epsxstar := eps(xstar, c, nn); 
signepsxstar := sign(epsxstar) ; 
epsxstar := abs(epsxstar); 

end; 


count! := countl + 1; 
if i = 0 A xstar > x[i] then go to £7; 
ifi=n+1 A xstar < x{n] then go to L7; 
if xstar = u \/ xstar = v \/ xstar = w then go to L6; 
if signepsxstar ~ signu \/ epsxstar < epsu \/ epsxstar < 
epsv \/ epsxstar < epsw then 
begin 
if epsu > epsv /\ epsu > epsw then 
begin 
xStar := u; epsxstar := epsu; 
signepsxstar := signu; go to L6; 
end; 
if epsv > epsu /\ epsv > epsw then 
begin 
xstar := Vv; epsxstar := epsv; 
signepsxstar := signv; go to L6; 


end; 

xstar := W; epsxStar := epsw, 

signepsxstar := signw; go to L6; 
end; 


if count] > niter then go to L6; 
if epsu < epsw then 
begin 
if epsv < epsu then 
begin 
comment v is minimum; 
vi= xStar; epsv := epsxstar; 


go to L5; 
end 
else 
begin 
comment u is minimum; 
u:= xstar; epsu := epsxStar,; 
go to L5; 
end; 
end 
else 
begin 
if epsv < epsw then 
begin 


comment vy is minimum; 
v:= xstar; epsv := epSxstar; 
go to L5; 

end 

else 

begin 
comment w is minimum; w := xstar; epsw := epsxstar,; 
go to L5; 

end 

end; 


count2 := count2 + 1; 

if count2 > nhalf then go to trouble; 

alfa := 0.5 X alfa; beta := 0.5 X beta; 

go to L8; 

comment Replace x{i] by xstar after checking its sign; 
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L6: 
if i = 0 A signepsxstar * signepsx(1] # — 1 then go to 
trouble; 
if i # 0 A signepsxstar X signepsx{n] ~ — 1 then go to 
trouble; 
signepsx[i] := signepsxstar; x{[i] := xstar; 
end; 
go to done; 
trouble: 
ok := false; go to L9; 
done: 
ok := true; 
L9; 


end quadraticmax; 
comment Procedure sfart computes the arrays which are then in- 
put to exchange to find the best approximation on the points at 
hand; 
procedure start (m, n, a, d, xi, chebyshev, f); 
value m,n; integer m, n; 
array a, d, xi; 
procedure chebyshev; real procedure /; 
begin 
integer i, 7; real array ¢(0:n]; 
for i := O step 1 until ™ do 
begin 
chebyshev. (n, xili], 0; 
for j := Ostep 1 until » do afi,j] := tj]; 
di] := f(xili]); 
end 
end sfart; 
comment Now the procedure remez; 
real epsc, alfa, beta, epsx, absepsc, absepsx, rcompare, dx, maxr, 
minr, tempr, minsep; 
integer m, i, itemp, j, niter, nloop, k, nadded, isub, maxri, ilast, 
signnow, jj; 
integer signnew; integer array refset(0:n + 1+n); 
comment Assume number of points added < n; 
integer array ptsadd(0 : n]); 
array clast{0 : + 1], xq, xqlast(0:n +1 +n); 
Boolean /firsttime, ok, convx, conve, addit; 
why := 0;k := kstart; 
comment Come here if k gets changed: 
newk: 
m:=n+1+(k —1) X @+ 2); 
begin 
array r, xi, d[(0 : m], aa[(0:m,0:n + 1]; 
firsttime := true; convx := false; conve := false; 
nloop := 0; 
comment This makes the initial points spaced according to the 
extrema of the Chebyshev polynomial of degree m — 1; 
for i :== Ostep 1 until m do 
xili] :== (@+5)/2.0 — (b—a) X cos((3.14159265359  i)/m)/ 


2.0; 
comment 3.14159... is 7; 
dx := (b—a)/m; 


comment To use equally spaced points a statement such as the 
following could be used. for i := 0 step 1 until m do xi{i] := 
atix dx; 

start(m, n, aa, d, xi, chebyshev, f), 

comment The following constants are used in testing for conver- 

gence 
epsc used in relative test on coefficients 
absepsc used in absolute test on coefficients 
epsx used in relative test on critical points 
absepsx used in absolute test on critical points 
rcompare used to compare relative magnitudes of max and 
min values of residual on the critical points; 

epsc :== 1.019 — 7; absepsc := 1.0;9 — 7; epsx := 1.010 — *: 
absepsx := 1.010 — 5; 

rcompare := 1.0000005; 
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comment epsx and absepsx should be the same as missx in pro- 
cedure quadraticmax. epsc and absepsc should be adjusted 
according to knowledge of the expected magnitudes of the 
coefficients (if known). It is best to depend on the critical 
points and/of the max and min of the residuals for conver- 
gence criteria; 

comment Now call on exchange to find the first approximation 
to the best approximating function; 

exchange (aa, d, c, m,n, refset, emax, singular, r); 

comment The subscripts of the points chosen are in array ref- 
set(O:n+1], the coefficients of the best approximating func- 
tion on the m points are in c[0:n], the residuals in r; 

comment The reference set, the coefficients at this step, and/or 
the residuals may be written at this point; 

for i := Ostep 1 until ” do clast(i] := c[i]; 

comment Now we are going to look for any extrema not given 
by the points chosen by exchange; 

comment Make sure critical points are algebraically ordered; 

for i := O step 1 until 1 do for j := i + 1 step 1 until» + | do 

begin 
if refset|j] < refser(i] then 


begin 
itemp := refsetlj]; refser|j] := refset(il; 
refset|i] := itemp; 
end 
end; 


nadded := 0; maxr:= 0; maxri:= 0; ilast := 0; 
signnow := sign(r(0]); 
for i := O step 1 until m + 1 do 
begin 
if 7 = m + 1 then go to LBL; 
if sign(rli]) # 0 A sign(r[i]) = signnow then 


LBL: 


begin 
if abs(r[i]) > maxr then 
begin maxri := i; maxr := abs(r{i]); end 
end 
else 
begin 
if i < m + 1 then signnow := sign(r[i]); 
addit := true; 
for 7 := Ostep 1 until » + 1 do 
begin 
for jj := ilast step 1 until i — 1 do 
begin 
if jj = refset[j] then addit := false, 
end 
end; 
if addit then 
begin 
nadded := nadded + 1; if nadded > n then 
begin 
comment We assume nadded is always < n. If nadded 
is > n, why is set to —1 and we go to the label 
trouble. This can be modified by changing this test 
and changing the declarations for ptsadd, refset, xq, 
and xqlast above; 
why := —1; 
go to trouble 
end; 
ptsadd(nadded) := maxri; 
refset [n + 1 + nadded] := maxri; 
end; 
if i < m-+ 1 then 
begin 
ilast := i; maxr := abs(rii]); | maxri := i; 
end 
end 
end; 


comment We now have n + 2 + nadded points to send to 
quadraticmax for adjustment; 
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m:= n+ nadded; 
comment Make sure critical points are algebraically ordered; 
- for i := 0 step 1 until » do for j := i + 1 step 1 until m + 1 
do 
begin 
if refset[j] < refset[i] then 
begin . 
itemp := refset[j]; refsetlj] := refsert[i); 
refset{i] := itemp; 


end 
end; 
for i:= O step 1 until m + 1 do xgq[i] := xilrefset [i]; 
niter := 2;. 


comment This is the number of times to iterate in guadraticmax; 
alfa := 0.15; beta := 0.2; 
comment a/fa and beta are used to determine the points used 
in guadraticmax to fit a parabola. They are arbitrary subject 
to: 0 < alfa < beta < 1. Also beta should be fairly small 
to keep the points on one side of zero; 
comment This is the beginning of the loop that calls on 
quadraticmax, exchange, etc.; 
loop: 
nloop := nloop + 1; 
quadraticmax(m, xq, niter, alfa, beta, ok, a, b, c, nadded, eps); 
if — ok then 
begin 
k:=k+1; ifk > kmax then 
begin why := 1; goto trouble; end; 
go. to newk; 
end; 
if —. firsttime then 
begin 
comment Compare the largest and smallest of the residuals 
at the critical points (after adjustment) ; 
comment Set minr to a large number; 
maxr := 0.0; minr := 1.01050; 
for i := O step 1 until 7 + 1 do 
begin 
addit := true; 
forj := 1 step 1 until nadded do if refset[i] = ptsadd{[j| 
then addit := false; 
if addit then 
begin 
tempr := abs(eps (xq [refset [i]], c, n)); 
if tempr > maxr then maxr := tempr else if tempr < 


minr then minr := tempr; 
end 
end; 
if maxr < rcompare X minr then why := 4; 
end; 


comment Compare xq to xqlast; 
if — firsttime then 
begin 
convx := true; 
for i := Ostep 1 until m + 1 do 
begin 
if abs(xq [i] — xqglast{i}) > absepsx then 
begin 
if abs (xq [i] — xqlast[i]) > epsx X abs(xq [i]) A 
xq{i] ~ 0.0 then convx := false; 
if xq[i] = 0.0 A abs(xq [i] — xglast{i]) > absepsx 
then convx :-= false; 
end; 
xglast{i] := xq[il; 
end 
end 
else 
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begin 
firsttime := false; 
for i := Ostep 1 until m + 1 do xqglast{i] := xq[i}; 
for i := O step 1 until n do clast|i] := c[i]; 
end; 
comment Get ready to call exchange again, 
start(m -+ 1, n, aa, d, xq, chebyshey, f); 
exchange(aa, d, c, m+ 1, n, refset, emax, singular, r); 
comment Now compare the new coefficients to the last set of 
coefficients; 
if — firsttime then 
begin 
conve := true; 
for i := O step 1 until n do 
begin 
if abs(c[i] — clast[i]) > epsc X abs(cli]) A cli] # 0.0 
then conve := false; 
if cli] = 0.0 A abs(cli] -- clast{i]) > absepsc then 
conve := false; clast{i] := cfi]; : 
end 
end; 
comment Set the parameter why to the proper value according 
to the following: 
why = 4 if maxr < rcompare X minr. 
why = 5 if “4” and convx = true. 
why = 6if “4” and conve = true. 
why = 7 if “4” and convx = conve = true. 
why = 8 if convx = true. 
why = 9 if conve = true. 
why = 10 if convx = conve = true. Any value of why > 
4 indicates convergence; 
if why = 4 /A.convx then why : 
if why = 4 A conve then why : 


= 5 
= 6 
if why = 5 A conve then why := 7 
= 8 
:= 9 
:= 1 


Hi 


if why = 0 A convx then why : F 
if why = 0 A conve then why 
if why = 8 A conve then why 
if why > 4 then go to converged; 
if nloop > loops then 
begin why := 3; go to trouble end; . 
comment We go to label srouble in calling program if no con- 

vergence after a number of iterations equal to loops; . 
go to loop; 

singular: 
why := 2; go to trouble; 
comment We come to singular if exchange gets into trouble; 

converged: 

end; 
comment End of block using m in array declarations; 
comment There are four exits to the label trouble... 
(why=1) if k gets > kmax 
(why = 2) if exchange gets into trouble 
(why =3) if no convergence after iterating Joops number of 
times 
(why = —1) if number of added points is greater than n; 
end remez 


? 
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Algorithm 415 


Algorithm for the Assignment 
Problem (Rectangular 
Matrices) [H] 
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Description 
This algorithm is a companion to [3] where the theoretical 
background is described. 
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Algorithm 

procedure assignment (a, n, m, x, total); 
value a, n, m; integer n, m; 
real total; array a; integer array x; 

comment: a[i, j] isan X m matrix, x[1], x[2], ..., x[n] are assigned 
integer values which minimize total := sum(i := 1(1)n) of the 
elements a[i, x[i]]. If m > n the x[i] are distinct and are a subset 
of the integers 1, 2, ..., m. If m = n the x[i] are a permutation 
of the integers 1, 2, ..., n. If m <n the set of x[i] consists of 
some permutation of the integers 1, 2, ..., m interspersed with 
n — m zeros. The permutation and the positions of the zeros are 
chosen in such a way as to minimize the above sum with the 
convention that a[i, o] is to be taken equal to zero. imin = 
min(n, m) and imax = max(n, m) must be such that: imin > 0, 
imax > 1. 
This procedure is based on that of Silver [1] which uses the 
assignment algorithm of Munkres [2]. Silver’s procedure has 
been extended to handle the case n + m; 
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begin 
switch switch := NEXT, L1, NEXT 1, MARK; 
real min; 
integer array c[1l:n], ch[1:m], lambda[1:m], muf[l:n], r{1:n], 
ylL:m]; 


integer chi, cl, cl0, i, j, k, 1, rl, rs, sw, imin, imax, flag; 

total := 0; imin := m; imax := n; 

ifn > m then go to JA; 

imin := n; imax := m; 

for i := 1 step 1 until n do 

begin 
min := afi, 1); 
for j := 2 step 1 until m do if a[i, j] < min then min := ali, j]; 
for j := 1 step 1 until m do ali, j] := ali, /] — min; 

total := total + min; 


end; 

ifm > n then goto JB; 
JA: 

for 7 := 1 step 1 until m do 

begin 


min := a{l, J); 
for i := 2 step 1 until do if a[i, j] < min then min: 
for i := 1 step 1 until n do afi, /] := ali, j] — min; 
total := total + min; 


ali, /); 


ll 


JB: 
for i := 1 step 1 until » do x[i] := 0; 
for 7 := 1 step 1 until m do y{j] : 
for i := 1 step 1 until x do 
begin 
for / := 1 step 1 until m do 
begin 
if ali, j] ~0 V x{i] 0 V yL/] ¥ Othen go to J1; 
x[} =f; yU) = 5 


i 
rd 


JI: 
end; 
end; 
comment Start labeling; 
START: 
flag :=njrl:=cl:=0;rs:= 1; 
for i := 1 step 1 until x do 
begin 
mui] := 0; 
if x[i] + O then go to /1; 
rl := rl +1; r{rl) := i; mu[i] := —1; 
flag := flag — 1; 
Ni: 


end; 

if flag = imin then go to FINI, 

for j := 1 step 1 until m do lambda{[j] := 0; 

comment Label and scan; 

LABEL: 

i:=r[rs];rs:=rs+1; 

for j := 1 step 1 until m do 

begin 
if ali, j] ~ 0 V lambda[j] + O then go to J2; 
lambda [j] := i; cl := cl + 1; c[cl] := J; 
if y[j] = O then go to MARK; 
rl:= rl +1; r[rl] = yly]; mulyl/] = 6 

J2: 

end; 

if rs S rl then go to LABEL; 

comment Renormalize; 

sw := 1;cl0 := cl; cbl := 0; 

for j := 1 step 1 until ™ do 

begin 
if lambda[j] # 0 then go to J3; 
chl := cbl + 1; cb{cbl] := j; 
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J3: 


end; 
min := a[r{1], cb[1]]; 
for k := 1 step 1 until r/ do 
begin 
for /:= 1 step 1 until cb/ do 
if a[r[k], cb{l}] < min then min := a[r[k], cb{1]]; 
end; 
total := total +- min X (rl+-cbhl—imax); 
for i := 1 step 1 until 1 do 
begin 
if mu{i] ~ 0 then go to 72; 
if cl0 < 1 then go to /3; 
for / := 1 step 1 until cl0 do a[i, c[/]] := ali, c[/]] + min; 
go to 73; 


12: 
for / := 1 step 1 until cb/ do 
begin 
ali, cb{t]] := afi, cb{l]] — min; 
go to switch[{sw); 
NEXT: 


if ali, cb{l}] # 0 V lambda{cb{l}] 0 then go to Z1; 
lambda(cb{!)] := i; 


Ll: 


B: 
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if y[cb{/]] = 0 then 
begin 
J:=cb{l]; sw:=2; gotoZi; 
end; 
cl:=cl+1; cfel]:=cb{l]; rl:=rl+1; 
r[rl] := y[cb{(l]]; 
end; 
end; 


go to switch|sw + 2]; 


NEXT 1: 


if cl0 = cl then go to LABEL; 

for i := cl0 + 1 step 1 until c/ do mu[y[efi}]] := cli); 
go to LABEL; 

comment Mark new column and permute; 


MARK: 


yl] := i := lambdalj); 

if x[/] = 0 then begin x[i] := /; 
end; 
k:=j; j:= xi]; xli]:=k; goto MARK; 


go to START; 


FINI: 
end 


0 
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Algorithm 416 


Rapid Computation of 
Coefficients of 
Interpolation Formulas [E1| 


Sven-Ake Gustafson* [Recd. 21 Aug. 1969] 
Computer Science Department, Stanford University, 
Stanford, CA 94305 


Key Words and Phrases: divided differences, Newton’s 
interpolation formula 
CR Category: 5.13 


Description 
This: algorithm is a companion to [1] where the theoretical 
background is described 


References 

1. Gustafson, Sven-Ake. Rapid computation of interpolation 
formulae and mechanical quadrature rules. Comm. ACM 14 
(Dec. 1971), 797-801. 


Algorithm 
procedure INTP (dx, f, c, ord, n); 


value; real array dx, f, c; 
integer array ord; integer n; 
begin 


comment IN7P determines the coefficients of the polynomial of de- 
gree less than » which reproduces given function values and 
divided differences. The parameters of IN7P are: 


idenlifier type comment 

n integer 

ord integer array Array bounds [1:7] 
dx, f,c real array Array bounds [1:7] 


n is the number of coefficients of the interpolating polynomial. 
ord gives the character of the input data: if ord[i] = 1 then x[i] 
should be an argument and /[i] the corresponding function value. 
But if ord[i] > 1 then /[/] should contain a divided difference with 
a number of arguments equal to ord{i]. In this case dx{i] should 
contain the difference between the argument of highest index of 
fi] and that of f[i-—1). 

Upon execution of INTP the coefficients of the desired poly- 
nomial are stored in c in such a manner that the coefficient in 
front of the power ¢*) is contained in c[i]. Other parameters are 
not changed. Caution: The given data must be such that it is 
possible to construct Newton’s interpolation formula with 
divided differences from them. We must also have ord[1] = 1. 
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Observe that if derivatives of f are given the corresponding 
divided differences with confluent arguments must be evaluated 
and given as input data. 


Examples of use of INTP: ; 
Example 1. Determine the polynomial of degree less than n which 
interpolates a function f at n distinct points x;, i = 1, 2, ...,”. 


Input data: dx{i] = x;, fli] = fi, ordi] = 1,i = 1, 2,..., 2”. 
Example 2. Let x1, x2, x3, x4 be four given points. We know 
fi, fir, fas, and fz . Determine the polynomial of degree 3 which 


reproduces these quantities. Input data: n = 4, 


dx{1] = x1 ordil] =1 fliJ=A 
dx(2] = x2— x1 ord[2])=2 f[2] =f 
dx|3] = x3 — x2 ord[3]=2 f[3) =fas 
dx[4] = x4 ora(4] = 1 fl4J=fa 


Example 3. The same problem when we are given f(—1), f’(— 1), 
f"(—1), and f(1). Input data: n = 4, 


ax{1]} = -—1 ord{ij=1 f(1] =f(—)) 
dax{2]} = 0 ord[2]=2 f[2] = f’(—1) 
dx[3] = 0 ord[3)=3 f[3] = 0.5-f"(—1) 
dx[4] = 1 ordj4)=1 [4] =f) 


For further details see [1]; 
integer i, j, k; real ai, h, d, xx; 
real array arg [1 : 1]; 
comment Initiate phase DI, 
for i := 1 step 1 until 7 do 
arg|i] := if ord[i] = 1 then dx[i] else dx[i] + argli— 1]; 
comment Phase DI; 
for i := 2 step 1 until n do 


begin 
j:= ordi); 
if 7 = 1 then go to divde; 
d:= ffi); 
for k := istep —1 until i — j + 2do/[k] := f[k—1]; 
fli-j+1] := d; 
h:= dx{i]; ai:= arg[il; 


for k := i — j + 2step 1 until i — 1 do 
Stk) := f{k] + flk—1] X @i—arg{k—1); 
fli] := fli] + fli-—1] X 4; 
argli] := ai; 
divde: 
for k := i — j step —1 until 1 do 
f(k) := (ftk+1]—f1k))/(argli]—argtk); 


end i-loop; 
comment phase DiI; 
c{1] := f[1]; ifm = 1 then go to ready; 
for i := 2 step 1 until n do 
begin 
xx := argli]; cli] := cli—1]; 


for k := i — 1 step —1 until 2do 
c[k] := —xx X c[k} + c{k—-1]; 
ce{1] := f[i] — xx X c[l] 
end second i-loop; 
ready: 
end INTP 


* Present Address: Inst. F. Informations Behandling (Numeisk an- 
alys), KTH, 10044 Stockholm, Sweden. 


COLLECTED ALGORITHMS FROM CACM 


Algorithm 417 


Rapid Computation of Weights 
of Interpolatory Quadrature 
Rules [D1] 


Sven-Ake Gustafson* [Recd. 21 Aug. 1969] 
Computer Science Department, Stanford University, 
Stanford, CA 94305 


Key Words and Phrases: divided differences 
CR Categories: 5.16 


Description 
This algorithm is a companion to [1] where the theoretical 
background is described 


Reference 

1. Gustafson, Sven- Res Rapid computation of interpolation 
formulae and mechanical quadrature rules. Comm. ACM 14 
(Dec. 1971), 797-801. 


Algorithm 
procedure INTG(y, dx, m, ord, n); 


valuen; real array y, dx, m; 
integer array ord; integer n; 
begin 


comment JN7G determines weights in quadrature rules of the form 


b ie 2 
i; f(t) dot) =X mifg@ (1) 


1 


Here f?”* can be a function value or derivative or divided dffer- 
ence of order 1. The weights m,; are determined such as to render 
the rule exact when the integrand fis a polynomial of degree less 
than n. The parameters of INTG are: 


identifier type comment 

n integer 

ord integer array Array bounds [1:1] 
y, dx,m_ real array Array bounds [1:7] | 


n is the number of abscissae in formula (1). ord gives the charac- 
ter of the quantities £27? if ordi] = 1 then f77"“ is the function 
value f; , if ord{i] = 2, then £2? is a divided difference with two 
arguments. (The procedure does not handle cases where ord[i] > 
2.) 

If ord[i] = 1, then dx[i] should contain the argument corre- 
sponding to f? ot @ , else dx[i] should contain the difference between 
the arguments of highest index in £2?“ and that of f°°4°. 

y should contain the moments, that is in y[r] must be stored 
the number 


b 
I t' da(t) 


* Present address: Inst. F. Informations Behandling (numerisk 
analys), KTH, 10044 Stockholm, Sweden. 
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Upon execution of INTG the weight rm; is stored in m[i]. Other 
parameters are not changed. Example of usé of INTG: Deter- 
mine the coefficients m: , mz , m3 , and im, in the rule 


[i 9 dx = mf(—1) + mf(—-1) + mf(1) + mf'(1) 
-1 


Input data: n = 4 


dx{1] = —1 ord{1] =1 yf[l) = 
dx(2] = 0 ord[2]}=2 y[2] =0 
dx[3] = 1 ord[3}=1 y[3] = 2/3 
adx[4] = O ord[4J)=2 y[4]=0 


Restriction: We can only have ord{i] == 1 or ord{i] = 2. Further- 
more the given data must be such that it is possible to construct 
Newton’s interpolation formula with divided differences from the 
set x; f?? i = 1, 2,..., n. We must also have ord[1] = 
For further details, see [1]; 
integer i,j,k; real t; 
real array x[1:]; 
comment Initiate phase PI; 
for i := 1 step 1 until n do 
begin 

mli] := yf]; 

xi] := if ord[i] = 1 then dx[i] else dx(i] + x[i—1] 
end; 
comment Phase P/; 
for 7 := 2 step 1 until 1 do 


begin 

t:= x[j—1]; 

for i := nstep —1 until j do m[i] := m[i}] — t X m[i—1] 
end; 


comment Phase PI; 
for kK := 1 step 1 until» — 1 do 
begin 
comment transform from descending diagonal k to descending 
diagonal k + 1; 
tk = n—1 N ord[n] = 2 then go to ready; 
= x|k];_ m[n] := m{n]/(x[n)—-9; 


fond: =n — 1step —1untilk + 2dom{[i] := (m[i] — m{i+1))/ 
(x[i]—1); 
if ord[|k +1] = 2 then 
begin 
m[(k+1] := m[{k+1] — m[k+2]; goto on; 
end; 
ifk + 1<nthen 
m{k+1] := (m[k+1] — m[k+2])/(if ord[k+1] = 2 then 
dax{|k+1] else x[k+1]—4; 
if ord[k] = 1 A ord[k+1] = 1 then 
begin 
m(k] := m[k] — m[k+1]; goto on 
end; 
for i:= k — 1 step —1 until 1 do 
if ord[i] = 1 then 
begin 
-j:=t, gotonext 
end; 
next: 
= m(k+1]; 
for i := k step —1 until j + 1 do m[i] := m[i] — t X dx{[i); 
mj] := m{j] — 4; 
On: 
end; 
ready: 
end INTG; 
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Algorithm 418 | 


Calculation of Fourier 
Integrals [D1] 


Bo Einarsson [Recd. 25 Aug. 1970, 30 Oct. 1970, and 
25 Jan. 1971] . 


Research Institute of National Defense, Box 98, 
S-147 00 Tumba, Sweden 


Key Words and Phrases: quadrature, Filon quadrature, 
integration, Filon integration, Fourier coefficients, Fourier 
integrals, Fourier series, spline, spline approximation, spline 
quadrature, extrapolation, Richardson extrapolation 

CR Categories: 5.16 


Description : 

The most commonly used formula for calculating Fourier in- 
tegrals is Filon’s formula, which is based on the approximation of 
the function by a quadratic in each double interval. In order to 
obtain a better approximation the cubic spline fit is used in [1]. 
The obtained formulas do not need the explicit calculation of the 
spline fit, but in addition to the function values at all intermediate 
points, the values of the first and second derivatives at the bound- 
ary points are required. However, these values are often obtained 
from symmetry conditions. If the derivatives at the end-points are 
unknown, they may be calculated from a cubic spline fit, for ex- 
ample by using some exterior points or by using two extra interior 
conditions for the spline fit. It can also be noted that in certain 
periodic cases the terms containing the derivatives will cancel, 
and their values will be superfluous. The use of Algorithm 353 
[2] is recommended if the frequency w/z is a positive integer and 
the interval is [0,1]. Test computations reported in [1] indicate 
that the spline formula is more accurate than Filon’s formula. 
Both are of the fourth order. The expansion of the error term in 
powers of the step length contains only even powers, and there- 
fore the use of Richardson extrapolation is very efficient. 

The algorithm presented here is similar to Algorithm 353. by 
Chase and Fosdick [2], but in the present routine, Richardson ex- 
trapolation is included in order to obtain faster convergence. 

The routine FSPL2 evaluates the integrals 


12 12 
C= i e~* cos (wx) dx and S = e~* sin (wx) dx. 
2 2 


Copyright © 1972, Association for Computing Machinery, Inc. 
General permission to republish, but not for profit, an algorithm 
is granted, provided that reference is made to this publication, to 
its date of issue, and to the fact that reprinting privileges were 
granted by permission of the Association for Computing Machinery. 
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using the algorithm described in [1]. FSPL2 contains a feature 
which selects an initial integration step size such that at least two 
quadrature nodes are within each full period of the trigonometric 
function, cf. [3]. This step size is reduced by halving until the 
specified accuracy is obtained or the maximum number of interval 
halvings of the original interval is reached. Two evaluations are 
always performed. If the interval [a, 5] is long, it is advised to take 
special precautions. 

The use of Richardson extrapolation, which is performed in 
the subroutine ENDT2, decreases the number of function evalua- 
tions by a factor 4 in several of the test examples. It is possible 
to introduce the fast Fourier transform in order to obtain faster 
computation of the inner loop of the algorithm. Another exten- 
sion is to calculate the central part of the integral with the spline 
algorithm and the tails with the method in [4], which gives accu- 
rate results even when the function f(x) is slowly decreasing if the 
frequency w is large. 

Finally we give some test examples for both single and double 
precision computation of 


b b 
C= : f(x) cos (wx) dx and S = [ F(x) sin (wx) dx 


Error in computed 


MAX LC LS W_ EPS Cc AY Cc AY 

SP Input 10] 1} tj) 0.05} 1076 

Output 5} 5 0.133645) 0.020190! —30.10-8| —15.1078 
SP Input 10; i} 1150.0 | 10-6 

Output 9| 9 0.001417] 0.002306} -13.10-8| —14.10-8 
DP Input 15] 1) 1) 0.05] 10-44 

Output 7| 7 0.133645] 0.020190} —68.10-27} —7.10-1? 
DP Input 15) 1] 1150.0 | 10-14 

Output 11] 11 0.001417) 0.002306} 3.10717 8.10727 
References 


1. Einarsson, Bo. Numerical calculation of Fourier integrals 
with cubic splines. BIT 8 (1968), 279-286. 

2. Chase, Stephen M., and Fosdick, Lloyd D. Algorithm 353, 
Filon quadrature. Comm, ACM 12 (Aug. 1969), 457-458. 

3. Einarsson, Bo. Remark on algorithm 353, Filon 
quadrature. Comm. ACM 13 (Apr. 1970), 263. 

4. Gustafson, Sven-Ake, and Dahlquist, Germund. On the 
computation of slowly convergent Fourier integrals. Presented at 
Nov. 1970 meeting in Oberwolfach and to appear in Methoden 
und Verfahren der Mathematischen Physik. 

5. Einarsson, Bo. On the calculation of Fourier integrals. 
Preprints of the IFIP Congress 71, Booklet TA-1, 
North-Holland Pub. Co., Amsterdam, 1971, pp. 99-103. To 
appear in Information Processing 71, same publication. 
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SUBROUTINE FSPL2 


* (F pA yBsFPA sFPB oF BAgF BBW 9EP Sy MAX 9 Cy Se LCy LS) 


THIS ROUTINE COMPUTES THE FOURIER INTEGRALS 
C=INTEGRAL F{X) COS WX DX FROM X=A TO X=B 
S=INTEGRAL F(X) SIN WX OX FROM X=A TO X#8 


WITH THE SPLINE PROCEDURE IN Be EINARSSON, NUMERICAL 
CALCULATION OF FOURIER INTEGRALS WITH CUBIC SPLINES,» 
BIT, VOL. 8» PPe 279-286, 1968. 


REPEATED RICHARDSON EXTRAPOLATION IS USED. 


THIS SUBROUTINE HAS ADAPTED SEVERAL IDEAS FROM 
ALGORITHM 353, FILON QUADRATURE BY CHASE AND FOSDICK, 
COMM. ACM, WOLe 12_ PP. 457-458, 1969. 


F(X)=THE FUNCTION TO BE INTEGRATED, SUPPLIED BY THE USER 
AND DECLARED ‘*EXTERNAL® IN THE CALLING PROGRAM. 


DATA PI / 36141592653589793 / 


A=LOWER QUADRATURE LIMIT ANO B=UPPER QUADRATURE LIMIT 
IF A.GE.B THE COMPUTATION IS 8YPASSED AND THE SIGNS OF 
LCy LS» AND EPS ARE CHANGED. 


FPA AND FPB ARE THE VALUES OF THE DERIVATIVE OF F(X). 
FBA AND FBB ARE THE CORRESPONDING VALUES OF THE SECOND 
DERIVATIVE AT THE POINTS A AND 6. 

W=THE ANGULAR FREQUENCY 


EPS = REQUIRED ACCURACY, DEFINED BY 
(ERROR| < EPS*(16+I1C1) 
AND 
IERROR| < EPS#*(1.+iS1) 
IF CONVERGENCE IS NOT OBTAINED, THE VALUE 
OF EPS IS RETURNED WITH NEGATIVE SIGN. 


MAX=THE MAXIMUM NUMBER OF PARTITIONS (INTERVAL HALVINGS) 
IN THIS ROUTINE THE INTERNAL VARIABLE MXN DEFINED BELOW 
1S USED INSTEAD OF MAX. 

LC POSITIVE ON ENTRY INDICATES THAT C IS WANTED. 

LS POSITIVE ON ENTRY INDICATES THAT S IS WANTED. 

ON EXIT LC AND LS GIVE THE NUMBER OF PARTITIONS USED 

FOR THE COMPUTATION OF C AND S. 


THIS ROUTINE CALLS THE SUBROUTINE ENOT2. 


DIMENSION PVTC(7} sPYTS(7) 
IF(EPS.LT.O-) GOTO & 
IF(A.LT.B) GOTO 10 
EPS=-EPS 
5 LC#=LC 
LSz-LS 
RETURN 
10 N=1 
WL=ABS(W) 
TEMP=2.0* (BA) *W1/PI 
IFUTEMP .GT e220) N*ALOG (TEMP) /0.693 
0-6932AL0G(2-.) ROUNDED DOWNWARDS. 
MXN=MAXO (MAX »N+1) 
FA=F(A) 
FB=F(B) 
COSA=COS (W1 *A) 
SINASSIN(WL*A) 
COSB=COS (W1 #8) 
SINB#SIN (W1*B) 
H= (B-A) /FLOAT (2 *4N) 
NSTOP224#4#N-1 
NSTal 
TMAX IS THE SWITCH-OVER POINT FOR TETA. 
ANALYSIS SHOWS THAT WITH A 56 BIT FLOATING POINT MANTISSA 
TMAX20 22 
1S SUITABLE, WHILE WITH A 24 BIT MANTISSA WE PREFER 
TMAX=1- 
TMAXB IS THE SWITCH-OVER POINT IN BETA, WHERE THE 
CANCELLATION IS STRONGEST. 
TMAXB25 . *TMAX 
LLC AND LLS ARE USED BY THE ROUTINE IN COMPUTED-GO-TO 
STATEMENTS.» AS SOON AS LLS AND LLC HAVE BEEN DEFINED, 
WE CAN USE LS ANO LC AS RETURN PARAMETERS (SEE ABOVE). 
KFCLS) 11512512 
11 LLS#2 
GOTO 13 
12 LLS=1 
13) TF OLC)14 914915 
14 LLGz2 
GOTO 17 
15 LLC=l 
17 CONT INUE 
SUMCOS=0 .5 *(FA*COSA+FB *COSB ) 
SUMSIN=0 o5*(FA*SINA+FB*SINB) 
ALL OF THE ABOVE IS EXECUTED ONLY ONCE PER CALL. 
NOW THE ITERATION BEGINS. 
THE CONSTANT *M* IS USED IN THE RICHAROSON EXTRAPOLATION. 
M-1 IS THE NUMBER OF TIMES THE ORIGINAL STEP LENGTH ‘H® 
HAS BEEN DIVIDED BY TWO. 
Mz] 
20 CONTINUE 
H22H *H 
TETA=W1 4H 
DO 65 I1#1,NSTOP »NST 
XaA+H*FLOAT (I) 
WX=W1 #X 


50 
55 
60 
65 


c 70 
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GOTO (50955) eLLS 
SUMS INjSUMSIN+F (X) *SIN (WX) 
GOTO (60765) sLLC 
SUMCOS#SUMCOS+F (X) *COS (WX) 
CONT INUE 

T2=TETA*TETA 

TEMP=1 .0-SIN(0 o5*TETA) ##2/165 

IF (TETA-TMAX) 70970575 
IS THE POWER SERIES FOR SMALL TETA, 75 IS THE CLOSED 


C FORM USED WITH LARGER VALUES OF TETA. 


C THE 
Cc GIV 
70 


COEFFICIENTS OF THE DIFFERENT POWER SERIES BELOW ARE 
EN IN EXACT FORM, COMPARE WITH THE REFERENCE ABOVE. 
ALFAsTETA* (1 60-T2*(2 60/15 00-T2% (19 60/1680.0- 


~T2*(13.0/25200 .0-T2* (293 .0/19958400.0- 


—T2*181 .0/619164000.0)))))/12.0 
DELTA&—1 60/12 004T2*(1 00/9060-T24(5 20/1 20960- 
-T2*(120/129600 60-7 2/11404800.0))) 
EPSIL=1 .0-T2*(1.0/6.0-T2*(0.0125-T 2%(17.0/30240.0- 
~T2*(31.0/1814400 .0-T2/2661120.0)))) 
T3=T2 
72 BETASTETA#H2 *() .0-T2/21 60*(1.0-T2* (120/48 .0- 
-T2*(1 0/3960 00-73/494208 60) )))/180.0 


GOTO 80 


C CLOSED FORM OF THE COEFFICIENTS. 


75 TEMP1L=(0 o5*TETA) *#2 
TEMP 2=SIN (005 ¥*TETA) **2/TEMP), 
TEMP3=SIN(TETA)/TETA 
ALFA=(TEMP-TEMP2*TEMP3 )/TETA 
DELTA=(TEMP-TEMP2)/T2 
EPSIL=TEMP2*TEMP2 
IF (TETA~TMAXB) 76976578 
76-T3=T2*(1 T2011 o/175 o-T2*(14/40800.-T2/12209400~))) 
GOTO 72 
78 BETA=(TEMP-TEMP3 )/ (TETA*W1 *W 1) 


C HAVE CALCULATED THE COEFFICIENTS», NOW READY FOR THE 


c 


INTEGRATION FORMULAS. 
80 GOTO (81,85),LLS 
81 TS=H*((BETA*FBB-ALFA FB) *COSB+ ( ALFA*FA-BETA*FBA) *COSA+ 
+DELTA*H* (FPB*SINB-FPA*S INA) +EP SIL*SUMSIN) /TEMP 
CALL ENDT2(PVTS TS sEPS eS oLLS om) 
LS=N 
85 GOTO (8690) sLLC 
B86 TC=H*( (ALFA *FB-BETA*FBB ) *SINB+( BE TA*F BA~ALFA*F A) *SINA+ 
+DELTA*H* (FPB*COSB-FPA*COSA)+EPSIL*SUMCOS) /TEMP 
CALL ENDT2 (PVTC yTC EPS oC o LLC 9M) 
LC=N 
90 CONT INUE 


C NOW TEST TO SEE IF DONE. 


IF(LLC+LLS—-3) 92592,100 
92 N=N+1 


C THIS IS THE BEGINNING OF THE ITERATION. 


AMANMNAANAAAAAAAND 


aa 


IF (N-MXN) 95 595599 
95 H=0.54H 
NST=2 
NSTOP=2*4N 
M=M+1 
GOTO 20 
99 EPS=-EPS 
100 CONTINUE 
1F (CLS 6GT 00 eAND oW oe LT 2020) 
RETURN 
END 


S=--S 


SUBROUTINE ENDT2 (PREVOT ,QUANT sEP Ss VALUE yL 9M) 


ENOT2 IS A SUBROUTINE THAT PERFORMS RICHARDSON EXTRA- 
POLATION OF THE VALUES "QUANT® WHICH ARE INTRODUCED INTO 
THE ROUTINE EACH TIME IT IS CALLED», EACH TIME WITH 
INCREASING VALUE OF 'M*, STARTING WITH M = 1. THE CURRENT 
VALUES ARE STORED IN THE ARRAY 'PREVOT', WHERE *PREVOT(1)! 
AT EXIT IS EQUAL TO *QUANT'. THE BEST VALUE FOR THE MOMENT 
IS GIVEN IN *VALUE*. ENDT2 REQUIRES THE PRESENT VALUE TO 
AGREE WITH THE PREVIOUS VALUE TD WITHIN EPS2, WHERE 

EPS2 = EPS*(1.0 + ABS(PRESENT VALUE)). 
EPS IS SUPPLIEO BY THE USER. 

THE ERROR EXPANSION IS OF THE FORM 
ERROR = C4#H #44 + CO*H**6 + COFH*HB + coe + CNHHEHN + wee 


DIMENSION PREVOT (7) sRICH(7) 


DATA RICH(1) / O0.0/, RICH(2) / 15.0/>% 
* RICH(3) / 6360/5 RICH(4) / 2550/5 
* RICH(5) / 1023.0/ RICH(6) / 4095.0/, 
* RICH(7) £16383 .0/ 

RICH(1) = 0 IS NOT USED 


RICH(K) = 2**(2*K) = ly 
TEMP 2=PREVOT (1) 
PREVOT (1) =QUANT 
TEMP 1 =QUANT 
IF(MeEQe1) GOTO 30 

20 REPS=EPS*(1.0+ABS (QUANT )) 
DO 23 K=2_—M 
DIFF=TEMP1-TEMP2 
IF(ABS(DIFFI-REPS) 25925522 
22 IF (KeEQ.8) GOTO 30 
TEMP1L=TEMPI+DIFF/RICH(K) 
TEMP2=PREVOT (K) 
PREVOT (K)=TEMP1 
23 CONT INUE 
GO TO 30 
25 L#2 
30 VALUE=TEMP1L 
RETURN 
END 
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Remark on Algorithm 418 [D1] 
Calculation of Fourier Integrals [Bo Einarsson, Comm. 
ACM 15 (Jan. 1972), 47-48] 


Bo Einarsson [Recd. 31 Jan. 1972] 
Research Institute of National Defense, Box 98, 
S-147 00 Tumba, Sweden - 


Key Words and Phrases: quadrature, Filon quadrature, 
integration, Filon integration, Fourier coefficients, Fourier 
integrals, Fourier series, spline, spline approximation, spline 
quadrature, extrapolation, Richardson extrapolation 

CR Categories: 5.16 


Algorithm 418 looks confusing since the first 12 lines of the 
Fortran listing have been lost at the printing. Another error is that 
the two formula lines in the description: are interchanged; the 
routine of course evaluates the general Fourier cosine and sine 
integrals. Finally, in the last line of the references, for ‘publica- 
tion,” read “publisher.”” The beginning of the algorithm is 


SUBROUTINE FSPL2 
* (F pA eB pF PA yFPB yF BA sF BB pW EPS oMAX yCoSelCeLlS) 


THIS ROUTINE COMPUTES THE FOURIER INTEGRALS 
“CSINTEGRAL F(X) COS WX OX FROM X=A TO. X=B 
S=INTEGRAL F(X) SIN WX DX FROM X=A TO.X=B 


WITH THE SPLINE PROCEDURE IN Be EINARSSON> NUMERICAL 
CALCULATION OF FOURTER INTEGRALS WITH CUBIC SPLINES, 
BITs VOL. 8s PP. 279-286, 1968. 


AANMNMARAANDND 


REPEATED RICHARDSON EXTRAPOLATION IS USED. 


Remark on Algorithm 418 [D1| 
Calculation of Fourier Integrals [Bo Einarsson, Comm. 
ACM 15 (Jan. 1972), 47-48] . 


Robert Piessens [Recd. | June 1973] 
Applied Mathematics and Programming Division, Uni- 
versity of Leuven, B-3030 Heverlee, Belgium 


The algorithm has been tested in double precision on an IBM 
370/155 with success. However, in the case that the Fourier cosine 
integral C and the Fourier sine integral S of the function F(x) are 
wanted simultaneously (LC and LS positive on entry), the efficiency 
can be improved, since each value of F(x) is then computed twice. 
This causes a considerable waste of computing time, which can 
easily be avoided by the following alterations: 

(i) insert statement 

FX = F(X) 

5 lines after statement 20. 

(11) replace statement 50 by 

50 SUMSIN = SUMSIN + FX*SIN(WX) 
and statement 60 by 

60 SUMCOS = SUMCOS + FX*COS(WX) 


418-P 3- RI 


COLLECTED ALGORITHMS FROM CACM 


Algorithm 419 


Zeros of a Complex 
Polynomial [C2] 


M.A. Jenkins 

Queen’s University, Kingston, Ontario, Canada 

and 

J.F. Traub* [Recd. 10 Aug. 1970] 

Department of Computer Science, Carnegie-Mellon 
University, Pittsburgh, PA 15213 


Key Words and Phrases: roots, roots of a polynomial, zeros of a 
polynomial 
CR Categories: 5.15 


Description 

The subroutine CPOLY is a Fortran program to find all the 
zeros of a complex polynomial by the three-stage complex algorithm 
described in Jenkins and Traub [4]. (An algorithm for real poly- 
nomials is given in [5].) The algorithm is similar in spirit to the 
two-stage algorithms studied by Traub [1, 2]. The program finds the 
zeros One at a time in roughly increasing order of modulus and 
deflates the polynomial to one of lower degree. The program is 
extremely fast and the timing is quite insensitive to the distribution 
of zeros. Extensive testing of an Algol version of the program, 
reported in Jenkins [3], has shown the program to be very reliable. 

The program is written in a portable subset of ANSI Fortran. 
It has been successfully used on the IBM 360/65, the GE 635 and 
the CDC 6600. The program is a translation of the Algol 60 pro- 
cedure cpolyzerofinder appearing in (3). 

MCON, the final subroutine of the program, sets four variables 
which describe the precision and range. of the floating point arith- 
metic being used. Instructions for setting MCON variables are given 
in the MCON comments. The algorithm will accept polynomials of 
maximal degree 49. 

The authors would like to thank K. Paciorek and M.T. Dolan 
for their assistance in preparing the Fortran version of the program 
and P. Businger and C. Lawson for suggesting improvements to the 
program. 


Copyright © 1972, Association for Computing Machinery, Inc. 
_ __ General permission to republish, but not for profit, an algorithm 
is granted, provided that reference is made to this publication, to 
its date of issue, and to the fact that reprinting privileges were 
granted by permission of the Association for Computing Machinery. 

* This work was done while J.F. Traub was at Bell Telephone 
Laboratories. 
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Algorithm 


SUBROUTINE CPOLY(OPReOPI,OEGREE.ZEROR,ZEROI es FAIL) 
FINOS THE ZEROS OF A COMPLEX POLYNOMIALS 
OPRe OPI ~ DOUBLE PRECISION VECTORS OF REAL AND 
IMAGINARY PARTS OF THE COEFFICIENTS IN 
OROER OF DECREASING POWERS. 
DEGREE - INTEGER DEGREE OF POLYNONIAL. 
ZERORe ZEROL - OUTPUT DOUBLE PRECISION VECTORS OF 
REAL AND IMAGINARY PARTS OF THE ZEROS. 
FAIL ~ OUTPUT LOGICAL PARAMETER, TRUE ONLY IF 
LEADING COEFFICIENT 1S ZERO OR IF CPOLY 
HAS FOUND FEWER THAN DEGREE ZEROS. 
THE PROGRAM HAS BEEN WRITTEN TQ REDUCE THE CHANCE OF OVERFLOW 
OCCURRING. IF IF DCES OCCUR, THERE IS STILL A POSSIBILITY THAT 
THE ZEROFINDER WILL WORK PROVIDED THE OVERFLOWED QUANTITY IS 
REPLACED BY A LARGE. NUMBER. 
COMMON AREA 
COMMON/GLOBAL/PRePI sHReH1eQPReQP Ep QHReQHI sSHRe SHI e 
* SReSTeTReTlsPVRePVIy ARE MRE, ETA, INFINSNN 
DOUBLE PRECISION SReSIyTReTIyPVRePVI pAREeMRESETAg INFINy 
* PR(50)_¢P1(50) sHR(50) pHI (50), QPR(50) eQP11503 ,QHRISO), 
* QHI (50) 2 SHR (50) »SHI(50) 
TO CHANGE THE SIZE OF POLYNOMIALS WHICH CAN BE SOLVED, REPLACE 
THE DIMENSION OF THE ARRAYS IN THE COMMON AREA~ 
OOUBLE PRECISION XXsVY¥~eCOSReSINRe SMALNOs BASE oXXXeZReZ Ie BNDe 
* OPRI1)sOPI(C1L) sZERORC LP SZEROI 1 )e 
* CMOO, SCALE, CAUCHY, DSORT 
LOGICAL FAIL,»CONV 
INTEGER DEGREEs CNT1eCNT2 
INITIALIZATION OF CONSTANTS 
CALL MCON(ETA,INFIN, SMALNO, BASE) 
ARE = ETA 
MRE = 2.0D0*0SQRT(2.000) *ETA 
XX = 270710678 
YY = -XX 
COSR = —.060756474 
SINR = 299756405 
FAIL = FALSE. 
NN = DEGREE+1 
ALGORITHM FAILS LF tte LEADING COEFFICIENT IS ZERO. 
IF (OPR(1) .Nee O.000 .OR- OPI(1) «NE. 0.000) 60 TO 10 
FAIL = .T“Ure 
RETURN . 
C REMOVE THE ZEROS AT THE ORIGIN IF ANY- 
10 IF COPREINN) -NE~ 02000 .OR. OPL(NN) .NE. 0.000) GO TO 20 
IDNN2 = DEGREE-NN+2 
ZEROR(IDNN2) = 0.000 
ZEROICIONN2) = 0.000 
NN = NN-1 
GO TO 1O 
C MAKE A COPY OF THE COEFFICIENTS. 
20 DO 30 I = 1,NN 
PROT) = OPRIT) 
PICT) = OPI(T) 
SHROUD) = CMOOCPRETDPICI)) 
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30 CONTINUE 
SCALE THE PCLYNOMIAL. 
BND = SCALE (NNeSHReETAs INF INsSMALNO,BASE) 
IF {BND .EQ. 1.000) GG TO 40 
0O 35 I = 1,NN 
PRCT) = BND*PR(T) 
PECL) = BND*PICT) 
35 CONTINUE 
START THE ALGORITHM FOR ONE ZERO. 
40 IF (NN.GT. 2) GO TO 50 
CALCULATE THE FINAL ZERO ANDO RETURN. 
CALL COLVID(-PR(2)9—-PI(2) ePRIL) PIC LI »eZEROR( DEGREE)» 
* ZEROL (OEGREE)) 
RETURN . 
CALCULATE BND, A LOWER BOUND ON THE MOOULUS OF THE ZEROS. 
50 00 60 ft = 1,NN 
SHR(T) = CMOD(PR(T),PICI)) 
60 CONTINUE : 
BND = CAUCHY (NNySHR,SHI) 
OUTER LOOP TO CONTROL 2 MAJOR PASSES WITH DIFFERENT SEQUENCES 
OF SHIFTS. : 
00 100 CNTL = 122 
FERST STAGE CALCULATION, 
CALL NOSHFT(5) 
(NNER LOOP TO SELECT A SHIFT, 
DO 90 CNT2 = 159 
SHIFT IS CHOSEN WITH MODULUS BND AND AMPLITUDE ROTATED BY 
94 DEGREES FROM THE PREVIOUS SHIFT. 
XXX = COSR*XX-SINR*YY 
YY = SINR*¥XX+COSR*EVY 
XX = XXX 
SR = BNO*XX 
SI = BND*YY 
CALCULATION, FIXED SHIFT. 
CALL FXSHFT(1LO*CNT2,ZReZ1-CONV) 
IF (sNOT. CONV) GO TO 80 
THE SECOND STAGE JUMPS DIRECTLY TO THE THIRD STAGE ITERATION. 
IF SUCCESSFUL THE ZERO IS STORED AND THE POLYNOMIAL DEFLATED. 
IDNN2 = DEGREE-NN+2 
ZEROR(TONN2) = ZR 
ZEROICIONN2) = ZI 


NO SHIFT. 


SECOND STAGE 


NN = NN-1 
DO 70 I = 1,NN 
PRT) = QPRCI) 
PICT) = QPICT) 
70 CONTINUE 
GO TO 40 
80 CONTINUE 
If THE ITERATION IS UNSUCCESSFUL ANOTHER SHIFT IS CHOSEN. 


90 CONTINUE : 
If 9 SHIFTS FAIL, THE OUTER LOOP IS REPEATED WITH ANOTHER 
SEQUENCE OF SHIFTS. 
100 CONTINUE 
THE ZEROFINDER HAS FAILED ON 
RETURN EMPTY HANOED. 
FAIL = .TRUE. 
RETURN 
END 
SUBROUTINE NOSHFT(LL) 
COMPUTES THE DERIVATIVE POLYNOMIAL AS THE INITIAL H 
POLYNOMIAL ANDO COMPUTES Li NO-SHIFT H POLYNOMIALS. 
COMMON AREA 
COMMON/GLOBAL/PRePI sHReHIgQPReQPIyQHReQHI sSHRe SHI 
* SReSTeTReTL»PVRePVI sAREe MRE ETAs INFINSNN 
DOUBLE PRECISION SReSTyTReTlePVRePVEZARE MRE SETA INF ING 
* PR(50)eP1(50) HRI50) -HEC5O) sQPR(50) yQP1 (50) eQHR( SO)» 
* QHI(50)5SHR(50) »SHE(50) 
DOUBLE PRECISION XNI»T1»T2eCMOD 
N = NN-1 
NML = N-1 
DO 10 I = 1,N 
XNI = NN-I 
HROE) = XNI*®PROETI/SFLOATIND 
HICT) = XNI*PICLS/SFLOATIN) 
10 CONTINUE 


DO 50 JJ = Lyell 
LF (CMODCHRUIN) pHIC(N) ) 
* GO TO 30 
CALL COLVID(-—PRENN) ePIC NN) sHROND SHION) p TR TID 
00 20 I = 1l»NML 
Jo= NN-I 
TL = HRIJ-1) 
T2 = HI(J-L) 
HRS) = TRETL-TI*T2¢PRUS) 
HI(Q J) = TRATZeTI*TLEP LCS) 
20 CONTINUE 
HROL) = PROLD 
HECL) = PIQL) 


TWO MAJOR PASSES. 


oLE. ETA*LOLODO®CMOD( PRIN) ePLIND)) 


GO TO 50 : 
{F THE CONSTANT TERM IS ESSENTIALLY ZERO, SHIFT H COEFFICIENTS. 
30 DO 40 [ = 1,NM1 : 
J = NN-I 


HROJ) = HRIJ-1) 
HI(J) = HI(CJ-L) 
40 CONTINUE 
HR(L) = 0,000" 
HI(1L) = 0.000 
50 CONTINUE 
RETURN 
END 
SUBROUTINE FXSHFT(L2,ZReZEe CONV) 
COMPUTES L2 FIXED-SHIFT H POLYNOMIALS AND TESTS FOR 
CONVERGENCE. 
INITIATES A VARIABLE-SHIFT ITERATION AND RETURNS WITH THE 
APPROXIMATE ZERO IF SUCCESSFUL. 
L2 - LIMIT CF FIXED SHIFT STEPS 
ZReZI — APPROXIMATE ZERO IF CONV IS .TRUE. 
CONV — LOGICAL INDICATING CONVERGENCE OF STAGE 3 ITERATION 
COMMON AREA 
COMMON/GLOBAL/PRePEsHReH1yQPReQP Ie QHReQHI » SHR» SHI 9 
* SReSTeTReTLePVRePVI», ARE sMREyETAy INFIN NN 
DOUBLE PRECISION SReSiyTReTlyPVRePVI ARE yMREvETAg INFINe 
* PR(50)eP 1050) pHRI50) HE (50) eQPRI50) sQP1 (50) yQHRI 50)» 
* QHI (50) » SHR (50) eSHI(50) 
DOUULE PRECISION ZReZI,OTReOTIySVSRye SVSI eCMOD 
LOGICAL CONVe TEST sPASD0,B00L 
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N = NN-1 
C EVALUATE P AT S. 
CALL POLYEVINNe SReSTePRePl ee OPReQPI ePVRyPVE) 
TEST = «TRUE. 
PASD = FALSE. 
C CALCULATE FIRST T = -P(S)/HIS). 
CALL CALCT(BO0L) 
C MAIN LOOP FER ONE SECOND STAGE STEP. 


00 50 J = Isl2 
OTR = TR. 
oT! = TI Z 
C COMPUTE NEXT H POLYNOMIAL ANDO NEW FT. 


CALL NEXTH(BOOL) 

CALL CALCT(BOOL) 

ZR = SR*+TR 

ZI = SI¢TI 
TEST FOR CONVERGENCt UNLESS STAGE 3 HAS FAILED ONCE OR THIS 
IS THE LAST H POLYNOMEAL. 

IF ¢ 8N0L OR. -NOT. TEST EQ. L2) GO TO 50 

LF (CMOD(TR-OTR,TI-OTI) .GE. 

IF (.NOT. PASO) GO TO 30 

THE WEAK CONVERGENCE TEST HAS BEEN PASSED TWICE, START THE 
THIRD STAGE ITERATIUN, AFTER SAVING THE CURRENT H POLYNOMEAL 
AND SHIFT. 


Oa 


eORe J 


Hon 


0O 10 I = IyN 


SHR(E) = HROI) 
SHI(T) = HICU) 
10 CONTINUE 
SVSR = SR 
SVSI = SI 
CALL VRSHFT(10,ZReZ1, CONV) 
IF ¢CONV) RETURN 
C THE ITERATION FAILED YO CONVERGE. 
C HeSePV AND T. 


TURN OFF TESTING AND RESTORE 


TEST = .FALSE. 
DO 20 I = 1l»N 
HRT) = SHRCID 
HICE) = SHICD) 
20 CONTINUE 
SR = SVSR 
SI = SVSI 
CALL POLYEVONNs SReSFePRePIeQPReQPESP VR, PVI) 
CALL CALCT(BOOL) 


GO TO 50 

30 PASO = .TRUE. 
GO TO 50 
40 PASO = «FALSE. 


50 CONTINUE ; 
C ATTEMPT AN ITERATION WETH FINAL H POLYNOMIAL FROM SECOND STAGE 
CALL VRSHFT( 10, ZR eZ eCONV) 
RETURN 
END 
SUBROUTINE VRSHFT(L3¢ZReZieCONV) 
CARRIES OUT THE THIRD STAGE ITERATION. 
L3 ~ LIMIT GF STEPS IN STAGE 3. 
ZReZl — ON ENTRY CONTAINS THE INITIAL ITERATE, 
ITERATION CONVERGES IT CONTAINS THE FINAL ITERATE 
ON EXIT. 
CONV - «TRUE. IF ITERATION CONVERGES 
COMMON AREA 
COMMON/GL OBAL/Pk, PI eHReH1 se QPReQPT sQHRsQHI» SHR» SHI 
* SReST,TReTlePVRePVIg ARE MRED ETAs INFIN,NN 
DOUBLE PRECISION SReSEyTReT1ePVRePVI eAREeMREGETAs INF I Ny 
* PR(50) P1150) pHR(50) pHE(50),QPR(50) OPE (50) sQHRISO)s 
* QHI (50) 2SHR (50) eSHI(50) 


IF THE 


fel elswakaksis) 


DOUBLE PRECISION ZRoeZIeMPeMSy OMPsRELSTP eR1ly»R2yCMODsDSQRT » ERREVe TP 


LOGICAL CONV,B,BOOL 
CONV = FALSE. 
B = FALSE. 
SR = ZR 
SI = ZI 
C MAIN LOOP FOR STAGE THREE. 
DO 60 IT = lelt3 
C EVALUATE P AT S AND TEST FOR CONVERGENCE. 
CALL POLYEVENN, SReSTePRePleQPReQPlyPVRyPVI) 
MP = CMOD(PVR,PVI) 
MS = CMOD(SReSI) 
IF (MP .GT. 20. 0D0*ERREV(INN, QPR»QPI eMSy MPs ARE o MRED) 
* GO TO 10 
C POLYNOMIAL VALUE IS SMALLER IN VALUE THAN A BOUND ON THE ERR0R 
C IN EVALUATING Py TERMINATE THE ITERATION. 
CONV = TRUE. 
ZR = SR 
Zt= SI 
RETURN 
eEQ. 1) GO TO 40 
IF (8 OR. MP .LT.OMP 
* GO TO 30 
C ITERATION HAS STALLED. PROBABLY A CLUSTER OF ZEROS. DO 5 FIXED 
C SHIFT STEPS INTO THE CLUSTER TO FORCE ONE ZERO TO DOMENATE. 
TR - SELSTP 
8 = .TRUE. . 
IF (RELSTP «LT. ETA) TP = ETA 
Ri = DSQRTITP) 
R2 = SR*¥11.0D004R1LI-SIF#R1 
SI = SR*R14ST*(1.0004R1) 
SR = R2 
CALL POLYEVENNyg SReSIePRePIeQPReQPIsPVR, PVE) 
00 20 J = 195 
CALL CALCT(B00L)} 
CALL NEXTH(BOOL) 
20 CONTINUE 
OmMP = INFIN 


10 If (1 


eOR. RELSTP .GE. ~05D0) 


GO TO 50 
C EXIT IF POLYNOMIAL VALUE INCREASES SIGNIFICANTLY. 
30 IF (MP*.100 .GT. OMP) RETURN 
40 OMP = MP 
C CALCULATE NEXT ITERATE. 
50 CALL CALCT(800L) 
CALL NEXTH(BOQL?) 
CALL CALCT(BOOL?) 
IF (B00L) GO TO 60 
RELSTP = CMOD{(TReTI)/CMODISRsSI) 
SR = SR4#TR | 
St = SI¢Tl 


0 


eSO0*CMODIZR»ZI)) GI TI 40 
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60 CONTINUE 
RETURN 
END 
SUBROUTINE CALCT(BOOL} 


C COMPUTES T = -P(S)/H(S). 


C BOOL 


~ LOGICAL, SET TRUE IF H(S) 1S ESSENTIALLY ZERO. 


C COMMON AREA 


COMMON/GLOBAL/PReo PI sHReH1leQPR QP Te QHReQHis SHR» SHI» 

* SReSieTReTl eo PVRePVI gs ARESMREZETAy INFINgNN 

DOUBLE PRECISION SReSleTReTlePVRePVI,AREsMREGETAg INFING 
* PRI50),P1(50) sHR(50) eHI (50) ¢QPR( 50) ,QPT (50) eQHRI501,9 
* QHI (50) ,SHR(50) eSHI(50) 

DOUBLE PRECISION HVReHVIeCMOD 

LOGICAL ®OOL 

N = NN-1 


C EVALUATE H(S). 


CALL. POLVYEVONsSReSTeHRe He QHRQHIZHVReHVI) 
BOOL: = CMOD(HVReHVI) «LE. ARE*LO.0D0*CMODI(HRIN) sHI(N) ) 
IF (BOOL) GO TO 10 
CALL COLVID(-PVR,—-PVIZHVReHVIsTRe TID 
RETURN 
10 TR = 02000 
Ti = 0.000 
RETURN 
END 
SUBROUTINE NEXTH(BOOL? 


C CALCULATES THE NEXT SHIFTED H POLYNOMIAL . 


C BOOL = 


LOGICAL, IF eTRUE. HIS) IS ESSENTIALLY ZERO 


C COMMON AREA 


C 


C EVALUATES A POLYNOMIAL 


COMMON/GLOBAL/PRy» PI sHReHLsQPR,QP 1» QHReQHIE es SHRe SHI» 
* SReSieTRe TL pPVR»ePVI»yARESMREsETAe INFINGNN 
DOUBLE PRECISION SReSleTReTlePVR»PVIsAREsMREGETAs INFINg 
* PRI50)2P1(50) sHRI50) sHI(50),QPR( 50) ,0P1(50) ,QHRI50), 
* QHI(50).SHR(50),SHI(50) 
DOUBLE PRECISION TleT2 
LOGICAL BOOL 
N = NN-1 
NMI = N=-1 
IF (BOOL? GO TO 20 
DO 10 J = 29N 
Ti = QHR{(J-1} 
T2 = QHI(J-L} 


HROJ) = TRETL-TI*F24QPR(II) 
HICJ) = TRETZ4TI*TIFQP ECS) 
10 CONTINUE 
HROL) = QPRCL) 
HT@L) = QPECL) 
RETURN 
(F HOS) IS ZERO REPLACE H WITH QH, 
20 DO 30 J = 2,5N 
HROJ) = QHR(J-1) 
HECS) = QHICJ-L) 
30 CONTINUE 
HR(L) = 0.000 
HI(L) = 06.000 
RETURN 
END 
SUBROUTINE POLYEVOUNNsSReSIePReP i eQReQlyPVRePVI) 
P AT S BY THE HORNER RECURRENCE 


C PLACING THE PARTIAL SUMS IN Q AND THE CUMPUTED VALUE IN PV. 


ANnNaana 


an 


DOUBLE PRECISION PRINN} sPLINNIsQRINNDSQLICNN), 
* SReSTsPVRePVIeT 
QR(ILT) = PRL) 
QUCk) = PLL) 
PVR = QRO1) 
PVI = QUCL) 
O00 10 I = 2_9NN 
T = PVR¥SR-PVIFSI+PR(T) 
PVT = PVRESI+PVI*SRAPI(T) 
PYR = T 
QRCT) = PVR 
QLCI) = PVI 
10 CONTINUE 
RETURN 
END 
DOUBLE PRECISION FUNCTION ERREVENNeQReQT oe MS eMPeARE pMRED 


BOUNDS THE ERROR IN EVALUATING THE POLYNOMIAL BY THE HORNER 
KECURRENCE - 

QR»QI -— THE PARTIAL SUMS 

4S -MODULUS OF THE POINT 

mp -MODULUS OF POLYNOMIAL VALUE 

ARE» MRE -ERROR BOUNDS ON COMPLEX ADDITION AND MULTIPLICATIIN 


DOUBLE PRECISION QR(NN) QI (NN) MS »MP,ARE gMRE,E,CMOD 
E = CMOC(QRCL),Q101)) *MRE/CARE+MRE) 
DO LO IT = 1L,NN 
E = E*MS+CMOD(OR( Te QIC1)) 
10 CONTINUE 
ERREV = E*(ARE+MRE)—MP*MRE 
RETURN 
END 
DOUBLE PRECISION FUNCTION CAUCHY(NN,»PT2Q) 


CAUCHY COMPUTES A LUWER BOUND ON THe MODULI OF THE ZEROS OF A 
POLYNOMIAL - PT IS THE MOOULUS GF THE CUEFFICIENTS. 


COUBLE PRECISION QUNND gs PTINND sXe XMeF yDXeDF oe 
* DABS «DEXP,0LOG 
PTCNN) = —PT(NN) 


COMPUTE UPPER ESTIMATE OF BOUND. 


N = NN-1 
X = DEXP( (DOLOGI-PTINN)) - DLOG(PTCLI})/FLOATIN) ) 
(F CPT(N)D.EQ.0.C00) GO TO 20 


IF NEWTON STEP AT THE ORIGIN IS BETTER» USE IT. 


XM = -PTI(NN)/PTIN)D 
TF OXMeLToX) X=XM 


CHOP THE. INTERVAL (0,X} UNITL Fa20.- 


20 XM = X*.1D0 
F = PTL) 
DO 30 I = 2,NN 
F = F*xXmMePT(.) 
30 CONTINUE 
IF (F.LE. 0.000) GO TO 40 
xX = XM 
GO TO 20 
40 OX = X 


C DO NEWTON ITERATION UNTIL X CONVERGES TO TWO DECIMAL PLACES. 


naaaaann 


Cc 


c 


c 


OAHOAMAAAMANAGMeSAAAANN 
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50 IF (OABS(DX/X) eLE. -005D0) GO TO 70 
Q(l) = PTL) 
00 60 I = 2,NN 
QCr) = QCE-L)*X+PTC(T) 
60 CONTINUE 
F = Q(NN} 
DF = QL) 
DO 65 I = 2sN 
DF = DF*x+Q(T) 
65 CONTINUE 
OX = F/OF 
X = X-DX 
GO TO 50 
70 CAUCHY = X 
RETURN 
END 


0 


DOUBLE PRECISION FUNCTION SCALE(NNsPTyE TAs INF IN» SMALNOgeBA SE) 


RETURNS A SCALE FACTOR TO MULTIPLY THE COEFFICIENTS OF THE 
POLYNOMIAL. THE SCALING 1S DONE TO AVOID OVERFLOW AND TO AVOID 
UNDETECTED UNDERFLOW INTERFERING WITH THE CONVERGENCE 
CRITERION. THE FACTOR IS A POWER OF THE BASE. 

PT - MOOULUS OF COEFFICIENTS OF P 

ETAsEINFIN»SMALNOZBASE - CONSTANTS DESCRIBING THE 

FLOATING POINT ARITHMETIC. 


DOUBLE PRECISION PT(NN),ETAsINFINySMALNDs BASE sHI sl Oy 
* MAX oe MEIN» Xe¢SCyDSQURT ¢ DLOG 
FIND LARGEST AND SMALLEST MOOULI OF COEFFICIENTS. 
HI = DSQRTCINFIN) 
LO = SMALNO/ETA 
MAX = 0.000 


MEN = INFIN 
00 10 f = IyNN 
: xX = PT(I) 


IF (X «GT. MAX) MAX = X 
IF (X .NE. 0.000 «AND. XeLToMIN) MIN = X 
10 CONTINUE 
SCALE ONLY IF THERE ARE VERY LARGE OR VERY SMALL COMPONENTS « 
SCALE = 1.000 
IF (MIN .GEe LO «AND. MAX eLE. HI} RETURN 
X = LO/MEN 
TF (X .GT. 1.000) GO TO 20 
SC = 1.000/ (0SQRT (MAX) *DSOQORTEMIN)) 
GO TO 30 
20 SC = Xx 
TF CINFIN/SC .GT. MAX) SC = 1.000 
30 L = DLOG(ISC)/DLOG(BASE) + 2.500 
SCALE = BASE*#L 
RETURN 
END 
SUBROUTINE COIVID(ARsALs BReBloCReC1) 
COMPLEX DIVISION C = A/By AVOIDING OVERFLOW. 
DOUBLE PRECTSION ARsAlsBReBleCReCieRsOegTyINFIN» DABS 
IF (BR NE} 0.000 .ORe BI NE. 0.000) GO TO 10 
DIVISION BY ZERO» C = INFINITY. 
CALL MCON (Te INFINeTsT) 
CR = INFIN 
Ci = INFIN 
RETURN 
10 IF ({DABSCBR) 
R = BR/BI 
D = BI+R*BR 
CR = (AR*R+AL)/D 
Ci = (AI*R-ARID/D 
RETURN 
20 R = BI/BR 
DO = BRtER*BI 
CR = (AR*AL*#RI/D 
CI = (AI-AR*R)/0 
RETURN 
END 
DOUBLE PRECISION FUNCTION CMOD(RsI) 
MODULUS OF A COMPLEX NUMBER AVOIDING OVERFLOW. 
DOUBLE PRECISION RoTsARe Aly DABS: DSQURT 
AR = DABSE(R) 
AI = DABS(I) 
IF (AR «GE. AI) GO TO 10 
CMOC = ALI*DSQRT(LODOF(AR/ATL DE*2) 
RETURN 
10 IF (AR .LE. AT) GO TO 20 
CMOO = AR*DSORTULODO#(AT/AR) F¥2) 
RETURN 
20 CMOD = AR*DSQRT(2.0D0) 
RETURN 
END 
SUBROUTINE MCON(ETAsINFINYsSMALNO, BASE) 
MCON PROVIDES MACHINE CONSTANTS USED IN VARIOUS PARTS OF THE 
PROGRAM. THE USER MAY FITHER SET THEM OIRECTLY OR USE THE 
STATEMENTS BELOW TO COMPUTE THEM. THE MEANING OF THE FOUR 
CONSTANTS ARE - 
ETA THE MAXIMUM RELATIVE REPRESENTATION ERROR 
WHICH CAN BE DESCRIBED AS THE SMALLEST POSITIVE 
FLOATING-POINT NUMBER SUCH THAT 1.000 + ETA IS 
GREATER THAN 1.000. 


«GE. DABS(BI1)) GO TO 20 


INFINY THE LARGEST FLOATING-POINT NUMBER 
SMALNO THE SMALLEST POSITIVE FLOATING-POINT NUMBER 
BASE THE BASE OF THE FLOATING-POINT NUMBER SYSTEM USED 


LET T BE THE NUMBER OF BASE-DIGITS IN EACH FLOATING-POINT 
NUMBER(DOUBLE PRECISION). THEN ETA IS EITHER .5*0**(1-T) 
OR B**(1-T) DEPENDING ON WHETHER ROUNDING OR TRUNCATION 
IS uSED. 
LET M BE THE LARGEST EXPONENT ANDO N THE SMALLEST EXPONENT 
IN THE NUMBER SYSTEM. THEN INFINY IS {1-BASE**(-T))*BASE*#M 
AND SMALNO IS BASE**N. 
THE VALUES FOR BASEsTseMeN BELOW CORRESPOND TO THE {BM/ 360. 

DOUBLE PRECISION ETAs INFINY»ySMALN(s BASE 

INTEGER MeNoT 

BASE = 16.2000 

Tz 14 

M = 63 

N = -65 

ETA = BASE**(1-T) 

INFINY = BASE*(1.0D0-BASE**(-T) ) #BASE**(M-1) 

SMALNO = (BASE**(N+3))/BASE**3 

RETURN 

END 


COLLECTED ALGORITHMS (cont.) 


Remark on Algorithm 419 [C2] 
Zeros of a Complex Polynomial [M.A. Jenkins and 
J.F. Traub, Comm. ACM 15 (Feb. 1972), 97-99] 


David H. Withers [Rec. 9 Oct. 1972 and 14 May 1973] 
IBM, Essex Junction, VT 04352 


The published algorithm has performed satisfactorily for all 
except one (degenerate) case. When removing zeros at the origin, 
the algorithm does not stop if all roots have been located. An error 
will occur if the polynomials, ¥” = 0 or ay = QO are given to the 
algorithm. The difficulty may be avoided by inserting after state- 
ment 40 the statement 


IF (NN.EQ. 1)RETURN 


The referee pointed out the second type of degenerate case above 
and two typographical errors: 

1. In the initialization of constants section COSR should be 
initialized by COSR = —.069756474. 

2. Inthe FUNCTIONS SCALE and CMOD, the declaration of 
DSQRT as DOUBLE PRECISION was accidentally typed as 
DSQURT. . 


419-P 4- 
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COLLECTED ALGORITHMS FROM CACM 


Algorithm 420 


Hidden-Line Plotting 
Program [J6] 


Hugh Williamson [Recd. 4 March 1970 and 4 Feb. 1971] 
Tracor Computing Corporation, Austin, Texas 


Key Words and Phrases: hidden-line plot, surface plot 
CR Categories: 4.9 


Description 

HIDE produces a two-dimensional representation of a surface 
or figure by plotting segments of a succession of curves; each curve 
is plotted where it is not hidden by any of the curves previously 
plotted (that is, where it does not fall below any of them as they 
appear in the two-dimensional representation). 

The calling sequence is described in some detail in the comment 
cards at the first of the subroutine. 

The following are options: 

(1) Translate the arrays before plotting to simulate stepping in the 
depth dimension. 

(2) Draw any of the following: an 84 by 11 inch border, axes, and 
a title. (Whether this option is exercised or not, labeling may be 
added by the calling program.) 

(3) Draw the unhidden part of the underside of a figure. In this 
case, the lines are assumed to be hidden where they fall above those 
previously plotted. This option together with the program’s capa- 
bility to plot the visible maximum can be used to represent the 
unhidden areas of both the top and the underside of a surface. This 
can be done by plotting all visible segments of each successive curve, 
beginning with the farthest in the foreground, as in the exemplary 
driver routine that produces the graph titled Test for plotting 
routine HIDE. Or all the segments to represent the top of the 
surface can be drawn first, and then all the segments to represent 
the underside. The method used in the driver routine listed is ad- 
vantageous in that only one of the curves to be plotted must be 
stored at a time, but it is disadvantageous in that two sets of working 
arrays are required, 

Explicit provisions are not made in HIDE for perspective plots 
or for rotations. If, however, the arrays to be plotted are properly 
transformed before HIDE is called, such effects can be achieved. 

The arrays XG, G, XH, and H must be dimensioned in the 
calling program. G vs. XG is the visual maximum function; that is, 
after the first » — 1 curves have been plotted, G vs. XG is the func- 
tion such that the nth curve falls below one or more of the first 
n — 1 curves (as they appear in the two-dimensional graph) at 
exactly the same points where it falls below G vs. XG. (Thus the 
intersections of the nth curve with G vs. XG are endpoints of inter- 
vals within which the nth curve is entirely hidden or entirely visible.) 

The number cf points used in arrays G and XG after n curves 
have been. plotted is the sum of: 
(1) the number of original data points of any of the first 7 curves 
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that lie on the curve G vs. XG, 

(2) the number of intersections of different curves that lie on G vs. 
XG (if the kth curve coincides with the maximum function of the 
first k — 1 curves over an interval, every data point of the kth curve 
with an abcissa within that interval is considered an intersection), 
and 

(3) the number of points needed to simulate discontinuities in the 
maximum function; this number is no greater than four times the 
number of curves to be plotted for the graph. 

An adequate dimension for XG and G and for the working 
arrays XH and A is an upper bound for the number of points that 
will be needed for the visual maximum function. 

Developed on Tracor Computing Corporation’s UNIVAC 1108 
system, HIDE calls several basic systems plotting routines. In the 
listing, these calls are preceded by comment cards with asterisks 
across the lines. If HIDE were to be used on a different com- 
puter system, calls could be substituted to the corresponding 
routines of that system, or, if more flexibility were desired, to 
user-supplied routines. 

Although partially explained in comment cards, the calling 
sequences of systems routines called by HIDE will be described 
more fully here. On TCC’s system, these routines write pen codes 
on magnetic tapes, which are used to drive offline drum plotters. 


PDATA(X,Y,N,J,L,XMIN, DX, YMIN,D Y,HT) 

This routine plots curves. 

X is the abcissa array. 

Y is the ordinate array. 

N is the number of points (X(J), Y(I)) to be plotted. 

| J | is the number of data points from plotted symbol to plotted 
symbol. If J = 0, a line plot will be produced. If J is negative, only 
the symbols will be plotted. If J is positive, both the line and the 
symbols will be plotted. 

L specifies the symbol to be plotted (the table correlating values 
of L with symbols would be of interest only to users of TCC’s 
system). 

XMIN is the x value at the plotting reference point, which is 
the origin for plotting pen movements (but not necessarily the data 
origin). 

DX is the x increment per inch for the plot. 

YMIN is the y value at the plotting reference point. 

DY is the y increment per inch. 

HT is the height in inches of the syrnbols. 


MOVPEN(X, Y,]) 

(X,Y) is the point in inches relative to the reference point to 
which the plotting pen is to be moved. 

|| = 1 if the pen is to be left as it was prior to this call (up 
or down). 

[1] = 2 if the pen is to be placed down before movement. 

| 1 | = 3 if the pen is to be picked up before movement. 

If I is negative, (X,Y) will become the new reference point. 
Other options exist which are not used by HIDE. 


PSYMB(X,Y,HT,T,TH,N) 

This routine plots alphanumeric information. 

(X,Y) is the position in inches relative to the reference point 
of the lower left-hand corner of the first symbol. 

HT is the height of the symbols. 

T is the starting location in core for the information to be 
plotted. 

TH is the angle in degrees counter-clockwise relative to hori- 
zontal at which the symbols are to be plotted. 

N is the number of symbols to be plotted. 
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PAXIS(X, Y,T7,N,S,TH,FMIN,DF) 

This routine draws and labels a linear axis. 

(X,Y) is the point in inches relative to the plotting reference 
point of the beginning of the axis. 

T is the starting location in core for a title for the axis. 

| N | is the number of characters in the title. If N is negative, 
the labeling will be on the clockwise side of the axis; otherwise, on 
the counter-clockwise side. 

S is the length in inches of the axis. 

TH is the counter-clockwise angle in degrees relative to hori- 
zontal at which the axis is to be drawn. 

FMIN is the data value at the start of the axis. 

DF is the data increment per inch. FMIN and DF are necessary 
for labeling the axis. 


PLTOFF, which is not called by HIDE but is called by driver 
routines, writes the remaining information in the buffer on the plot 
tape and writes an end-of-file mark. 

If HIDE were to be used on a computer system with word 
length different from 36 bits, it is possible that EPS1, the relative 
abscissa increment used to simulate discontinuities in the visual 
maximum function, should be changed. EPS1, which is defined in 
a data statement near the beginning of the program, should be one 
or two orders of magnitude larger than the smallest recognizable 
relative difference in single precision floating point arithmetic. 

The helpful suggestions made by the referees for improving the 
capabilities of HIDE are greatly appreciated by the author. 


Algorithm 


Cc 
C THIS DRIVER ROUTINE F@R HIDE PRODUCES THE GRAPH TITLED 
C TEST FOR PLOTTING ROUTINE HIDE. 
Cc 
DIMENSION XC€150)s5YC150)5Y10150)5XG(500)5GC500)5XHC500) 
1 »HC500) »XG10500)5G1(€500)sTIC14) 
DATA NGsNGIsNsNlsNFNS»MAXDIMs XMIN»s DELTAXs YMINs DELTAYs 
1 XLNTHs YLNTHs XX/09 -3s 1502-1505 26850020091 -05s—10s 
2 ©67Ts bee Ber 30 LAIS92ES4/ 
EQUIVALENCE (XHC1)sTIC1)) : 
READ 1t»TI 
1 FO@RMATC13A6sA2) 
STEP = 3+141592654/74-5 
XC1) = Oo 


D@ 21 = 2sN 
XCD) = XCI1-1)+STEP 
2 YICI)D = eQeSINCXCI)) 
STEP = 3+141592654/12-5 


THE CALLS T@ HIDE NECESSARY T@ PL@T THE TOP AND BOTTOM 
CUNDERSIDE) GF A SURFACE ARE MADE IN THE F@LLOWING LOOP. 


aaaga 


D9 3 I = 1sNFNS 
Cz = C@S¢Z2) 
D® 4 J = 1sN 
4 Y¥OSD B VICII*CZ=CEXPC~ CXC I) “XX KA CZ-XX) RRO) H 
1 COSC1eTS¥C CXC I) XX) HHBt(ZOXX) HRD) IELTS 


PLOT THE PART OF THE ITH CURVE THAT LIES @N THE UNHIDDEN 
PART OF THE TOP OF THE SURFACE. : 


aaaa 


CALL HIDECXsYsXGs Gs XHsH»sNGsMAXDIMs No NFNS» TL» XLNTHs 
1 YLNTH»s XMIN»s DEL TAX» YMINs DELTAY) 


PLOT THE PART OF THE ITH CURVE THAT LIES ON THE UNHIDDEN 
PART OF THE UNDERSIDE @F THE SURFACE. 

CN@TE+ IF PART OF THE UNDERSIDE FALLS BEL@W YMINs BUT 
STAYS WITHIN THE DESIRED AREA @N THE PLOTs HIDE WILL STILL 
PERFORM THE PLOTTING C@RRECTLY-~) 


aaanaanaaa 


CALL HIDECXs Ys XG12G1s5XHsH»NG1»MAXDIMsN1»s Os 6GHNOTTLED 
1 XLNTHs YLNTH»s XMIN»s DELTAX» YMINs DELTAY) 
3 Z = Z+STEP 


CALL SYSTEMS ROUTINES TO MOVE THE PEN OFF THE GRAPH T@ THE 
RIGHT AND TO TERMINATE THE PLOT. 


aaaa 


CALL MOVPENC10+2-2+3-3) 

CALL PLTOFF 

END 

SUBROUTINE HIDE (Xe¥9XGeGeXHeHeNGeMAXDIMeNLeNFNS TITLE, 
1 XUNTHs YENTHs XMIN» DEL TAX s YMIN,DELTAY) 


THIS SUBROUTINE PRODUCES A 2-DIMENSIONAL REPRESENTATION OF 
A 3-DIMENSICNAL FIGURE OR SURFACE. 

THE FIRST CALL TO HIDE IS FOR INITIALIZATION AND PLOTTING 

THE CURVE FARTHEST IN THE FOREGROUND. ON EACH SUBSEQUENT 

CALL» A CURVE FARTHER IN THE BACKGROUND IS PLOTTED. 


ieislelalels! 


AAMOMNHAAAAMNAHAAGAHNHOAAAAAAAANAAAAN AGES AOGOAHAAGADANMNAAGAAANAHAANAADHOANDHAAGDAOAAAAAAHOAKAOAAGAANAAAAAAND 


aaa aaa 
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X IS THE ABCISSA ARRAY FOR THE CURVE TO BE PLOTTED BY 

HIDE ON THIS CALL. THE X VALUES MUST BE INCREASING. 

1f XC) GE XCE#L) FOR SOME I, MAXDIM WILL BE SET TO ZERO, 

AND A RETURN WILL BE EXECUTED. 

Y 1S THE ORCINATE ARRAY. 

G VSe XG 1S THE CURRENT VISUAL’ MAXIMUM FUNCTION ON EACH 

RETURN FROM HIDE. 

XH AND H ARE WORKING ARRAYS. 

ON EACH RETURN FROM HIDE, NG 1S THE NUMBER OF POINTS IN 

THE CURRENT MAXIMUM FUNCTION. 

ON THE FIRST CALL. NG IS A NONPOSITIVE INTEGER WHICH 

SPECIFIES CERTAIN OPTLONS. 

~1 DO NOT DRAW THE & 1/2 BY 11 INCH BORDER. 

2 PLOT UNHIDDEN MINIMUM RATHER THAN MAXIMUM. IN THIS 
CASEe G VS» XG WILL BE THE NEGATIVE OF THE VISUAL 
MINIMUM FUNCTION. 

-3 DO NOT PLOT BORDER, PLOT MINIMUM RATHER THAN MAXIMUM. 

© PLOT BORDER,» PLOT MAXIMUM. 

IF THE BORDER IS DRAWN, ITS LEFT. BOTTOM CORNER WILL BE 

WHERE THE PLOTTING REFERENCE POINT WAS JUST BEFORE THE 

FIRST CALL TO HIDE, AND THE REFERENCE POINT WILL BE MOVED 

1 INCH RIGHT AND 2 INCHES UP. IF THE BORDER IS NOT DRAWNy 

THE REFERENCE POINT WILL NOT BE MOVED BY HIDE. 

MAXDIM IS THE DIMENSION IN THE CALLING PROGRAM OF THE 

ARRAYS XGe Ge XHy AND He. IF ONE OF THESE ARRAYS WOULD 

HAVE BEEN OVERFLOWED, MAXDIM IS SET EQUAL TO ITS NEGATIVEs 

AND A RETURN IS EXECUTED. 

N1 IS THE NUMBER OF POINTS (X(1)eY(1)) TO BE PLOTTED IN 

A GIVEN CALL TO HIDE. 

IF NL IS LESS THAN O» Y VS. X WEILL NOT BE PLOTTED, BUT ON 

SUBSEQUENT CALLS» PLOTTING WILL BE DONE AS IF 

COXCDD YUL) 2, L2Le-N1L) HAD BEEN PLOTTEO (WHERE UNHIDDEN). 

N1 WILL BE RETURNED AS ITS ABSOLUTE VALUE. 

NENS IS THE TOTAL NO. OF CURVES TO BE PLOTTED FOR THIS 

GRAPH IF TRANSLATING THE ARRAYS TO SIMULATE STEPPING IN 

THE DEPTH DIMENSION IS DESIRED. IF NO TRANSLATION IS 

DESIRED, NFNS SHOULD BE NEGATIVE. IF THE SAME TRANSLATION 

AS IN THE PREVIOUS CALL TO HIDE IS DESIRED, NFNS SHOULD BE 

ZERO. THE NFNS=0 OPTION MAY BE SPECIFIED FOR INDIVIQUAL 

CURVES AFTER THE FIRST FOR A GIVEN GRAPH. ALL 

TRANSLATIONS WHICH ARE PERFORMED WILL HAVE EQUAL STEP SIZE 

DETERMINED BY THE VALUES IN THE INITIAL CALL FOR XLNTHs 

YUNTHe AND NFENS. 

TITLE IS AN 80-CHARACTER TITLE. 

1f TITLE€1)=6HNOTTLE, THE TITLE WILL. NOT BE PLOTTED. 

TITLE(2) ANC XHC(L) OR HCL) MAY BE THE SAME LOCATION IF THE 

TITLE 1S NOT NEEDED AFTER IT IS PLOTTED. 

XLNTH IS: THE LENGTH IN INCHES OF THE HORIZONTAL AXIS. 

If XUNTH IS LESS THAN O» THE X-AXIS AND THE DEPTH AXIS 

WILL NOT BE DRAWN. IN ANY CASE UNLESS THIS OPTION IS 

SUPPRESSED THROUGH NFNSs THE ITH CURVE WILL BE TRANSLATED 

(1-1}*#(9.-ABS(XLNTHD)/(NFNS-1) INCHES TO THE LEFT. THIS 

PLUS A SIMILAR VERTICAL TRANSLATION IS DONE TO SIMULATE 

STEPPING IN THE DEPTH DIMENSION. 

XMIN-(9.-ABS(XENTH) )*®DELTAX WILL BE THE ABCISSA VALUE AT 

THE PLOTTING REFERENCE POINT (WHICH IS WHERE THE 

HORIZONTAL AND VERTICAL AXES WOULD INTERSECT IF ORAWN). 

YLNTH IS THE LENGTH OF THE VERTICAL AXIS IN INCHES. 

IF YLNTH. 1S LESS THAN 0, THE VERTICAL AND DEPTH AXES WILL 

NOT BE DRAWN. BUT IN ANY CASE, UNLESS THIS OPTION IS 

SUPPRESSED THROUGH NFNS, THE ITH CURVE WILL BE TRANSLATED 

(T=1)*(60-ABS (YLNTH))/(NENS-L) INCHES UP TO SIMULATE 

STEPPING. IN THE DEPTH DIMENSION. YMIN-(6.-ABSCYLNTH))* 

DELTAY WILL BE THE OROINATE VALUE AT THE PLOTTING 

REFERENCE PCINT. 

IF TRANSLATIONS ARE PERFORMED, X AND Y WILL BE RESTORED TO 

THEIR ORIGINAL VALUES BEFORE THE RETURN TO THE CALLING 

PROGRAM. 

NOTE THAT IF ABS(XLNTH)=9_ AND ABSUYLNTH)=6, THERE WILL BE 

NO TRANSLATION, AND» IF BORDER AND AXES ARE NOT DRAWN, THE 

DIMENSIONS CF THE PLOT ARE UNSPECIFIED. 

If THE AXES AND BORDER ARE DRAWN, THE TOP OF THE VERTICAL 

AXIS AND THE RIGHT END OF THE HORIZONTAL AXIS ARE FIXED 

RELATIVE TO THE BORDER, AND THE DEPTH AXIS JOINS THE LEFT 

END OF THE HORIZONTAL AXIS ANO THE BOTTOM OF THE VERTICAL AXIS. 

XMIN IS A LCWER BOUND FOR X. 

OELTAX IS THE X DATA INCREMENT PER INCH FOR THE PLOT. 

XMEN AND DELTAX DETERMINE THE PLOTTING SCALE FOR X. 

(SEE ABOVE.) 

YMEN AND DELTAYs SIMILARLY, DETERMINE THE SCALE FOR Y. 

IF AN ERROR RETURN IS MADE FROM HIDE, ALL FURTHER CALLS 

WELL RESULT ONLY IN THE EXECUTION OF A RETURN UNLESS 

MAXDIM IS RESET TO A POSITIVE VALUE. 


DEMENSTCN XCLIsYCL) eXGOLDeGUL I eHC LI SXHCLIZTITLECL) 
INTEGER TITLE 


THE ONLY PURPOSE OF THE FOLLOWING EQUIVALENCE STATEMENT IS 
TO SAVE STORAGE. 

EQUIVALENCE (Khe EWHICH)s (K2e¢SLOPE), (FNSMLeZ1)5 

1 CIGGP1L»KL), (Kl N2) 


EPS1 IS THE RELATIVE ABCISSA INCREMENT USED TO SIMULATE 
DISCONTINUITIES IN THE MAXEMUM. FUNCTION. 

DATA. EPS1/.000001/ 

DATA NOTTLE/6HNOTTLE/ 


THE FOLLOWING STATEMENT FUNCTION COMPUTES THE ORDINATE ON 
THE LINE JOINING (XI,YI) AND (XIP1,YIPL) CORRESPONDING TO 
THE ABCISSA XX. 
FUXXeXL eV eXIPLeYIPL) = VEe(XX-XEP*(VIPI-VID/(XIPI-XI) 
TF(MAXDIM.LE.0) RETURN 
DO 71 I = 2yeNl 
TFCXCI-1) LT X00} GO TO 71 
MAXDIM = 0 
GO Tc 75 
71 CONTINUE 
{FPLOT = 1 
fF(NL-GT.0) GO TO 76 
NL = -N1 
IFPLOT = 0 
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76 IFING.GT.0) GO TO 5000 
IF(NL+4.LE,HAXDIM) GO TO 74 
MAXDIM = -MAXDIM 
75 RETURN 
Cc 
C WE WANT SIGN = 1 IF WE ARE PLOTTING MAXIMUM, = ~1 IF 
C MINIMUM. 
74 SIGN. = 1a 
TFEC(NGeLT.-1) SIGN = le 


Cc 
C THE KTH CURVE TO BE PLOTTED WILL (OPTIONALLY) BE 
C TRANSLATED BY THE VECTOR (~DXINeDVINI*(K-1) TO SIMULATE 
C STEPPING IN THE DEPTH OI[MENSION. 
LFINFNS.LE.0) GO TO 46 
FNSML = NENS-1 
OXIN = (9.-ABS(XLNTH) *OELTAX/FNSM1L 
DYIN = (6e-ABS(YLNTH) )#OELTAY/FNSML 
c 
C SYSTEMS ROUTINE MOVPEN MOVES THE PEN TO A POINT WHOSE 
C COGROINATES ARE SPECIFIED IN INCHES BY THE FIRST TWO 
C PARAMETERS. THE PEN IS PICKED UP IF THE ABSOLUTE VALUE OF 
C THE THIRD PARAMETER IS 35 IS PUT OOWN IF 2» AND IS LEFT AS 
C AFTER LAST CALL IF 1. IF THE THIRD PARAMETER 1S NEGATIVE? 
C A NEW REFERENCE POINT WILL BE ESTABLISHED. 
46 TFINGSEC.-1.OReNGEQ.-3) GO TO 41 
c 
C ORAW 8 1/2 BY LL INCH BORDER. 
C ERERAERERAR AREER ARE EERE REREEREREREEAOEEREEEEAEREEEE SEE E EES 


CALL MOVPEN(L1. 0002) 
CALL MOVPEN( 11. 98.501) 
CALL MOVPEN(Oce8e5el) 
CALL MOVPEN( Oss 0cy1) 

CALL MOVPEN( Leg 2009-3) 


CALL SYSTEMS ROUTINE TO PLOT THE 80-CHARACTER TITLE. 
THE FIRST TnO ARGUMENTS ARE THE COORDINATES IN INCHES 
RELATIVE TO THE REFERENCE POINT OF THE LOWER LEFT-HAND 
CORNER OF TKE FIRST CHARACTER. THE THIRD ARGUMENT 
DETERMINES THE HEIGHT IN INCHES OF THE CHARACTERS. THE 
FIFTH ARGUMENT GIVES THE ANGLE RELATIVE TO HORIZONTAL OF 
THE PLOTTED CHARACTERS. 
ARERR EERE HARES ANS KEE SSE ERASE EEE EEEEEEEREREEEESE 
41 IF(TITLECL) «NEJNOTTLE) CALL PSYMB(-.285-Leyel4e 
1 TITLE, 0.580) 
IF(XUNTHLLT.0.) GO TO 42 


OaAOAAHAOAaAND 


CALL SYSTEMS ROUTINE TO DRAW THE HORIZONTAL AXIS. THE 

LEFT END {S SPECIFIEO IN INCHES RELATIVE TO THE REFERENCE 

POINT BY THE FERST TWO ARGUMENTS. 

FERERERER EEE AEA EERE EERE EERE EERE EE SE EEEEE EEE SEERA SEES 
CALL PAXIS(9.-XLNTHsOeeLH 9—LeXLNTHs Ons XMIN DEL TAX) 
IFCYLNTF.LT.0.) GO TO 43 


aAaAaAana 


ORAW THE DEPTH AXIS. 
FETE AED EEE EE ERK EERE EER ER ERE RERE SESE EER ESTE REE EES E 
CALL MOVPEN(7.~XUNTHe Ooe 3} 
CALL MOVPEN(0.96.—-YLNTHs2) 
42 TFCYLNTK.LT.0.) GO TO 43 


aaa 


Cc 
C DRAW THE VERTICAL AXIS» THE BOTTOM POINT IS SPECIFIED IN 
C INCHES RELATIVE TO THE REFERENCE POINT 8Y THE FIRST TWO 
C ARGUMENTS. 
C EERE REED KEE HED EERE ARE EEREREEER ESE KEE EEE EEE ERES ESTE 

CALL PAXIS(0.¢6e-YLNTHe LH 2 Le YENTHs, 9002 YMENe DEL TAY) 
c 
C CURVES SUCCESSIVELY FARTHER EN THE BACKGROUND WILL BE 
C PLOTTED WHERE THEY ARE NOT HIDDEN BY G VS. XG. G VS XG 
C WILL BE UPDATED EACH TIME A NEW CURVE IS ORAWN AND WILL BE 
C THE VISUAL MAXIMUM (OR MINIMUM) FUNCTION OF THE CURVES 
C ALREADY PLOTTED. 

43 INDEXT=3 
00 3 3 = LeNl 
XGUINDEXT) = X(J) 
GCINCEXT) = SIGN*FY(J) 
3 ENDEXT = INDEXT+1 

c 
C THE FOLLOWING PRECAUTIONARY STEP IS USED IN PLACE OF A 
C TEST IN SUBROUTINE LOOKUP TO SEE IF THE VALUE FOR WHICH WE 
C WANT AN INDEX IS OUTSIDE THE TABLE. 
C THE LAST XG VALUE WELL 8E SET EQUAL TO THE LAST ABSCISSA 
C OF THE CURVE TO BE PLOTTED IN THE NEXT CALL TO HIDE. 

EPS = EPS1*(ABS (XMIN) 4ABS(DELTAX)) 

NG = N14+4 

XGOL) = -FNSML*DXIN#XMIN-ABS( XMIN)-ABSUXG(3))-Le 

XG(2) = XG(3)-EPS 

XGENL#3) = XGEN1L42)4EPS 

ZZ=YMIN 

IF(SIGN.LT.0.) 22 = -YMIN-50. *DELTAY 

Gtlk = 22 

G2) = 72 

GUN1L4+3) = 22 

GING) = 22 
Cc 
C CALL SYSTEMS ROUTINE TO PRODUCE A LINE PLOT OF 
C (XU eVEL)eT=LeN1) - THIS ES THE CURVE FARTHEST IN THE 
C FOREGROUND. 
C XSTART ES THE X VALUE AT THE REFERENCE POINT. 

XSTART = XMIN-(9.-ABS(XLNTH) ) *DELTAX 
c 

LEC LFPLCTCEQe1) CALL PDATACXsYVeN1lyOeleXSTART eDELTAX, 

L YMINyDELTAY,» 07} 

DXKK = Ce. 

DYKK = C. 

RELINC = DELTAX/DELTAY 

XG(NG) = SIGN 

RETURN 


Cc 
C STATEMENT 5C00 IS REACHED IF ANY EXCEPT THE CURVE FARTHEST 
C IN THE FOREGROUND IS TO BE PLOTTED. 
5000 SIGN = XG(NG) 
XGING) = XENL) 
Cc 
C TRANSLATE TRE ARRAYS BEFORE PLOTTING TO SIMULATE STEPPING 
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C IN THE DEPTt DIMENSION. 
EFUNFNS) 52248949 
49 DXKK = CXKK+OXIN 
DYKK = CYKK4+OVIN 
48 00 4 J = IsNl 
YQ) = SIGN*(Y(J)4DYKK) 
4 XC) = XC JI-OXKK 
52 CALL LOCKUPUX(1) eXG(1)yJJ) 
1F(JJ.GEeMAXOIM) GO TO 700 
DO 31 J = lsJJ 
XH(J) = XGOJ) 
31 H(J) = GOS) 
1G = JJel 
XH(IG) = X(1) 
HUIG) = FOXOL De XGCII) GOSS) eXGUIG) oGUIG)) 


WE WILL BE MAKING TABLE LOOKUPS FOR AN INCREASING SEQUENCE 
OF NUMBERS - THEREFORE» WE DO NOT HAVE TO SEARCH FROM THE © 
FIRST OF THE (XG AND X) TABLES EACH TIME. HENCE INDEXG 
ANO INDEXT. 

TNOEXG = JJ 

INDEXT = 1 

z1 = x2) 

FL = HCIGI-YCL) 

IT = 2 

JJ 2 1G 

IF(HOCIG).GE-¥(1)) GO TO 32 

IF(JSeGEeMAXOIM) GO TO 700 

JJ = 1641 

H(JJ) = YCL) 

XH( JJ) = ZI+EPS 

32 LAST = C 
Xl = ZL 


aQOoan 


Cc 
C FINO THE FIRST ZERO, 222 OF THE FUNCTION G-Y TO THE RIGHT 
C OF Zhe 
1100 IFCXGCIG)LTXC1T)) GO TO 1001 
c 
C 0O NOT JUMP IF WE ARE TO LOOK FOR A “ERO BETWEEN Xi AND 
C KCE). 
IWHICH = 0 
X2 = X(IT) 
F2 = FUK2eXGCIG—-1)¢G(IG—-1l)eXGCIGieGCIG)I-YCIT) 
Iv = [T#l 
GO TO 1C02 


Cc 
C COME TO 1001 IF WE ARE TO LOOK FOR A ZERO BETWEEN X1 AND 
C XG(IG). 

LOOL X2 = XG(IG) 


IWHICH = 1 
F2 = GCIGI-F(X2,XC1T-L),YCIT-~L)sMCETISYCIT)) 
1G = I1G41 


THE FUNCTION (G-Y¥) HAS A ZERO Z2 SUCH THAT X1 LE d2 LE X2 
1f AND ONLY IF (G-¥ AT XL) * (G-Y AT X2) LE O. 
(G-Y IS ASSUMED, FOR PLOTTING PURPOSESe TO BE LINEAR ON 
EACH INTERV4L (XteX2).) 
1002 IF(FL*F2.G6T.0.) GO TO 1005 
SLOPE = (F2-FL)/€X2-X1L) 
IGG = 1G-1-IWHICH 
ItT = [1-2+1WHICH 
IF (ABS (SLOPE*RELINC).GT.~L-E-6) GC TO LOOT 


aAaAaanga 


c 
C IF G AND Y CEFFER IMPERCEPTIBLY (FOR PLOTTING PURPOSES) 
C ON THE INTERVAL (X1,X2)_— SET Z22=X2. THIS STEP PREVENTS 
€ DIVISION BY ZERO. 

Z2 = XK2 

GO TO 1006 


c 
C OTHERWISE, COMPUTE THE ZERO Z2. 
L007 Z2 = X1-F1/SLOPE 
GO TO 1C06 
Cc 
¢ TF NO ZERO WAS FOUND BETWEEN X1 AND X2, CONTINUE THE 
C SEARCH FOR ZEROES. 
1005 Xl = X2 
Fl = F2 
IFCITSLELNL) GO TG 1100 
c 
C If THE END CF THE X TABLE HAS BEEN REACHED, CONSIDER THE 
C INTERVAL FRCM THE LAST ZERO FOUND TO THE END OF THE X 
C TABLE (PLOT, UPDATE MAXIMUM FUNCTION AS INDICATED). 
1008 LAST = 1 
Z2 = X(N) 
CALL LOCKUP(Z2_XG(INDEXG), IGG) 
IGG = INDEXG+tIGG-1 
ITT = NI-1 


IT IS NECESSARY TO PLOT Y VS. X ON THE INTERVAL (Z1922) 
ONLY IF ¥Y 1S UNHIODEN AT EACH ZZ SUCH THAT 21 LT 22 LT Z2. 
we CHOOSE 22 NEAR THE LEFT ENC OF THE INTERVAL FOR 
EFFICIENCY IN THE TABLE LOOKUP. 
NOTE THAT IT IS MORE EFFICEENT TO CHOUSE THIS VALUE FOR Z2 
THAN: SAY, o9PFXCINDEXT D+ .OL*XCINDEXT¢#1) 5 WHICH WOULD 
ELIMINATE ONE OF THE TWO TABLE LOOKUPS, BUT WOULD 
NECESSITATE A TEST TO NETERMINE IF ZZ WAS BETWEEN 21 AND 
12. 
1006 22 = .99¥*Z1+.01L#22 

CALL LOCKUP(ZZ,XtINDEXT),K1) 

CALL LOCKUP(ZZ2+XGUINDEXG) sK2) 

KL = KL¢INDEXT-1 

K2 = K2+INDEXG-1 

TFCECZZeXCKEDSY(KL) aXCKL FLV CKLE1)) GT. 

L FOZZ eXG(K2)9GEK2) eXGCK241),G(K241))) GO TO 7 


AANANAMRAAAND 


c 
C IF Y IS HIOCEN BETWEEN Z1 AND Z2,y UPDATE THE MAXIMUM 
C FUNCTION 
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C FOR GENERALITY: THE MAXIMUM FUNCTION IS UPOATED EVEN IF 


c 


C 
Cc 
c 


OOaADADR aang eOaan 


MAOAAN 


Oana 


aanaa 


THIS 1S THE (NFNS)TH CURVE. 
1F(SJ4+1GG-INDEXG.GE-MAXDIM) GO TO 700 
IFC INDEXG.EQ.1GG) GO TO 712 
Jl = INCEXG41 
00 12 1 = JbeIGG 
JJ = JJ+1 
XH(JJ) = XGCE) 
l2 H(JJ) = GOD) 
712 JJ = JJel 
MH(JJI) = Z2 
H( JS) = FCZ2 XG 1GG),GC1GG),XG(IGG+l),GUIGG41)) 
INDEXG = IGG 
INDEXT = ITT 
GO TO 6C 


rf T TS NOT HIDDEN BETWEEN 21 AND Z2_, UPDATE THE MAXIMUM 
FUNCTION ANC PLOT. 
7 NGRAPH = ITT-INDEXT +42 
TFC JJ+NGRAPH-L2GT.MAXDIM) GO TO 700 
N22 JJ 
IFCNGRAPH.EQ.2) GO TO 9 
Jl = INCEXTe¢L 
00 1k £ = JLvITT 
JJ = JIL 
KH(JJ) = XCD) 
11 H(JJ) = VCE) 
9 JI = SIO] 
XH(JJ) = Z2 
HOSJ) = FUZZ eXCUTT) oVCLTT) XC ITT +L ye VCITT+1)) 


CALL SYSTEMS ROUTINE TO PRODUCE LINE PLOT OF 
(XH(T) sHOT) » T=N2sN2+NGRAPH-1). 
RERRREREREREKEERERE ERE RERE EERE RSE EERE EHEC ERE RERAEEE 
TFCLFPLCOTLEQ.1) CALL PDATA(XH(N2),HIN2) pNGRAPH) Opole 
1 XSTARTsDELTAX »SIGN#YMING 
2 STGN*DELTAY,.07} 


INDEXT = ITT 
INDEXG = IGG 
60 FF(LAST.EQ.1) GO TO 6l 


XL = X2 
FL = F2 
Zl = 22 


AFTER PLOTTING ANO/UR UPDATENG THE MAXIMUM FUNCTION ON THE 
INTERVAL (21222). SEARCH FOR THE NEXT ZERO IF THE END OF 
THE ABCISSA TABLE XT HAS NOT BEEN REACHED. 

TF(ITSLE-NL) GO TO 1100 

GO TO 1C08 


AFTER Y VS. X HAS BEEN PLOTTED, FINISH UPDATING AND STORE 
THE NEW MAXIMUM FUNCTION. 
ALLOW FUR TRE PUSSIBILITY FHAT THE PREVIOUS MAXIMUM 
FUNCTION EXTENDS TO THE RIGHT OF THE FUNCTION JUST 
PLOTTED. 
61 TF(XG(NG).LE.XG(NG-1)) NG = NG-1 
IF (XG(NG).LE.XENL)) GO TO 33 
IF (JJ#34NG-1GG.GT.MAXDIM) GO TO 700 
XH(JJ#1) = XH(JI)FEPS 
JJo2 JJ41 
HOI) = FOCONL) pXGCIGG) GL 1GG 9 XG4 16641) GU TGG41)) 
IGGPL = 1GG+1 
DO 34 J = IGGPL»NG 
JJ = Jel 
XH(JJ) = XGOJ) 
34 HISd) = GEN) 
33 NG # JJ42 
IF(NG.GTsMAXDIM} GO TO 700 
DO 3 I = LeJJ 
GOT) = HUE) 
13 MGT) = XHOD) 


iol 


THE FOLLOWING PRECAUTIONARY STEP IS USED IN PLACE OF A 
TEST IN SUBROUTINE LOOKUP TO SEE IF THE VALUE FOR WHICH WE 
WANT AN INOEX IS OUTSEOE THE TABLE. 
THE LAST XG VALUE WILL BE SET EQuAL TO THE LAST ABCISSA 
OF THE NEXT CURVE TO BE PLOTTED. 

XG(JSJ+L) = XGISIIFEPS 

GOJJ#1) = YMIN+UYKK 

TFCSIGN.LT.0.) GJUt1l) = -YMIN-50. *DELTAY#+DYKK 

GING) = G(JJ+1) 


RESTORE ARRAYS X ANC Y SEFORE RETURNING. 
66 IF(NFNS.LT.20) GU TO 53 ; 
0O 82 I = 1yNl 
MCE) = XC TPO CXKK 
82 VL) = STGN*Y(1)-DYKK 
53 XG(NG) = SIGN 
RETURN 


IF STATEMENT 700 IS REACHED, DIMENSIONS WOULD HAVE BEEN 
EXCEEDED. SEE COMMENTS ON CALLING SEQUENCE FOR HIDE. 
700 MAXDIM = -MAXDIM 

GO TO 6€é 

END 

SUBROUTINE LOOKUP(X »XTBL oJ) 


THIS SUBROUTINE IS CALLED BY HIDE TO PERFORM A TABLE 
LOOKUP. BECAUSE OF PRECAUTIONS TAKEN IN HIDE, A TEST TO 
SEE 1F x IS OUTSIDE THE TABLE IS UNNECESSARY. 


DIMENSICN XTBLOL) 
J= 2 
4 TFCXTBL(J)-X) 19203 
lv Jel 
GO TO 4 
2 RETURN 
3 J = J-l 
RETURN 
END 


Driver 


(a) Test for plotting routine HIDE. 


(b) Test case for HIDE. 


(c) Geometrical test case. 
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Remark on Algorithm 420 [J6] 
Hidden-Line Plotting Program [H. Williamson, Comm. 
ACM 15 (Feb. 1972), 100-103] 


1.D.G. Macleod and A.M. Collins ([Recd. 19 June 
1972] Department of Engineering Physics, Research 
School of Physical Sciences, Australian National Uni- 
versity, Canberra, A.C.T. 2600, Australia 


The number of point pairs to be plotted in subroutine HIDE is 
indicated by the magnitude of parameter N1. If N1 is less than zero, 
the visual maximum function is updated but no plotting is carried 
out. In this case, however, the construction 

DO 711 = 2,N1 

IF (X(I—1) .LT. X(1)) GO TO 71 

MAXDIM = 0 

GO TO 75 
71 CONTINUE 
is nonstandard and may lead to undesirable results. If the check for 
increasing X values is to be retained when N1 is negative, its abso- 
lute value should be used as the terminal value of the DO loop. 

In sections 8.3.2 and 10.1.3, ANSI Fortran [1] indicates that 
where X is an array there should be no distinction between the use 
of X and the use of X [1] as parameters in a procedure reference. 
Nevertheless, some Fortran implementations (and languages such as 
Algol and PL/I) make such a distinction, in which case subroutine 
LOOKUP and the calls to it would have to be appropriately 
modified. 


References 
1. American National Standards Institute: Fortran. Publication 
X3.9-1966. 
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Hidden-Line Plotting Program 
[Hugh Williamson, Comm. ACM 15 (Feb. 
100-103] 


1972), 


Hugh Williamson [Recd. 9 Oct. 1972] 
National Con-Serv, Incorporated, Austin, Texas 


The input quantities to subroutine H/DE referred to in the 
following paragraphs (e.g. NI, NFNS, “input curve to be plotted”) 
are described in the block of comment statements at the beginning 
of HIDE as originally published. 

If Ni <0, DO loop 71 is not executed properly, since the 
upper limit, NI, is less than the lower limit, 2. This affects only 
checking for monotonicity in the input abscissa array; otherwise, 
if the inputs are correct, the performance of the program is not 
atfected. 

The error is corrected if the first 11 executable statements are 
replaced by the following (the first executable statement of the 
original program, which is not changed, is listed for clarity): 


IF(MAXDIM.LE.0) RETURN 
IFPLOT = 1 

IF(N1.GT.0) GO TO 76 

Ni = —NI1 

IFPLOT = 0 


420-P 5- R1 


Fig. 1. Without verticals. 


76 DOTII = 2,NI 
IF(X(I—1).LT.XU)) GO TO 71 
MAXDIM = 0 
GO TO 75 

71 CONTINUE 

IF(NG.GT.0) GO TO 5000 


On computers in which all variables are not automatically 
set to zero before execution, FNSM{I is not properly initialized 
if NFNS < 0. To correct this, simply insert the statement 


FNSMI = 0. 
before the statement 
LF(NFNS.LE.O) GO TO 46 


The latter is the sixth statement after Fortran statement num- 
ber 74. 

FNSM1 will still be improperly defined if NFNS = 1. If 
only one curve is to be plotted, however, translating to simulate 
stepping in the depth dimension will not be done, so set NFNS = 
—1 for only one curve to be plotted. 

In some cases, the three-dimensional surface is easier to visual- 
ize if (nearly) vertical lines are drawn at the left edge of each curve; 
this effect is illustrated by Figures t and 2. The verticals are 
added by inserting (XMIN-e, YMIN) as the first point in each 
input curve to be plotted, where « is a small positive number 
(10-*& DELTAX would be appropriate). 

The author appreciates very much the comments received 
from readers of Communications regarding implementation of 
HIDE on different computers. 
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Remark on Algorithm 420 [J6] 
Hidden-Line Plotting Program [Hugh Williamson, 
Comm. ACM 15 (Feb. 1972), 100-103]. 


Blaine Gaither [Recd. 3 Apr. 1973] 
New Mexico Institute of Mining and Technology 
(TERA), Socorro, INM 87801 


The algorithm was compiled and run without corrections on an 
IBM 360 G44. It has been in use for a year now with no problems. 
However, there is danger of division by zero if NFNS equals 1. To 
eliminate this danger the statement: 


IF(NFNS.EQ.1) NFNS = —-1 
should be inserted between the statements: 
IF(NG.LT.—1) SIGN = —1 


IF(NFNS.LE.0) GO TO 46 
Depth axis may be added by the following changes. Where 
ZMIN and ZMAX are the values for the nearest and farthest curves 
respectively, replace the continuation card of HIDE’s subroutine 
statement with: 
1 XLNTH,YLNTH,XMIN,DELTAX,YMIN,DELTAY, 
ZMIN,ZMAX) 
In place of the statement labeled 42 insert: 
42 DELZ = ZMAX — ZMIN 
IF (DELZ) 9601, 9602, 9601 
9601 XSC = XLNTH —°9. 
YSC = 6. — YLNTH 
IF (XSC) 9604, 9603, 9604 
9603 ANGZ = 90. 
GO TO 9605 
9604 ANGZ = ATAN(YSC/XSC)+*57.29578 
9605 ZLEN = SQRT(XSC«*«XSC-+ YSC*YSC) 
IF (ZLEN—1.) 9602, 9602, 9606 
9606 CALL PAXIS (0.,YSC,1H,—1,ZLEN,ANGZ,ZMAX, 
—DELZ/ZLEN) 
9602 IF (YLNTH.LT.0.) GO TO 43 
If ZMIN equals ZMAX or if the length of the depth axis would 
be less than or equal to 1., these changes will have no effect. The 
max and min numbers on the depth axis may overlap with those of 
the horizontal and vertical axis. 


Remark on Algorithm 420 [J6| 
Hidden-Line Plotting Program [Hugh Williamson, 
Comm. ACM 15 (Feb. 1972) 100-103. ]} 


T.M.R. Ellis [Recd. 26 Mar. 1973 and 30 July 1973] 
Computing Services, University of Sheffield, England 


Algorithm 420 has been implemented on an ICL 1907 computer 
and used to plot the surface entitled “Test for Plotting Routine 
Hide” as well as a number of other surfaces. The system plotting 
routines for the ICL 1900 series computers more or less duplicate 
those used by Williamson, except in the case of PDATA for which 
no equivalent routine exists. There is however a system routine 
which draws a smooth curve through a set of points, and only slight 
modifications were required to reproduce the exact effect of PDATA. 

The implementation was checked by the satisfactory reproduc- 
tion of the “Test for Plotting Routine Hide,” and subsequently it 
produced good representations of other surfaces. However, when 
attempting to plot a square-based pyramid, the program failed due 
to an error in HIDE. 

When H/DE is searching for points at which the current line 
appears and disappears, it searches for the zeros of a function 
(G—Y) where G is the current visual maximum (i.e. as already 
drawn) and Y is the current ordinate (as to be drawn). This search 
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is carried out by comparing the values of the function (G— Y) at 
adjacent points in the current line (Y) andor the current visual 
maximum (G), as shown in Figure 1. 

Due to the fact that each line drawn.is shifted upward and to 
the left, in order to simulate perspective, data points on successive 
lines which in the actual surface would have the same abscissa will 
have different abscissa in the drawing. Thus X0 and X1 might repre- 
sent the same value of the abscissa in the surface. At XO and X1 in 
the above drawing the function (G— Y) has a negative value, while 
at X2 and X3 it is positive. Clearly if Fl and F2 are the values of 
(G—Y) at X1 and X2 there is a zero between X!1 and X2 if and only 
if F1 and F2 have opposite signs. This is tested for by the statement: 
1002 IF(F1*F2.GT.0.) GO TO 1005 

If a zero is found to exist, its abscissa is calculated by linear 
interpolation, the slope of the line being determined by the next 
statement:: 

SLOPE = (F2—Fl)‘(X2—X1) 

A check is subsequently made to-avoid dividing by zero if 
SLOPE is too small. 

In the case of the square based pyramid referred to above, the 
projection used was such that it was viewed down its rear face, and 
therefore all lines traversing the far face of the pyramid were both 
parallel to one another and passed through the same point on the 


COLLECTED ALGORITHMS (cont.) 


graph (the peak of the pyramid). Thus for a part of their length all 
the lines after that which goes over the peak are drawn on top of 
each other, as shown in Figure 2. When plotting the second of these 
coincident lines the respective G and Y functions are therefore as 
shown In the exploded form in Figure 3. 

This clearly means that for a number of consecutive abscissa 
values both FI and F2 are zero. Due to the way in which H/JDE 
keeps track of its path along the two functions G and Y, the effect 
of both Fl and F2 being zero is for the abscissa (X1) corresponding 
to the first of the two ‘'zeros"’ to be entered in the visual maximum 
array for a second time. During the plotting of the next line there- 
fore. the visual maximum function G vs. XG has two identical 
entries, and thus the stage comes when X1 corresponds to the first, 
and X2 to the second (see Figure 4). 

If, as in this case, this (third) line would be coincident with the 
second (and the first) at this point, then Fl = F2 = O and the test 
at 1002 (above) will lead to the calculation of SLOPE, and hence 
fatlure due to the division by zero (X2—X1). 

The problem can, however, be very easily corrected by insert- 
ing the following statement immediately after the statement with 
label 1002: 

IF(FL.EQ.FZ) GO TO 1005 

Since this statement can only be reached if F1l*F2 1s less than or 
equal to zero, then clearly the jump will be made if and only if 
Fl = F2 = 0. In this case the second ‘zero"’ is ignored. and the 
program proceeds satisfactorily. 


Remark on Algorithm 420 [J6] 

Hidden-Line Plotting Program [Hugh Williamson, 
Comm. ACM 15 (Feb. 1972) 100-103] and Remark on 
Algorithm 420 [T.M.R. Ellis, Comm. ACM 17 (June 
1974), 324-325] 


T.M.R. Ellis [Recd. 8 July 1974] Computing Services, 
University of Sheffield, England 


There was an unfortunate printing error in my Remark on 
Algorithm 420 which made nonsense of the whole thing. The state- 
ment which should be inserted to correct the error discussed should, 
of course, be: 


IF(F1.EQ.F2) GO TO 1005 
and not: IF(F1.EQ.FZ) GO TO 1005 as printed. 
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Algorithm 421 


Complex Gamma Function 
with Error Control [S14] 


Hirondo Kuki* (Recd. 17 Aug. 1970 and 21 June 1971) 
Computation Center, The University of Chicago, 
Chicago, Illinois 


Key words and phrases: complex gamma function, gamma 
function, complex loggamma function, loggamma function, round-off 
error control, inherent error control, run-time error estimates, error 
estimates, special functions 

CR Categories: 4.9, 5.11, 5.12 


Description 

This Fortran program computes either the gamma function or 
the loggamma function of a complex variable in double precision. 
In addition, it provides an error estimate of the computed answer. 
The calling sequences are: 

CALL CDLGAM (Z, W, E, 0) for the loggamma, and 

CALL CDLGAM (Z, W, E, 1) for the gamma, 

where Z is the double precision complex argument, W is the answer 
of the same type, and E is a single precision real variable. Before the 
call, the value of E is an estimate of the error in Z, and after the call, 
it is an estimate of the error in W. 

For details of the characteristics of the program, an analysis of 
the algorithm, and the nature of the error estimate, see [1]. 

This program was tested on an IBM System 360 Model 65. A 
slightly modified version was used for this purpose to take advan- 
tage of the availability of such facilities as the ENTRY statement 
and functions of the type double precision complex. Compiled by 
OS/FORTRAN-H, opt. 2, it required 3188 bytes of storage. Per- 


* Deceased. 
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formance statistics on samples of 500 arguments each, from seven 


disjoint reg: 


ions within the square 


{z = 21 + ize;|z1|,|z2| < 30}, 
were as follows: 


Region I 0<z,and|z| <3 
RegionIIT O<z,and3 <|z| < 10 
Region III 0 <z: < 10, —30 <z2 < 30, and 10 < |z| 
RegionIV 10 <2) < 30, —10 <z: < 10 
RegionV 10<2z, < 30,10 <|z2| < 30 
Region VI —30<z,:<0,-l<z<1 
Region VII —30 <z; <0,1 <|z2] < 30 

time* 
Region logT r Max error** RMS error** 
I 2100 2470 =. 2.3 K 10715 8.7 X 10716 
II 1800 2230 = 7.6 & 10715 2.8 X 10715 
lll 1700 1930 1.6:%10-* 7.8 X 10715 
IV 920 1500 1.4 & 10-44 7.1 X 1075 
Vv 1000 1500 1.6 X 10°14 7.9 X 10715 
VI 2130 2330 =2.6 X 10714 7.9 * 10715 
VII 1900 2050. +2.4 X 10714 9.5 X 10715 


* Average time in us. 
** Generated absolute errors for computation of the log- 
gamma function. 


Essentially the same statistics were obtained as generated rela- 
tive errors for computation of gamma function. Statistics on the 
effectiveness of the error estimate are found in [1]. 


References 
1. Kuki, H. Complex gamma function with error control. 
Comm. ACM 15 (Apr. 1972), 262-267. 
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SUBROUTINE CDLGAMCCARGs CANS» ERROR» L FO) 


C COMPLEX GAMMA AND L@GGAMMA FUNCTIONS WITH ERR@R ESTIMATE 
Cc 
C CARG = A COMPLEX ARGUMENTs GIVEN AS A VECT@R @F 2 DOUBLE 
Cc PRECISION ELEMENTS CONSISTING OF THE REAL COMPONENT 
c F@LLQOWED BY THE IMAGINARY COMP@NENT 
C CANS = THE COMPLEX ANSWER» @F THE SAME TYPE AS CARG 
C ERRGR = A REAL VARIABLE. IT STANDS FOR AN ESTIMATE @F THE 
Cc ABS@LUTE ERROR OF THE ARGUMENT AS INPUT. AS OUTPUT 
Cc IT GIVES AN ESTIMATE @F THE ABS@LUTE (FOR L@GGAMMA) 
Cc @R THE RELATIVE (F@R GAMMA) ERROR OF THE ANSWER 
C LFO = FLAGe SET IT TO 0 FOR LOGGAMMAs AND 1 F@R GAMMA 
DOUBLE PRECISION CARG(2)sCANS(2),COEFC 7)» FOsF1sG0sGt> 
$ PI,DPIsHL2PsAL2P»s DELTA» DEOs DELsZ1+Z2ZZ1 2 Wie Wes V1 e 
$ AsBs Us UL» 925 UUL, UU2s UUUL » UUU2s V1 V25 VV V2 Ths T2e 
$ H»H1sH2sAL1sAL2s DN» EPS» OMEGA 
DATA COEFC(1)/+0-641025641025641026D-2/ 
DATA COEFC2)7-0-191752691752691753D-27 
DATA COEFC3)/+0-8 417508 4175086 41751D-37 
DATA COEFC 4) 7-0. 59523809 523809 5238D-3/ 
DATA C@EFCS)7+0+793650793650793651D-3/ 
DATA COEFC 6)/-0¢277777777777777778D-27 
DATA COEFC7)/+0+833333333333333333D-1/ 
DATA FO/8 40-07385296052619D0/s F1/20-00!123082189 4200D0/ 
DATA GO/1680-147705921052 4D0/s G17180+01 4770 470520 42D0/ 
DATA PI/3+¢1415926535897932 4D0/ 
DATA DP1I/6-28318530717958648D0/ 
DATA HL2P/0-91893853320 46727 42D0/ 
DATA AL2P/1t -83787706640934548D0/ 
C CONSTANTS EPS AND @MEGA ARE MACHINE DEPENDENT. 
C EPS IS THE BASIC ROUND-OFF UNIT. F@R S/360 MACHINES, 
C IT IS CHGSEN TO BE 16**-13. FOR BINARY MACHINE OF N- 
C BIT ACCURACY SET IT TO 2%*C-N+1)5 AND INITIALIZE DEO 
C AS 5-0 RATHER THAN AS 2-9 
C OMEGA IS THE LARGEST NUMBER REPRESENTABLE BY THE FLOAT 
C POINT REPRESENTATION OF THE MACHINE. FOR S/360 
Cc MACHINESs IT IS SLIGHTLY LESS THAN 16*#*63.- 


DATA EPS/2-2D-16/ 

DATA OMEGA/7 -23700538D7 5/7 
Zi = CARGC1) 

Z@ = CARGC2) 

DELTA = ABSCERRGR> 

DEO = 2-0D0 

DE1 = 0-0 


COLLECTED ALGORITHMS (cont.) ~421-P 2- 0 


© FORCE SIGN @F IMAGINARY PART GF ARG T@ NON-NEGATIVE C 10**9, T@ SAVE TIME AND TO AVOID IRRELEVANT UNDERFL®W 
LFL = 0 Vv1 = 0-0 
IF (Z2 «GE. 0-0) G@ To 20 vv2 = 0-0 
LFi = 1 IF CA «GT. 1-0D18) G@ TO 220 
Z2 = -22 uul = ZIZA 
20 LF2 = 0 uu2 = -Z2/A 
IF (Z1 «GE. 0-0) G@ Ta 100 UUUI = UUI*UUL - UU2*UU2 
© CASE WHEN REAL PART @F ARG IS NEGATIVE UuU2 = UUI*UU2*2.0D0 
LF2 = 1 VV1 = COEFCE) 
LF = LFI-1 D@ 210 J = 257 
Tl = AL2P - Pl*z2 B= VVI*eUUUI - VveeUUU2 
T2 = PI*(0.SDO = Z1) Vv2 = VV1eUUU2 + VV2*UUUI 
U = -DPI*Z2 210 VV1 = B + COEFCJ) 
IF (U «GE. -0-1054D0) G3 TO 40 B= VVIKUUL - VV2*eUU2 
A = 0.0D0 VV2 = VVIeUU2 + VV2*eUUL 
C IF EekU «LT. 10#*(-17). IGNORE IT TO SAVE TIME AND T@ AVOID vvl = B 
C IRRELEVANT UNDERFLOW 220 WI = CCCVVI + HL@P) - WI) - 21) + VI 
IF CU «LE. ~39-15D0) GO Te 30 we = (cvV2 - W2) - 22) + v2 
A = DEXPCU)D DEO = DEO + DABSCV1) + DABSCV2) 
30 HI = 1.0D0 - A IF (K «LE. 0) DEL = DEL + ALI 
G@ T@ 50 C FINAL ASSEMBLY 
40 U2 = UU IF (LF2 .NE- 0) G2 T@ 310 
A = ~UsCFL*eU2 + FO) IF (LFO -EQ- 0) GQ TB 400 
Hl = CA + Ad/CCUZ + G1)*U2 + GO + AD A = DEXPCWL> 
A = 1.0D0 - HI Wi = A*DCOS(W2) 
C DINT IS THE D@UBLE PRECISION VERSION OF AINT» INTEGER EX- W2 = A*DSINCW2) 
C  TRACTIGN. THIS FUNCTION IS N@T INCLUDED IN ANSI F@RTRAN IF (LF3 «EQ. 0) G@ T@ 400 
C .« WHEN THIS FUNCTI@N IS N@T PR@VIDED BY THE SYSTEM,. Bo = CWLEUL + W2eU2) 7 U 
© EITHER SUPPLY IT AS AN EXTERNAL SUBR@UTINE CAND TYPE THE W2 = CW2*U1 - WIeU2) 7 U 
C NAME DINT AS D@UBLE PRECISION)» OR MODIFY THE NEXT Wi = B 
C STATEMENT AS THE EXAMPLE FOR S/360 INDICATES. FOR S/360 G@ T@ 400 
C REPLACE IT WITH 310 H = HI*HI + H2*H2 
c IF CH «EQ. 0-0) G@ TA 500 
c DOUBLE PRECISI@N SCALE IF (LFO -EQ. 0) G@ T@ 320 
c DATA SCALE/Z 4F00000000000000/ IF CH «GT» 1.0D-2) G@ T@ 330 
Cc 50 B = ZI - ¢¢Z1t - 0.5D0) + SCALE) 320 A = DLOGCH)*0.5D0 
50 B = 21 - DINTC(Z1 - 0.5D0) IF CH «LE. 1+0D-2) DEO = DEO - A 
H2 = A*DSINCDPI*B) IF (LFO «NE+ 0) GO TB 330 
B = DSINCPI*B) Wl = (TI - A) - Wi 
Ht = Hi + (B+B)*B*A W2 = (T2 - DATAN2(H2sH1)) - We 
H = DABSCH2) + Ht - DPI¥A*DELTA G@ Te 400 
IF CH «LEe 060) GO TB 500 330 Tl = Tl - WI 
DEO = DEO + DABSCT1) + T2 T2 = T2 - we 
DEL = PI + DPI+*A/H A = DEXPCT1) 
Zt = 1-0D0 - Z1 Tl = A¥*DCOS(T2) 
C CASE WHEN NEITHER REAL PART NOR IMAGINARY PART OF ARG IS T2 = AXDSINCT2) 
C NEGATIVE. DEFINE THRESH@LD CURVE T@ BE THE BROKEN LINES Wl = (TI*HI + T2*H2)/H 
© CONNECTING POINTS LOFO*I, 10F4.142«I1, 0.1F14-.042*1,AND W2 = (T2+*HI - T1*H2)/H 
C O+1FQMEGA*I IF (LF3 -E@. 0) G@ T@ 400 
100 LF3 = 0 B= WikUl - W2*U2 
Yl = ZI - 0.5DO W2 = W1l*U2 + W2*U1 
Wi = 0.0 Wl = 8B 
W2 = 0.0 400 IF (LFL «NE- 0) We = -W2 
K = 0 C TRUNCATI@N ERRGR OF STIRLINGS FORMULA IS UP T@ 3*10#e-17. 
B = DMAX1(O.1D0, DMINIC10-0D0, 14.142D0-Z2)) - Z1 DE = DEO*EPS + 3.0D-!7 + DEIL*DELTA 
IF (8B «LE. 0-0) G@ TA 200 G@ Ta 600 
C CASE WHEN REAL PART @F ARG IS BETWEEN 0 AND THRESH@LD : CASE WHEN ARGUMENT IS T0@ CLOSE TQ A SINGULARITY 
LF3 = 1 
ZZ1 = Z) SOO WI = OMEGA 
N = B + 1-Q0D0 W2 = OMEGA 
DN = N DE? = OMEGA 
Zi = Z1 + DN c 
A = Z1*Z1 + Z2*Z2 600 CANS(1) = WI 
V1lo= Z1ZA CANS(2) = W2 
V2 = -Z2/A ERRGR = DE} 
C INITIALIZE U1+U2*I AS THE RIGHTM@ST FACTOR 1-1/¢Z+N) RETURN 
UL = 1-0D0 ~ V1 END. 
u2 = -v2 
K = 6-0D0 - Z2*0-6D0 = ZZ} 


IF (K «LE- 0) G@ TO 120 
CG F@RWARD ASSEMBLY @F FACTORS (€Z+J-1)/¢Z+N) 


N =N-~- K 

UU1 = ¢€ZZ1*Z1 + Z2*Z2) 7 A 

UU2 = DN*Z2/A 

vV1 = 0-0 

vv2 = 0.0 

D@ 110 J = 15K 
Bo = UL*CUUL+VV1) - U2*CUU2+VV2) 
U2 = Ub*eCUU2+VV2) + U2*CUUI+FVVID 
Ul = 8B 
vvi VVI + Vi 


110 vv2 = vv2 + V2 
120 IF (N -LE-« 1) G@ TO 140 
C BACKWARD ASSEMBLY OF FACT@RS 1-J/¢(Z+N) 


vv = VI 

vve = v2 

D@ 130 J = 2sN 
VV1 = VV1 + VI 


vv2 = vv2 + v2 
B = Ul*¢!.ODO - VVI) + U2*vVe 
U2 = -UI*xVV2 + U2*C1.0D0 - VV1> 
130 ul = B 
140 U = UI*eUL + U2*uU2 
IF (U -EQ@- 0-0) GO TO 500 
IF (LFO -EQ@- 0) G@ TO 150 
IF «K eLE- 0) GO T@ 200 
150 ALI = DL@GCU)*0.SD0 
IF (LFO «NE- 0) GO TO 160 
Wk = ALI 
W2 = DATAN2(U2.U1) 
IF ¢(W2 «LT. 0-0) W2 = W2 + DPI 
IF ¢(K .«LEe 0) GO T@ 200 
160 A = 2Z1 + Z2 - DELTA 
IF €A »LE- 0.0) GO TO S00 
DEO = DEO - ALI 
DE1 = DE! + 2-0D0 + 1-0D0/A 
C CASE WHEN REAL PART @F ARG IS GREATER THAN THRESH@LD 
200 A = Z1*Z1 + Z2*Z2 
AL1 = DL@GCA)*0-5D0 
AL2 = DATAN2¢(Z2sZ1) 
Vi = YI*ALI - Z2*AL2 
V2 = YI*AL2 + Z2*AL! 
© EVALUATE ASYMPTOTIC TERMS. IGN@RE THIS TERMs IF ABS VALCARG) -GT- 
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Algorithm 422 
Minimal Spanning Tree [H] 


V. Kevin M. Whitney (Recd. 4 May 1970, 13 Oct. 1970, 
and 3 Aug. 1971) 

Department of Electrical Engineering, University of 
Michigan, Ann Arbor, MI 48104 


Key Words and Phrases: spanning tree, minimal spanning tree, 
maximal spanning tree 
CR Category: 5.32 


Description 

This algorithm generates a spanning tree of minimal total edge 
Jength for an undirected graph specified by an array of inter-node 
edge lengths using a technique suggested by Dijkstra [1]. Execution 
time is proportional to the square of the number of nodes of the 
graph; a minimal spanning tree for a graph of 5O nodes is generated 
in 0.1 seconds on an IBM System 360/67. Previous algorithms [2, 3, 
4, 5] require an amount of computation which depends on the graph 
topology and edge lengths and are best suited to graphs with few 
edges. 

The nodes of the graph are assumed to be numbered from 1 to 
N. The length of an edge from node / to node J is given by array 
element DM(I, J). If there is no edge from node J to node J, 
DMU, J) is given a value larger than the length of the longest edge 
of the graph, say 101°. The diagonal elements of array DM are not 
used and may have any value. After execution of the algorithm, the 
edges of a minimal spanning tree are specified by pairs of nodes in 
array MST and the total edge length is given by CST. 

The Dijkstra algorithm grows a minimal spanning tree by suc- 
cessively adjoining the nearest remaining node to a partially formed 
tree until all nodes of the graph are included in the tree. At each 
iterative step the nodes not yet included in the tree are stored in 
array NIT. The node of the partially completed tree nearest to node 
NIT(D is stored in Ji(I), and the length of edge from NIT(I) to 
JI) is stored in UI(J). Hence the node not yet in the tree which is 
nearest to a node of the tree may be found by searching for the 
minimal element of array UJ. That node, KP, is added to the tree 
and removed from array NIT. For each node remaining in array 
NIT, the distance from the nearest node of the tree (stored in array 
UI) is compared to the distance from KP, the new node of the tree, 
and arrays UI and JI are updated if the new distance is shorter. The 
nearest node selection and list updating are performed N — 1 times 
until all nodes are in the tree. A proof that this algorithm finds a 
minimal spanning tree and a discussion of the related shortest path 
tree algorithm will be found in either of references [1] or [6]. 

Most of the execution time for this algorithm is spent in the 
search and updating statements between statements labeled 200 and 
500 which are executed N — 1 times. Since on the Ath execution a 
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list of N—K items is searched and updated, the total execution time 
is proportional to N?. 

If the graph represented by the inter-node edge length array DM 
is not connected, the procedure will generate a minimal spanning 
forest containing the minimal spanning trees of the disjoint compo- 
nents joined together by edges of length 10!°. A disconnected graph 
is indicated by a value of 10'° for variable UK at step 500 during 
execution of the algorithm. 

The algorithm may be simply modified to find a spanning tree of 
maximal total length by changing the loop between statements 300 
and 400 to search for the most distant rather than for the nearest 
remaining node to be adjoined to the partially completed tree. 

The data storage required for the algorithm may be reduced 
from N(N + 5) locations to 5N locations by replacing array DM with 
an edge length function which calculates the required inter-node 
edge lengths as they are needed. Such a function will be called 
N(N — 1)/2 times and may extend considerably the.size of problem 
which can be solved by this algorithm on a machine with limited 
core storage. 

Acknowledgment is due E.L. Lawler for bringing reference [1] 
to my attention. This work was supported in part by Rome Air 
Development Center Contract F30602-69-C-0214 with the Systems 
Engineering Laboratory of the University of Michigan. 
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Algorithm 
SUBROUTINE DMTOMSCDMsNoMSTsIMSTs CST) 
Cc 
Cc THLS SUBROUTINE FINDS A SET OF EDGES OF A LINEAR GRAPH 
Cc COMPRISING A TREE WITH MINIMAL TOTAL EDGE LENGTHe THE 
Cc GRAPH IS SPECIFIED AS AN ARRAY OF INTER-NGDE EDGE LENGTHS. 
Cc THE EDGES @F THE MINIMAL SPANNING TKEE @F THE GRAPH ARE 
Cc PLACED IN ARRAY MST. EXECUTION TIME IS PK@PORTIONAL TO 
Cc THE SQUARE @F THE NUMBER @F MOQDES- 
Cc ‘. 
Cc CALLING SEQUENCE VARIABLES ARE: 
c 
Cc DM AKRAY OF “INTER-N@DE EDGE LENGTHS. 
Cc DMCIsJ) (1 eLEe IoJ eLEe IN) IS THE LENGTH OF 
Cc AN EDGE FROM NODE I TO NODE. Je IF THERE IS NO 
Cc EDGE FROM NODE I TO N@DE J» SET DMCI,J>=10-**10 
Cc N N@DES ARE NUMBERED 1s 25 eeos Neo 
Cc 
Cc MST ARRAY IN &HICH EDGE LIST @F MST IS PLACED. MSTC1>1) 
Cc IS THE @RIGINAL N@DE AND MSTC2s1) IS THE TERMINAL 
Cc NODE OF EDGE I FOR 1! eLE- I e«LEe IMSTo 
Cc IMST NUMBER GF EDGES IN ARKAY MST- 
Cc CST SUM @F EDGE LENGTHS @F EDGES OF TKEE- 
Cc 
Cc PR@GRAM VARIABLES 3 
c 
Cc NIT ARRAY @F NODEd NOT YET IN TREE. 
c NITP NUMBER GF NODES IN ARRAY NIT 
Cc JICI) N@DE OF PARTIAL MST CLOSEST TO NODE NITCI)- 
c UICL) LENGTH GF EDGE FROM NITCI) TO JICI)+ 
c KP NEXT N@DE TO BE ADDED T@ ARRAY MST-~ 


COLLECTED ALGORITHMS (cont.) 


DIMENSION DMC 50s 50)sMST(2s 50) 
DIMENSI@N UIC50).J1050),NITC 50) 


INITIALIZE NODE LABEL ARRAYS 


aaa 


CST=0- 

NITP=N-1} 

KP=N 

IMST#0 

D@ 100 I=1sNITP 
NITCI)=1 
UECI)=DMC1.sKP) 

100 JICI>=KP 


aaa 


UPDATE LABELS @F N@DES NOT YET IN TREE- 


200 DO 300 I=1l»NITP 
NI = NITCI) 
D=DMCNIsKP) 
IFCUICI) LED? GO TQ 300 
UICI)=D 
JICT=KP 
300 CONTINUE 


FIND NODE OUTSIDE TREE NEAREST TO TREE. 


aaa 


UKSUICT) 

D@ 400 I=1,NITP 
IFCUICI)-GT.UK) G@ Te 400 
UK=UICI) 

K=I 
400 CONTINUE 


PUT NODES OF APPROPRIATE EDGE INT@ ARRAY MST. 


aaa 


IMST=IMST+1 

MSTCLs IMST)SNITCK) 
MSTC2s IMSTI2IICK) 
CST=CST+UK 
KP=NITCK) 


DELETE NEW TREE N@DE FROM ARRAY IT. 


aaa 


UICKIFULCNITP) 
NETCKI=ENITCNITP) 
JICKI=JIONITP) 
NITP=NITP-1 

500 IF (NITP.»NE.0) GO Toe 200 


WHEN ALL N@DES ARE IN TREE» QUIT. 


aaa 


RETURN 
END 


Remark on Algorithm 422 [H] 
Minimal Spanning Tree [V.K.M. Whitney, Comm. 
ACM 15 (Apr. 1972), 273-4] 


B.W. Kernighan [Recd. 23 June 1972] Bell Telephone 
Laboratories, Incorporated, Murray Hill, New Jersey 


An integer-arithmetic version of Algorithm 422 has been tested 
on the Honeywell 6070 using the Fortran A compiler, on several 
graphs. The algorithm produced correct results in all cases. 

Algorithm 422 computes the minimal spanning tree by suc- 
cessively adding the nearest remaining node to a partially formed 
tree until all nodes of the graph are included in the tree. This pro- 
cedure, which the author attributes to Dijkstra [1], was in fact in- 
dependently developed by R.C. Prim [2], two years earlier. 
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Algorithm 423 
Linear Equation Solver [F4] 


Cleve B. Moler (Recd. 1 July 1970 and 1 Dec. 1970) 
Department of Mathematics, The University of 
Michigan, Ann Arbor, MI 48104 

(This work was supported by the Office of Naval 
Research under contract NR 044-377.) 


Key Words and Phrases: matrix algorithms, linear equations, 
Fortran, paged memory, virtual memory, array processing 
CR Categories: 4.22, 4.32, 5.14 


Description 

These routines are modifications of, and intended as replace- 
ments for, the corresponding routines in [1]. The modifications in- 
crease efficiency while retaining accuracy and ease of use. Considera- 
tion is made of the effect of Fortran array storage conventions and 
paged dynamic memory allocation schemes. When translated by a 
good Fortran compiler, the routines should be competitive with 
programs written directly in machine language. For more details, 
See [2]. 

Both routines must be used to solve a system of linear equations, 
Ax = b. DECOMP carries out that part of the computation which 
depends only on the matrix A. SOLVE uses these results to obtain 
the solution for any right hand side 5. 


References 
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Algorithnt 


SUBROUTINE VECOMP(N, NOIMN, A, EP) 
REAL ACNDIM,NDIM),T 
INTEGER I PCNDIIA) 


MATRIX TRIANGULARIZATION BY GAUSSIAN ELIMINATION. 
INPUT... 

N = ORDER QF H1ATREX. 

ND!) = DECLARED DINENSION OF ARRAY A. 

A = HATRIX TO BE TRIANGULARIZED. 

OUTPUT... 

ACI,J), t.LE.Jd = UPPER TRIANGULAR FACTOR, U. ; 
ACI,J), [.GT.J = MULTIPLIERS = LOWER TRIANGULAR 
FACTOR, [nL . 

1P(K), KeLT.N = INDEX OF K-TH PIVOT ROW, 

(PCN) = (-1)** (NUMBER OF INTERCHANGES) OR 0 . 
USE 'SOLVE' TU OBTAIN SOLUTION OF LINEAR SYSTEM. 
DETERIICA) = EPCN)*AC1,1)*#A(2,2)4...*ACH MN). 

IF IP(N)#=0, A IS SINGULAR, SOLVE WILL DIVIDE GY ZERO. 
INTERCHANGES FINISHED IN U , ONLY PARTLY INL. 


ANQMDANAANANANAAAANAONAN 


IP(N) = 1 

00 6 K = 1,N 
IFCK.EQ.N) GO TO 5 
KP1l = Kel 
"= K 


vo 1 1 = KP1,N 


ANMNAQANQNAANRNMNANAO 


TF CABS (ACHE, K)) .GT.ABS(CAGI,K))) 4 = I 
1 CONTINUE 
IP(K) = 1 
1FCt.NE.K) 
T = ACH,K) 
AG4I,K) = ACK,K) 
ACK,K) = T 
IF(T.EQ.0.) GO TO 5 
DO 2 | = KPI,N 
2 ACI,K) = -ACI,K)/T 
DO 4& J = KPI,N 
T = ACH,J) 
ACii,J) = ACK,J) 
ACK,J) = T 
IFC(T.EQ.0.) GO To & 
DO 3 I = KPI,N 


IP(N) = -IP(N) 


3 ACI,J) = ACI) + ACH, RK) aT 
4 CONTINUE 
5 IFCACK,K).EQ.0.) TPC(H) = 0 
6 CONTINUE 
RETURN 
END 


SUBROUTINE SOLVE(N, NOIM, A, B, IP) 
REAL ACNDI, NDI) OCHO ,T 
INTEGER IPCNOIT) 


SOLUTION OF LINEAR SYSTEM, A*X =u. 
INPUT... 
N = ORDER OF TiATRIX. 
NDItt = VECLARED DIWENSION OF ARRAY A . 
A = TRIEANGULARIZED MATRIX OBTAINED FROM 'DECUIIP'. 
B = RIGHT HAND SIDE VECTOR. 
IP = PEVOT VECTOR OBTAINED FROM 'DECONP!. 
JO NOT USE IF UECUIIP HAS SET IP(N)=0. 
UUTPUT.. 
B = SOLUTION VECTOR, X . 


JF(N.EQ.1) GO TO 9 
Nido Nel 
vO 7 K = 1,Nil 

KP1 = K+l 

it = IP(K) 

T = BC) 


KP1,N 
7 BCI) C1) # ACL, K)*T 
vO 8 KB = 1,11 
KN1 = N-KB 
K = KN1e1 
BCK) = BCK)/ACK,K) 
T = -B(K) 
vO 8 1 = 1,Kill 
8 BCT) = BCI) # ACL,K)*T 
9 BC(1) = BC(1)/AC1,1) 
RETURN 
END 


B 
T 
! 


= 
= 
! 

ry 
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Algorithm 424 


Clenshaw-Curtis 
Quadrature [D-l] 


W. Morven Gentleman (Recd. 5 Oct. 1970 and 
13 Aug. 1971) 
University of Waterloo, Waterloo, Ontario, Canada 


Key Words and Phrases: quadrature, Chebyshev series, cosine 
transform, fast Fourier transform 
CR Categories: 5.1 


Description 

Clenshaw-Curtis quadrature is one of the most effective auto- 
matic quadrature schemes available, particularly for integrands with 
some continuous derivatives. It can also be used for any piecewise 
continuous integrand, although it is not recommended for inte- 
grands with discontinuities. 

The automatic scheme [1] consists of evaluating the N + 1 point 
Clenshaw-Curtis quadrature formula, together with some error 
estimate, for a sequence of N’s until the estimated absolute error 
ESTERR is less than the product of the tolerated relative error 
TOLERR and the absolute value of the current estimate of the 
integral, or until the permitted number of function evaluations 
would be exceeded. The function subprogram CCQUAD uses the 
sequence N = 6, 18, ... , 2#3**M. The error estimate used is the 
absolute difference between the integral estimates for the current and 
preceding choices of NV. Other error estimates exist [1] although they 
are not as reliable, and the cosine transform CSXFRM(1), ..., 
CSXFRM(USED) is returned so they can be computed if desired. 
The N + 1 point Clenshaw-Curtis quadrature formula shifts the 
interval (A, B) to the interval (—1, 1), then integrates the poly- 
nomial which interpolates the integrand F at the Chebyshev points 
cos(rs/N), s = 0, 1, ..., N. Because the cosine transform is an 
explicit representation of this polynomial, an approximation to the 
indefinite integral of the integrand in the interval can be obtained 
from the indefinite integral of this polynomial, which is another 
reason why the cosine transform is returned. 

Earlier implementations of this quadrature scheme [e.g. 4] 
computed the cosine transform by a recursive method which was 
slow and suffered from rounding error, but CCQUAD uses a variant 
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of the fast Fourier transform [2, 3] and is very fast and very resistant 
to rounding errors. Timings on several machines indicate that the 
total cost of the quadrature can be well described as the cost of 
computing two sines and two cosines for each integrand value used, 
plus, of course, the cost of computing the integrand values them- 
selves. The variant of the FFT used obtains all sines and cosines as 
required, and does not build tables or march recurrance relations. 
Using a separate subroutine (R3PASS) to perform the passes of the 
FFT on interleaved subsequences of the original sequence is a device 
introduced by G. Sande to force compilers for many machines to 
generate optimal code for the FFT. 

There is no requirement in the subprogram CCQUAD that A 
be less than B. There is also no requirement that TOLERR be posi- 
tive: if it is not, the maximum permitted number of integrand values 
will be used. The stopping rule always depends on relative error: if 
this is meaningless because the true integral vanishes, the maximum 
permitted number of integrand values will be used. Because the 
number of nested DO loops used to generate integrand values in digit 
reversed order is fixed at eight, the maximum number of integrand 
values permitted is the smaller of LIMIT and 2*3*«9 + 1 = 39367. 
This should be ample but the restriction is easily changed. 

Throughout the subprogram CCQUAD various statements 
appear with a C in column 1. If these comments are replaced by the 
statements themselves, intermediate results are written on unit 
number 6, enabling one to follow the decision process of the scheme. 
This can be very instructive in understanding the way the scheme 
works. 
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Algorithm 
REAL FUNCTION CCQUAL (F,A,B,TCLERR, LIMIT, ESTFRR,USED, 
CSXERM) 


aa 


INPUT ARGUMENTS- 
REAL F,A,B,TOLERR 
INTEGER LIMIT 

OUTPUT ARGUMENTS- 
REAL FSTERR, CSXFRM (LIMIT) 
INTEGER USED 


a 


USING CLENSHAW-CURTIS QUADRATURE, THIS FUNCTION SUB- 
PROGRAM ATTEMPTS TO INTEGRATE THE FUNCTION FP FROM A TO 8B 
TO AT LEAST THE REQUESTELC RELATIVE ACCURACY TOL®RR, WHILE 
USING NO MCRE THAN LIMIT FUNCTION FVALUATIONS,. IF THIS 
CAN BSE DONE, CCQUAD RETURNS THE VALUE OF THE INTEGRAL, 
ESTERR RETURNS AN ESTIMATE OF THE ABSOLUTE FRROR ACTUALLY 
COMMITTED, USFD RFTURNS THE NUMBEP OF FONCTION VALUES 
ACTUALLY USED, AND CSXFRM(1) ,.0+-CSXFRM(USFD) CCNTAINS 
N=USEO-1 TIMES THE BISCRETE COSINE TRANSPORM, AS USUALLY 
DEFINED, OF THE INTEGRAND IN THE INTERVAL. IF THE 
REQUESTED ACCUPACY CANNOT BE ATTAINED WITH THE NUMBER OF 
FUNCTION EVALUATIONS PFRMTTTFC, THE LAST (AND PRESUMABLY 
BEST) ANSWER OBTAINFD TS RETURNED, 


aagaaaaNaANnaaAaNnaANn 


REAL PI,&T3,C°NTRE, WIDTH, SHIFT, FUND, ANGLE,C,S 
REAL CLNOINT, NEWINT 
REAL T1,T2,T3,T4,T5,T6,T7,T8,T9,TIO0,TI1,T12 


COLLECTED ALGORITHMS (cont.) 


C INSERT THE FCLLOWING STATEMENT 


c 


ana 


Cc 
c 
Cc 
Cc 


c 


c 


an 


aaaaad 


c 


Cc 
Cc 


TO TRACE PROGRAM FLOW 

REAL SCLINT,SCLERR 

INTEGER N,N2,N3,N LESS 1,N LESS 3,MAX,M MAX,J,STEP 
INTEGER L(8),L1,12,13,L4,L5,16,1L7,18 

INTEGSR J1,32,J33,04,935,36,37,038,3 REV 

FOUTVALENCE (L(1),L1), (L(2),L2), (L(3) ,L3), (L(4),L4), 
. (L(5),L5), (L (4) ,L6), (L(7) ,U7), (L(8),L8), (J8,J REV) 
DATA PI,RT3/ 3.141592653589E0, 1.732050807568E0 + 
DATA M MAX/ 3 Y/ 


INITIALIZATION 
CENTRES (A+B) *,5F9 
WIDTH= (B-A) *.5EC 
MAX=MINO (LIMIT, 2*3**(4 MAX#1)) 
DO 10 J=1,M MAX 
L(J)=1 
10 CCNTINUE 


COSINE TRANSFORM 
CCMPUTE DOUBLF THE COSINE TRANSFORM WITH N=6 


, 


N=6 

SAMPLE FUNCTION 
CSXFRM (1) =F (A) 
CSXFRM (7) =F (3) 
SHIFT=WICTH¥RT3*, SEO 
CSXFRM (2) =F (CENTRE-SHTFT) 
CSXPFRM (6) =F (CENTRE+SHIFT) 
SHIFT=WIDTH*.529 
CSXFRM (3) =F (CENTRE-SHIFT) 
CSXPRM (5) =F (CENTRS+SHIFT) 
CSXFRM (4) =F (CSNTRE) 

EVALUATE THE FACTORFD N=6 COSINE TRANSFORM 
TI=CSXFRM (1) *CSXFRM(7) 

_ T2=CSXFRM (1) -CSXFRM (7) 
T3=2. FO*CSKFRM (4) 
T4=CSXFRM (2) +CSXFRM (6) 
T5= (CSXFRM (2) ~-CSXF2M (6) ) *BT3 
T6=CSXFRM (3) #CSXFR4 (5) 
T7=CSXFRM (3) -CSKXERM (5) 
T8=T14+2,E0*T6 
T9=2,F0*T4+T3 
T10=T2+T7 
T11=1T1-T6 
T12=T4-T3 
CSXFRM (1) =T8479 
CSXFRM (2) =T104#TS 
CSXPRM (3) =T114#T12 
CSXFRM (4) =T2-2,FO*T7 
CSXFRM (5) =T11-T12 
CSXFRM (6) =T10-T5 
CSXFRM (7) =T8-T9 


USED=7 
GO TO INTEGRAL COMPUTATION, BUT FIRST COMPUTE INTEGRAL FOR 
N=2 

GO TO 299 
COMPUTE REFINED APPROXIMATION 
SAMPLE FUNCTION AT TNTERMEDIATE PCINTS IN DIGIT REVERSFD 
CRDER. AS THE SEQUENCE IS GENERATED, CONPUTE THE FIRST 
(RADIX FOUR TRANSFORM) PASS OF THE FAST FOURIER TRANSFORM 
100 DO 119 J=2,M MAX 


L (J-1) =L (J) 
119 CGNTINUE 
L(M MAX) =3¥*L(M MAX=-1) 
J=USED 
FUND=PI/FLOAT (3*N) 
DO 120 J1=1,L1,1 
DO 120 J2=J91,L2,11 
DO 120 J3+J2,L3,L2 
DO 120 J4=J3,L4,1L3 
DO 120 J5=J4,L5,1L4 
DO 120 J6=35,L6,L5 
DO 120 J7=J6,L7,16 
DO 120 J8=J7,L8,L7 
ANGLE=FUND*FLOAT (3*J REV-2) 
SHI FT=WIDTH*COS (ANGLE) 
T1=F (CENTRE-SHIFT) 
T3=F (CENTRE+SHIFT) 
SHIFT=WIDTH*SIN (ANGLE) 
T2=F (CENTRE+SHIFT) 
T4=F (CENTRE-SHIFT) 
T5=T1+T3 
T6=T2+TY 
CSXFRM (J +1) =T54+T6 
CSXFRM (J+2) =7T1-T3 
CSXFRM (J+3) =T5-T6 
CSXPRM (+4) =T2-TH 
J=I+4 
120 CONTINUE 
DO RADIX 3 PASSES OF FAST FOURIER TRANSFORMS 
N2=2*N 
STEP=4 
150 J1I=USED+STEP 
J2=USED+2¥*STEP 
CALL R3PASS {N2,SIEP,N2-2*STEP,CSXFRM (USED+1), 
‘ CSXPRM(J141) ,CSXFRM(J241)) 
STEP=3*STEP : 
IP (STEP .LT. 


N) GO TO 150 


COMBINE RESULTS 
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C FIRST DO J=0 AND J=N 


c 


AANANAANN io) 


aaa 


a aagqaaaaa aaaa 


aA AaAANN aAANAAA 


aagaaaaaa 


-C HAT OF THESE TRANSFORMS, 


TI=CSXFRM (1) 
T2=CSXFRM(USED+1) 
CSXPRM (1) =T1#2, E0*T2 
CSXFRM (USED+1) =T1-T2 
TI=CSXFRM (N41) 
T2=CSXPRM(N2+2) 
CSXFRM (N41) =T14T2 
CSXFRM (N242) =T1I-2,E%*T2 
DO REMAINING VALUTS CF J 
N3=3*N 
N LESS 1=N-1. 
DO 180 J=1,N 
J=Nted 
J2=N3-J 
ANGLE=FUND*FLOAT (J) 
C=COS (ANGLE) 
S=STN (ANGL®) 
T1=CRCSXPRM (142) -S*¥CSXFRM (3242) 
T2= (S¥*CSXFRM (J 1#2) #C¥CSXFRM (3242) ) *RT3 
CSXFRM(J1+2) =CSXFRM(J4+1) -T1-T2 
CSXFRM (J2+2) =CSXFRM (341) -T1#T2 
CSXFRM (J +1) =CSXFRM (J41) #2, 70*71 
CONTINUE 
UNSCRAMBLE 
TI=CSXFRM (N2+1) 
T2=CSXFRM(N2+42) 
DO 190 J=1,N LFSS 14 
J1=USED+Jd 
J2=N2+7 
CSX FRM (J2) =CSXFR4 (31) 
CSXFRM(J1) =CSXFEM(J2+#2) 
CCNTINUE 
CSXFRM(N3) =T1 
CSXFRM (N3+1) =12 
N=N3 
USFD=N#1 
GO TO INTEGRAL COMPUTATTCN 
GO TO 210 


NCW 


LESS 1 


180 
NOW 


190 


INTFGRAL EVALUATION 
INTEGRAL ESTIMATES ARL NOT SCALED BY WIDTH*N/2 
UNTIL FOUNCTICN CCQUAD “RETUPNS. 


WHEN N=6,RVALUATE INTEGRAL FCR N=? 
200 OLDINT=(1142.20*1T3) /3.F% 


EVALUATE NEW ESTIMATE OF INTEGRAL 
210 N LESS 3=N-3 
NEWINT=. SEQ*&CSXFEM (USED) /PLOAT (1-N**2) 
pO 220 J=1,N LESS 3,2 
J REV=N-d 
NEWINT=NEWINT#CSXFRM(J PEV)/FLOAT (J REV*(2-J REV)) 
22C CCNTINUS 
NEWINT=NEWINI+, SFO*CSXFRM (1) 


TEST IF BONE 

TEST IF FSTIMATED ERPRC2 ADEQUATE 
ESTERR=ABS (OLDINT*3., FO-NEWINT) 

INSERT THE FOLLOWING FOS STATEMENTS TO TRACE PROGRAM FLCW 
SCLINT=WIDTH*NSWINT/FLOAT (N/2) 
SCLERR=WIDTH* (OLDINT* 3. FO-NEWINT) /FLOAT (N/2) 
WRITE (6,900) N,SCLINT,SCLERR 

900 FORMAT (3H N=,15,231 INTEGRAL ESTIMATED AS 

. 7H ERROR ,815.3) 

IF (ABS (NEWINT) *TOLERR 

IF ESTIMATED FRRCR TOC LARGE, 
OLDINT=NFWINT 
IF (3*N+#1 .LE. MAX) GO TO 100 

IF REFINEMENT NOL PERMITTEL, OR IF ESTIMATED 

SATISFACTORY, RESCALF ANSWERS AND RETURN 

INSERT THE FCLLOWING "WO. STATEMENTS TO TRACE 
WRITE (4,910) 

910 FORMAT (25H REFINEMFNT NOT PEPMITTEN) 

400 CCONAD=WIDTH*NEWINT/FLOAT (N/2) 
FESTERR=WIDTH*ESTERR/ FLOAT (N/2) 

RETURN 
END 


7£15.8, 


.GE. ESTERR) GO TN 400 
REFINE SAMPLING IF PERMITTED 


ERROR 


PROGRAM FLOW 


SURROUTINE P3PASS 
RADIX 3 PASS FOR FAST FCURIER 
OF LENGTH N2 

INTEGER N2,M,LFENGTH 

REAL XQ (LENGTH) ,X1(LENGTH) ,X2 (LENGTH) 

THE NOTATION OF REFER®NCFS 2 AND 3 TS USED IN THIS 
SUBROUTINE. , 

M IS THE LENGTH OF “H& TRANSFORM ALRFADY ACCOMPLISHED, 
I.E. THE NUMBFR OF DISTINCT VALUES OF THE FREQUFNCY INDEX 
AND THE SPACING OF THE 
SEQUENCES TO BE TRANSFORMED. EXPLICIT USE IS MADE OF THE 
FACT THAT M IS EVEN ANN NOT LESS THAN FOUR. 

INTEGER HALF M,M3,K,KC,K1,J,J0,01 

REAL TWOPI,HAF®T3,RSUM,RDIFF,RSUM2,ISUM,IDIFF,1IDIFF2 

REAL FUNC, ANGL2,C1,51,C2,S2,R0,R1,R2,10,11,12 

DATA TWOPI, HAFET3/ 6,283185307E%, »866025N03E0 / 

HALF M=(M-1) 72 

M3=M*3 

FOUND=TWOPI/FLOAT (43) 


(N2,M,LENGTH,X9,X1,X2) 
TPANSFORM OF REAL SEQUENCE 


COLLECTED ALGORITHMS (cont.) 


C CO ALL TRANSFORMS FOR C HAT=0, I.F. TWINDLE FACTOR UNITY 
DO 10 K=1,N2,83 
RSUM= (X1(K) #X2(K)) 
RDIFF= (X1(K)-X2(K)) *HAFRT3 
X1(K) =X0 (K) -hSOM*, SEO 
X2(K) =PDIFF 
XC (K) =XC (K) +F SUM 
10 CONTINUE 
C DO ALL TRANSFORMS FOr C HAT=CAP Cy/2, I.E. 
C £E(RB/5) 
J=M/2+1 
DO 29 K=J3,N2,%3 
RSUM= (X1(K) ¢X2(K)) #HAFRT3 
RDIPFP=(X1(K)-X2(K)) 
X1(K) =XO (K) -RDIFF 
X2 (K) =BSUM 
XO (K) =XO (K) ¢RDIPF*,5E0 
20 CONTINUF 
DO ALL TRANSFORMS FOR REMAINING VALUES OF C HAT. 
THAT C HAT AND CAP C-C HAT MUST BF PAIRFD 
CHCOSE A FREQUENCY INIEX 
DO 4O J=1,HALF M 
JO=I1 
Ji=M-J+1 
COMPUTE THE TWITNDLE FACTCR 
ANS LF=FUNO*FLCAT (J) 
C1=COS (ANGL®) 
S1=STN(ANGL™) 
C2=C1**2-S1 "42 


TWIDDLE FACTOR 


OBSERVE 


aaa 


a 


Remark on Algorithm 424 [D 1] 
Clenshaw-Curtis Quadrature [W.M.Gentleman, Comm. 
ACM 15 (May 1972), 353-355.| 


Albert J. Good |Recd. 19 December 1972] Systems, 
Science and Software, La Jolla, CA 92037 
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S$2=2, 21*S1*C1 
C CHOOSE THE REPLICATICN 
DO 30 KO=J30,N2,43 

K1=KO-6091 

C OBTAIN TWILDDLED VALUES 
RO=KG(K4) 
19=X5(K1) 
RI=CV*X4 (KC) -S1¥*X1(K 1) 
LV=SV*X1(K%) #C1#X 1 (K 1) 
R2=C2*X2 (KN) -S$2*X2 (K1) 
T2=S2*X2 (KC) #C2*K2(K1) 

C CCMPUTE TRANSFORMS AND PETURN IN PLACE 
RSUM=P1+P2 
ROIFF=(R1-22) *HAFRT3 
RSUM2=R9-.559*RSUM 
ISUM=I1¢12 
IDIFF= (11-12) *HAPRT3 
IDTFR2=I9-.5FC#ISUM 
XO (KS) SROeRSIM 
XM (K1) =RSUM24¢IDIFF 
X1(K) =RSUM2-IDTFE 
X1(K1) =2DTFE¢IDIFF2 
X2(K9) =RDIFF-INIFF2 
X2(K1) =19¢TSUM 

30 CONTINUE 
4Q CONTINUE 
RETURN 
END 


As published, this algorithm will not execute correctly under 
some compilers (e.g. Fortran V in the Univac 1108). One minor 
change is sufficient for proper operation: replace the variable J REV 
by the index J8 inside the DO 120 loop. 

The appearance of J REV and J8 in an EQUIVALENCE state- 
ment is not meaningful since the memory location associated with 
a DO loop index does not always contain the current value of the 
index (this depends on the compiler). 


ACM Transactions on Mathematical Software, Vol. 5, No. 2, June 1979, Pages 240. 


REMARK ON ALGORITHM 424 
Clenshaw-Curtis Quadrature [01] 


[W.M. Gentleman, Comm. ACM 15, 5 (May 1972), 353-355] 


K.O. Geddes [Recd 1 February 1978 and 17 April 1978] 
Department of Computer Science, University of Waterloo, Waterloo, Ont., Can- 
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This algorithm may be used to compute the Chebyshev series coefficients for a 
function F which is continuous on the interval [—1,1], as noted in [1]. For this 
purpose, function CCQUAD would be called with A = —1, B = 1, and appropriate 
values of TOLERR and LIMIT. (For some applications, one would prefer instead 
to state the number of Chebyshev series coefficients desired.) The comments in 
function CCQUAD indicate that the array CSXFRM contains, on return, N = 
USED -— 1 times the discrete cosine transform of F. Therefore, the values 


CSXFRM(K)/N, 1s K = NUMBER, 
for some NUMBER <= USED, should be estimates for the first NUMBER 


Chebyshev series coefficients of F. 


However, the published code produces an array CSXFRM with an incorrect 
sign on each value CSXFRM(K) for K even (i.e. the odd Chebyshev series 
coefficients will all have incorrect signs). This error does not affect the value of 
the definite integral computed by the algorithm because only the even terms in 
the Chebyshev series enter into the computation of the definite integral. The 
error does, however, affect the stated claim that “because the cosine transform is 


an explicit representation... 


, an approximation to the indefinite integral .. . can 


be obtained from the indefinite integral [of the truncated Chebyshev series].” 


The error can be corrected as follows. 


COLLECTED ALGORITHMS (cont.) 424-P 4-0 


Change the eighth and ninth executable statements 
CSXFRM(1) = F(A) to CSXFRM(1) = F(B) 
CSXFRM(7) = F(B) to CSXFRM(7) = F(A) 
Change the statements one and four lines below this 
SHIFT = WIDTH+RT3+.5EO to SHIFT = —WIDTH*RT3+.5EO 
SHIFT = WIDTH:«.5EO to SHIFT = —WIDTH*.5EO 
Change the second and fifth statements following the eight nested “DO 120” 
statements 


SHIFT = WIDTH*COS(ANGLE) to SHIFT = -WIDTH*COS(ANGLE) 
SHIFT = WIDTH*SIN(ANGLE) to SHIFT = —WIDTH*«SIN(ANGLE) 


REFERENCES 
1. GEDDEs, K.O. Near-minimax polynomial approximation in an elliptical region. SIAM J. Numer. 
Anal. 15 (1978), 1225-1233. 
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Algorithm 425 


Generation of Random Cor- 
related Normal Variables [G5] 


Rex L. Hurst 

Applied Statistics-Computer Science, Utah State 
University, Logan, UT 84321 

and 

Robert E. Knop* [Recd. 12 March 1970, 23 March 1971, 
and 9 Nov. 1971] 

Department of Physics, Florida State University, 
Tallahassee, FL 32306 


Key Words and Phrases: random number, normal density, 
normal distribution, Gaussian density, Gaussian distribution, 
simulation, Monte Carlo 

CR Categories: 5.5 


Description 
We have programmed and made timing comparisons for two 
algorithms which sample the multivariate normal density 


N(u, V) = | V7 |/(2n)*-exp(— 1/2(¥ — «)” VY = w)) (1) 


where V is ann X 7” covariance matrix, u is an 7 component vector 
of means, and Y is an» component random vector [1]. 

The first algorithm proceeds by rotating coordinates to a system 
in which the covariance matrix is diagonal. In this system the multi- 
variate normal density becomes equal to the product of its marginal 
densities, and each marginal density can be sampled independently 
of the others. After obtaining a sample vector in this rotated system, 
the coordinates are rotated back to the original system. In the 
following discussion this will be referred to as the matrix diagonali- 
zation algorithm [1]. 

The second algorithm proceeds by decomposing the multi- 
variate normal density into the product of the marginal density of 
the first variate times the joint density of the remaining variates, 
conditional upon the value sampled for the first. This joint density is 
determined once the first variate has been sampled from its marginal 
density. The procedure is then applied to the second variate and 
iterated until values have been assigned to all components of the 
sample vector. In the following discussion this will be referred to as 
the conditional decomposition algorithm [1]. 

Both algorithms require that the covariance matrix be positive 
definite, and that it modify the argument /ENT to indicate if this 
condition was not satisfied. Both algorithms perform extensive cal- 
culations on the covariance matrix the first time it is used. Subse- 
quent sample vectors with the same covariance matrix bypass these 
calculations with considerable savings in execution time. Tests with 
eight variables produced the following execution times on an IBM 
360/44: 


* This work was supported in part by the United States Atomic 
Energy Commission. 
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1 Matrix 1 Matrix 200 Matrices 
500 1000 1 
Observations Observations Observation 
Matrix 37 sec 72 sec 143 sec 
diagonalization 
Conditional 35 sec 68 sec 14 sec 
decomposition 


We note that the conditional decomposition algorithm executes 
more rapidly in all cases. 

Matrix Diagonalization. Suppose we define A to be the desired 
correlation structure; A can always be represented as BI B7. We 
know the characteristic values \,; of A are defined as the roots of the 
characteristic equation 


|4 — rAd | = 0. (2) 


The characteristic vector is a vector not identically zero satisfy- 
ing, for characteristic value A; 


(A — AW)X; = 0. (3) 


If A is symmetric, all roots different, and X; are normalized, 
then 


XTX; = 83 
where 4;; is the Kronecker delta. Let C be the matrix of characteris- 
tic vectors and D be a diagonal matrix of the characteristic roots: 


C=(X, X2°-°:] 


NT 0 aie 


Then 
cC’C =landCC’ =1. (6) 


The matrix C is thus orthogonal [2]. 
For an orthogonal matrix C and a symmetrix matrix A 


CTAC = DanddA = CDC’, (7) 
therefore 
A=CD IDC’, (8) 


and we see that the matrix required to transform a set of independent 
normal variates to a new set with correlation matrix 4 is B = C D, 
If A is distributed according to N(O, A) (cf. (1)) and we define: 


a, O «rs ba 
S=/0 oa and w=] me 


then (SZ + yz) is distributed according to N(u, 2) where 2 is the 
variance-covariance matrix. To save computational time the matrix 
B may be defined 


B=SCD} (10) 


Subroutine RANVR receives a correlation matrix A, a vector of 
desired standard deviations SD, a positive definite test variable 
IENT, an argument for a random number generator [ARG, variables 
for defining the order of A(NV) and the order of the arrays used 
NI, and work arrays X, Y, and Z. Z is the return array. Upon return 
the diagonal of A contains the roots and the columns of X 
the vectors. 

It requires a subroutine for computing characteristic values and 
vectors for real symmetric matrices [3-7], a subroutine for generat- 


COLLECTED ALGORITHMS (cont.) 


ing random normal deviates [8-12] which in turn requires a sub- 
routine for generating random uniform numbers [13, 14]..We use a 
modification of Seraphin [14], which allows the generation of 
different sequences by modifying an entry argument. 


Calling sequence (BZ desired means) 


IENT = —1 


CALL RANVR(A, X, Y,Z, SD, NV, NI, IENT, IARG) 
IF (IENT-LE-0)  GOTO5 
DO41=1,NV 

4. ZY) =Z) + BZD 


5 Error handling if not positive definite. 


Conditional Decomposition. To achieve the conditional decom- 
position of the multivariate normal density N(O, V), we begin by 
partitioning the covariance matrix into the scalar v,, the 
1X (2 — 1) and (2 — 1) X 1 vectors Vi. and V2 , and the (#” — 
1) x (n — ]) matrix V2. : 


Vu{[ "2 Vie (11) 
( Vu ye) 
The inverse covariance matrix we represent as: 
R 
yon { "u 12 (12) 
& “ 


From V V~! = I we obtain the following relations: 


1/vun = ni + RR Ra ; 
Roo = (Vee — Vie Vor/vi1)7}. 


The quadratic form of the multivariate normal density N(O, V) can 
be written as: 


yTyoy = y,7) ( ™ Rie yi (14) 
(1 27) Gs ee) € 


Multiplying this out results in 
YTVIUY = yurtyi + (Ye? R2Ye + YoTRayr + yiRi2¥2). (15) 


Performing the matrix analog of completing the square on the term 
involving ¥2 allows this to be written as 


(13) 


YTV7Y = yin — RR Ra) 


oe e 16 
+ (Yo — (Raz Ray) TR Y2 — Ry Ray). (16) 

Substituting from (13) we obtain 
YTV1Y = y2/vy + (Y2 —- Voiy1/V¥i1) TV 22 — VoiViy2/Vi1) (17) 


(Yo — Voryi/v11). 


Thus the multivariate normal density N(0, V) can be separated into 
the marginal density N(O, vi1) of the variate y, times the joint den- 
sity 


N(Varyi/ vir, V22 — VoiVie/v11) (18) 


of the vector Y2 conditional upon y»,. This procedure is then re- 
peated until every component of the random vector Y has been 
assigned a value. 

Subroutine RNVR receives a covariance matrix A, a positive 
definite test variable JENT, an argument for a random number 
generator IARG, variables defining the order of A(NV) and the 
order of thearrays used N/,and work arrays X, B, C. Xis the return 
array. 

It requires a subroutine for generating random normal deviates 
which requires a subroutine for generating random uniform 
numbers. 
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Calling Sequence (BZ desired means) 


IENT = —1 


CALL RNVR (Z, A, Y, C, NV, NI, IENT, IARG) 
IF (IENT.LE.0) GO TO 5 
DO41=1,NV 

4 2D) =Z(f) + BZD 


5 Error handling if not positive definite. 
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Algorithm 


SUBROUTINE RNVR(XsAsBoCoNVs NI» TENTs IARG) 
GENERATES A RANDOM NORMAL VECTOR (M.S) 
A INPUT COVARIANCE MATRIX,» CONDITIONAL MOMENTS KETURN 
Z»Y¥eC» WORK ARRAYS» RETURN VEC1OR OF RAND@M NOKMAL VARIABLES IN Z 
NVONI ORDER OF COVARIANCE MATRIX, ORDEK OF ARRAY 
TENT ~t= INITIAL ENTRY 
O= RETURN IF NOT POSITIVE DEFINITE 
1= RETURN IF POSITIVE DEFINITE 
TARG ARGUMENT FORK KANDOM NUMBEK GENERATOR 
DIMENSION XCNIDSACNIONID SBONI Do CONT) 
IFCIENT) 15926 
«ee COMPUTE CONDITIONAL MOMENTS 
1 NASNV-1 
O0@ 4 K=1+NA 
T2ACK,K) 
1FCT) 1021052 
2 NB=K+1 
CCK) #SO@RTCT) 
De 3 I=NBsNV 
3 ACL AKI PACKS L/T 
D2 4 I=NBeNV 
D@ 4 Jslsnv 
4 ACLs J FACT se JD“ACIs KI PACKS JD 
IFCACNVSNV)) 1021055 
5S IENT=t 
CCNV) *SQRTCACNVeNV) > 
C ##e COMPUTE A RANDOM VECTOR 
6 D8 8 I=teNV 
BCT >®RNORCIARG) #CC( TI) 
XC1T>SBCT) 
IFCI.EQ@-3> GO TO 8 
NB=I-1 
08 7 J=1,NB 
XCTIEXCTD+ACT » J2*BC UD 
CONTINUE 
RETURN 
TENT=0 
RETURN. 
END 


AaANAAAAAA 


a 


- 
owunn 


FUNCTION KNORKCIRD 
GENERATES A RAND@M NORMAL NUMBER (0,1) 
IARG IS A LARGE @DD INTEGER FORK A BEGINNING ARGUMENT 


aa 
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Cc REQUIRES FUNCTION RN WHICH GENERATES A UNIF@RM KAND@M NUMBER O-1 

DATA 1707 
IFCI-GT-0)68 TO 30 

10 X#2.O08RNCTIR-1.0 
Y=2.0*#RNCIRI-1.0 
SEXex+ynay 
IFCSeGE-€t-0))968 Te 10 
5=S@RT(-2.0*#ALOG(S)/S) 
RNORIX*S 
GQ2=Y*eS 
Ila} 
G@ T@ 40 

30 RNOR*GO2 
I=0 

40 RETURN 
END 


Remark on Algorithm 425 [G5| 

Generation of Random Correlated Normal Variables 
{Rex L. Hurst and Robert E. Knop, Comm. ACM 15 
(May 1972), 355-357] 


R.L. Page [Recd. 3 Oct. 1973] . 
Computer Science Program, Colorado State University, 
Fort Collins, CO 80521 


The work array parameters B and C of SUBROUTINE RNVR, 
which may prove cumbersome for some users, may be removed 
by making some minor changes. The removal of C is simple: simply 
change references to C(/) to A(/, J). (The diagonal of A is presently 
unused once the conditional moments are computed.) 

The vector X can be used in place of B provided its components 
are computed in reverse order. Thus, DO loop 8 (starting at state- 
ment 6) becomes two separate loops as shown below. 


6 DOTI = 1, NV 
7 Xi) = RNOR(UARG)+A(I, 1) 
DO 81 = 2,NV 
NB = NV-I+1 
DO 8 J = 1, NB 
8 XCUNB+1) = X(NB+1)+A(NB-+1, J)*X(J) 


The revised algorithm was tested on covariance matrices of 
orders two through six. Assuming the algorithm generates sample 
vectors from the zero mean normal distribution with the given co- 
variance, the difference between the sample covariance and the 
given covariance, divided by the standard error of the covariance 
estimator, would give samples from a standard normal distribution. 
Our test did not contradict this assumption since 37 of 55 of these 
numbers, 67 percent, were in the range —1 to 1 (one would expect 
about 68 percent) and 54 of 55, 98 percent, were in the range —2 
to 2 (one would expect about 95 percent). 


425-P 3- R1 


COLLECTED ALGORITHMS FROM CACM 


Algorithm 426 
Merge Sort Algorithm [MI]] 


C. Bron (Recd. 4 Feb. 1970 and 10 May 1971) 
Technological University, Eindhoven, The Netherlands 


Key Words and Phrases: sort, merge 
CR Categories: 5.31 


Description 

Sorting by means of a two-way merge has a reputation of re- 
quiring a clerically complicated and cumbersome program. This 
ALGOL 60 procedure demonstrates that, using recursion, an ele- 
gant and efficient algorithm can be designed, the correctness of 
which is easily proved [2]. Sorting # Objects gives rise to a maximum 
recursion depth of [loge(# — 1) + 2]. This procedure is particularly 
suitable for sorting when it is not desirable to move the 7 objects 
physically in store and the sorting criterion is not simple. In that 
case it is reasonable to take the number of compare operations as a 
measure for the speed of the algorithm. When » is an integral power 
of 2, this number will be comprised between (7 & log») /2 when the 
objects are sorted to begin withand (n X loge — 1+ 1) as an upper 
limit. When is not an integral power of 2, the above formulas are 
approximate. 

It is assumed that each object can in some way be uniquely 
identified by one of the integers from 1 to 7. This correspondence 
has to be supplied in the call by replacing /i and /o by two integer 
variables and the Jensen parameter /oafterhi by a Boolean expres- 
sion that yields the value true if the object identified by /o has to 
follow the object identified by Ai in the ordered sequences, and 
false otherwise. Let e; be the identifying integer of the ith object in 
the ordered sequence. Upon return from the procedure sort delivers 
the value of e, and the pointer array prt will be filled in such a way 
that pntle;]) = ei4,, 1 < i <n, and pntle,] = 0. Therefore the 
bounds of the actual array supplied for prt will have to include the 
range [l:n]. Sorted subsequences that arise during the sorting 
process will have a similar chain structure. 

The essence of the algorithm is to be found in the procedure 
head. It has the duty to form an ordered chain of desired length 
(deslen) from the objects identified by count + 1 through count + 
deslen. It does so by introducing a chain of length 1, consisting of 
object count + 1, and then repeatedly doubling the length of that 
chain by merging it with a chain of equal length the creation of 
which is left to a recursive call on Aead. If des/en is not an integral 
power of 2, a chain of length des/en can not be built by repeatedly 
doubling. In that case, before the last merge operation, a chain of 
length (desired length — present length) is created and merged with 
the present chain to produce the required result. 

As an example of a call on the sorting procedure we supply 
sort(10 000, chain, i, j, Ali] > Al[j]) although it should be stressed 
that the present version of the algorithm is not efficient when the 
sorting criterion is as simple as a comparison of two array elements. 
In such a case one does not only gain by replacing the calls on the 
formal parameter Joafterhi by A{lo] > A[hi] and declaring /o and hi 
as local variables of the procedure sort, but also one might resort to 
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in situ sorting techniques like [1] that do not need the auxiliary 
array pnt. A comparison of this algorithm with QUICKERSORT 
[1] conducted under equivalent circumstances on the ALGOL sys- 
tem for the EL X8 showed no significant difference in speed when 
sorting arrays containing random numbers. 

Acknowledgment. The author is grateful to Prof. E.W. Dijk- 
stra for his contributions to this version of the algorithm, and to the 
referee for his careful analysis and valuable suggestions. 


References 

1. Scowen, R.S. Quickersort, Comm. ACM & (Oct. 1965), 669-670. 
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Algorithm 
integer procedure sort(n, pnt, lo, hi, loafterhi) ; 
value 7; integer 7, /o, hi; integer array pnt; 
Boolean loafterhi; 
begin 
integer count, link; 
comment /ink is a working location for merging; 
integer procedure head(des/len) ; 
value deslen; integer deslen; 
comment The value of Aead will be the identifying integer of the 
object leading the sorted chain; 
begin 
integer beg, len, nextlen; 
INTRODUCE NEW CHAIN OF LENGTH 1: 
SUPPLY WITH END MARKER: 
MAKE beg POINT TO ITS HEAD: 
beg := count := count + 1; pnt{[beg] := 0; len := 1; 
TEST: TO SEE WHETHER DESIRED LENGTH HAS BEEN 
REACHED: 
if len < deslen then 
begin 
nextlen := if len < deslen — len then len 
else deslen — len; 
INTRODUCE NEW CHAIN: 
hi := head(nextlen) ; 
AND START MERGING: 
FIND LEADING OBJECT OF MERGED CHAIN: 
lo := beg, 
if loafterhi then 
begin beg := hi; hi := lo; lo := beg end, 
INITIALIZE CHAIN ON MECHANISM: 
link := lo; 
CHAIN ON: 
lo := pnt{link]; 
TEST FOR END OF lo CHAIN: 
if /o ~ O then 
begin 
ADD LINK TO CHAIN: 
if loafterhi then 
begin 
SWITCH LINK TO hi CHAIN: 
pnt\link] := link := hi; hi := lo 


end 
else 
STEP DOWN IN lo CHAIN: 
link := lo; 
go to CHAIN ON 
end; 


COLLECTED ALGORITHMS (cont.) 


APPEND REMAINING TAIL: 
pnillink] := hi; 
len := len + nextlen; 


go to TEST 
end; 
head := beg 
end head; 
count := 0; sort := head(n) 
end sort; 


Remark on Algorithm 426 
Merge Sort Algorithm [M1] 
(C. Bron, Comm. ACM 15 (May 1972), 357-358] 


C. Bron [Recd. 5 Nov. 1973] 
Technological University of Twente, P.O. Box 217, 
Enschede, The Netherlands 


A remark in [3 p. 158] suggested to the author that Algorithm 
426 needs only very minor modifications in order to handle the 
sorting of records that are chained to begin with. The algorithm then 
rearranges the chain and needs no additional array to store chaining 
information. Furthermore, the assumption that we should be able 
to associate each of the integers from 1 to n with each of the n 
records to be sorted is no longer necessary [2]. 


References 


1. Bron, C. Algorithm 426, Merge Sort Algorithm. Comm. ACM 
15 (May 1972), 358. 
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Algorithm 427 
Fourier Cosine Integral [D1] 


Peter Linz (8 June 1970, 3 Dec. 1970, and 11 Feb. 1971) 
Department of Mathematics, University of California, 
Davis, CA 95616 


Key Words and Phrases: numerical integration, quadrature, 
adaptive quadrature, Filon quadrature, Fourier coefficients, Fourier 
integrals 

CR Categories: 5.16 


Description 
The function FRCOS approximates 


Chay I 1(0) cos (wt) at 
0 
by numerical evaluation of 
T 
Cr(f,o) = f(t) cos (wt) dt. 
0 


The calling parameters for the function are: 
1. FC is thename of the function subprogram, supplied by the user 
which computes f(r). It is assumed that f(r) is bounded in [0, ©) 
andissuch that lim Cr (fw) = Cf). 

To 


2. W represents w. It will normally be positive, although w = 0 will 
be handled correctly. In the latter case the algorithm reduces to an 
adaptive Simpson’s rule. There is, however, some inefficiency in this 
because the cosine routine is used to compute cos(0.0) and some 
additional bookkeeping is done. The inefficiency may become signifi- 
cant if the time taken by the cosine routine is comparable to the time 
required to evaluate f(t). The program will not work correctly for 
negative w. 

3. T should be chosen such that 


Cr(f, w) = C(f, w) 
within the required accuracy. The program actually evaluates 
Cra(f, w) where 7A is chosen as follows: 


(a) if 27°27 < wT < 2"2r, forn > —9, 
then TA. = 2°+27/w, 
(b) if wT < 27/512, then TA = T. 


If an upper limit 2"2z is desired without adjustment, the 7 specified 
should be slightly smaller than this number (to avoid round-off 
error problems). 

4. ET specifies the required (absolute) accuracy. The routine 
attempts to compute an answer which differs from Cra (f, w) by less 
than ET. 

5. HL represents an upper limit on the stepsize; the integral over an 
interval is not considered to have converged unless the size of the 
interval is less than HL. Normally, HZ can be chosen quite large, say 
T/10. However, when the integrand has a sharp peak, the choice of 
HL may be difficult. If it is chosen too large the peak may be missed 
altogether; if it is chosen small the computations become inefficient, 
since the limit is enforced everywhere. In such cases it might be 
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preferable to use a variable HL, computed by means of a subpro- 
gram. FRCOS can be modified easily to do this. 

The computations are done by means of an adaptive quadrature 
method described in detail in [1]. In summary, the approximate 
value of the integral over an interval {a, 5], denoted by f, is com- 
puted as follows: 


(1) Ifb—a< 1/256, Simpson’s rule is used. 
(2) If 1/2560 < b — a < 2n/w, Filon’s method (referred to as 
FILON 2 below) is used. Here f is computed by 


I = h{w,cos (wa) + we sin (wa)} f(a) 


+ hw3cos (“SS =i (‘ z *) 


+ h{w, cos ae — wesin (wb)} /(d), 


where 
h = (b —a)/2, 
7 | 4 | \ 
wilwh) = hat Joos (2wh) — he cos (wh) sin (wh) + 3 { : 
1 . 4, 
We(wh) = opis - sin (2wh) + i sin? (wh) — 2h : 
w3(wh) = Te sin (wh) — cos win 


In the routine weights are needed only for wh = 1/2?, p = 1, 2,..., 
9. They have been precomputed to 14 significant digits and are 
stored in the arrays WIC, W2C, W3C, such that W1C(1) contains 
wi(r/2), W1C(2) contains w,;(r/4), W2C(1) contains w.(7/2), etc. 
If higher accuracy is required the computation of the w’s from the 
above formulas must be done with some care, since for small w/ 
large cancellation errors may occur. The use of multiple precision is 
recommended. See one may use the series expansions 


2k 2-2 
wi(wh) = D(- yo a EDr ns a eh) 
w(wh) = y(-1 Gi ap wi, 

= 8i 2-2 
Ww3(wh) = — 2d (—1)? Gre)! mety (wh) 


(3) If b — a = 2nr/w, a special case of Filon’s rule (called FILON 
1) is used. Here - 


spe a+b \ 
eae ya ~ (25 J+ rot, 


The error is estimated by halving each interval and comparing 
the two estimates thus obtained. We denote by / the integral over 
[a, 6], by J) and J, the approximations with stepsize (6 — a)/2 and 
(6 — a)/4, respectively and write 


h=I+e, 
h=I+e. 


If we know a such that 


€) & Qé0, 
then 
=~ a(Io <= 1,)/d = a). 


A given interval is split into parts until the estimated error is below 
a certain bound; once this is accomplished its contribution is added 


COLLECTED ALGORITHMS (cont.) 


to the total integral and the next interval is considered. The error 

“allotted” to each interval depends on the size of the interval as well 

as on an estimate of the errors of all previously converged intervals. 
The ratios a used in the error estimation are derived in [1]. 

The final expressions are: 


(1) for Simpson’s rule a = 1/16, 
(2) for FILON 1 


__ (b — a)#/32 — 6/es® 
(6 = a)?/8 — 6/0?’ 
(3) for FILON 2 
12 sin (p) — p?sin (p) — 6p — 6p cos (p) 
12 sin (p) — 4p? sin (p) — 12 cos (p) 


a(p) = 


where p = w(b — a)/2. 


For FILON 2 the a’s are needed for p = 7/2?, p = 1, 2,..., 9. 
They were precomputed and stored in the array ER, with ER(1) 
containing a(m/2), etc. 

Computed values of FC are saved for later use, and it is possi- 
ble that the space assigned for this purpose is exhausted before the 
computations are completed. In this case the routine returns with an 
error indication. (In the present implementation the value of FRCOS 
is set to 1.0 10°, although this may be changed to suit the user.) 
Usually this occurs only if the routine is used improperly (e.g. ET 
has been ‘specified so small that, due to round-off errors, the accu- 
racy criterion cannot be met). While the assigned space appears to 
be adequate for most purposes, the user can easily change this by, 
say, doubling the sizes of the arrays FS, PVAL, and AS, and chang- 
ing the overflow test. 

The user should keep in mind that such an adaptive approach 
does not guarantee that the final answer has an error less than ET; 
accidental (false) convergence is always a possibility. While empiri- 
cal evidence suggests that FRCOS is relatively immune to this, some 
examples. of false convergence were encountered during the test of 
the algorithm. The user should always try to safeguard against this 
possibility, for example by making F7 smaller than required, or by 
doing the computations twice with different values of ET and HL. 


References 
1. Linz, P. An adaptive quadrature algorithm for Fourier cosine 
integrals. (Unpublished manuscript available from author.) 


Algorithm 


FUNCTION FRCOSCFCowel oF Tot) 
THES ROUTINE COMPUTES THE FOURIER COSINE INTEGRAL FROM 
ZERO TO INFINITY OF FCLTPECOS( wet) BY AN ADAPTIVE 
QUADRATURE METHOD USING A COMBINATION OF FILON AND 
SIMPSON RULES 
PARAMETERS 
FC -mMUST BE DECLARED EXTERNAL IN CALLING PROGRAM 
W -VALUE MUST BE NON-NEGATIVE 
T “UPPER LIMIT FOR QUAORATURE-SHOULD NORMALLY BE CHOSEN 
SUCH THAT REST OF INTEGRAL 15 NEGLIGIBLE. THE ACTUAL 
LIMIT USED BY THE PROGRAM MAY BE SOMEWHAT LARGER THAN 
THE GIVEN T(SEE INTRODUCTORY COMMENTS) « 
ET -REQUESTEO ACCURACY (ABSOLUTE } 
HL -LIMIT ON STEP SIZE-CONVERGENCE IN ANY SUBINTERVAL IS 
NOT RECOGNIZED UNLESS SUBINTERVAL 15 SMALLER THAN HL 
DIMENSION W1C19) oW2C19) owW3C19) eERID)D 
ARRAYS EReW1Cew2CoW3C CONTAIN PRECOMPUTED CONSTANTS 
NEEDED TO COMPUTE APPROXIMATE VALUES ANDO ERROR 
ESTIMATES FOR FILON2Z(SEE COMMENTS) « 
DATA ERC1) cER12) cERC3) cER1 4) ERIS) cER(G6) DERI TI CERIO)» 
S$ ERI9)/ 00005061 60099692 2061810e06233e «062460 
SB 006249 6 006249 -0625/ 
DATA WICELDoWICE ZI CWICE SD eMICISD owWLC(S) owWICIGdoWICIT)e® 
. WICCADOWICI9I7 
4605284736569 34E-01 0 326761020 369133E-Ol> 
3464316760755 7TH1E=-01 530395675332 34962E-Ols 
90 3397411782348E-01 6 30 3349386085934E-Ols 
90 3337346859464 89E-01 030339343372 7O2Z12E-Olse 
303333584327653E-01/ 
DATA WZCULD oW2CE2) owW2C13) ow ZCI S) ow2C(S) ow2CIOd ew2CIT)s 
wW2C(Bd ew2CI9)/ 
102059522143639E-01 » Le9TIOSIOISIOPTE-02>% 
2e6328277TOS2505E-035 363459141 70832 3E-040 
4019970860777 7 7E-05 0 5025506003065 70E-066 
605705211443498E-07 0 Be 21 36815416350E-06e 


nan ARARRARANHABANANR 


VRRHAS 
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$ 120267267595664E~08/ 

DATA W3C11) oW3C12) oWIC13) WICKS) oWICIS DoW IC(6) oWICIT IS 
8 w3C(8) ow3C(9I7 

BS  1003204910186240 16252678001 54900 10 3128845799752, 

S$ 1032819998 7155763 0 3320486700792 51033301 80847949, 

© 19 333253016919 2 5103399132536 798 01 e 33332631 93097/ 
OIMENSION FS(61)5PVAL(30)sAS(30) 


C ARRAYS FSePVAL SAS ARE STORAGE FOR SAVEO VALUES OF F 
C AND BOOK-KEEPING 

DATA PI20P1256/602831865 3071796 020122718463/ 
C P12=20P1, P1256=9P1/256 

DATA ALN2 cERC oROC/ 0693147180} 0E%3001 cE -5/ 
C ALN2Z@NATURAL LOG OF 2-ERC#ERROR VALUE RETURNED 
C BY FRCOS»ROCSCONSTANT USED TO ELIMINATE ROUNDOFF 
C PROBLEMS IN COMPUTING INTERVAL LIMITS 

Epse ET 

VAL® Oc 

Me 1 

AS01120. 


FStlis FC(O60) 
PVAL (1 SERC 
C TEST IF UPPER LIMIT ADJUSTMENT 15 NECESSARY 
wteweT 
IF (wt-P1256eROC ) 10001000101 
NOTE=CONSTANT ROC#1.E-5 USED THROUGHOUT PROGRAM TO 
ELIMINATE EFFECT OF FLOATING POINT ROUNDOFF ERROR 


SET UP FIRST IMTecRVAL FUR SIMPSON RULE 
YOO FSE2d2FC re S#TIOCOS( Sew) 
FSC3) = FCCTI® COStwT) 
Ret 
GO TO 1N5 
ADJUST UPPER LIMIT 
1OY NPe TFIX(ALOR(WT/P1 256) /7ALN2) 0} 
TAs 2®@NP@ PI 256/0 
AsTa 
€ SET UP FIRST INTERVAL FOR FILON RULE 
FS(2)= FC(.5@TA) 
FS(3re FCITA) 
TAKE LAST INTERVAL FROM LIST 
105 AsAS(N) 
HL seB-A 
WH ewer! 
N2=2*N 
Fis FStN2-1) 
F22 FStN2) 
F3e FSIN2¢1) 
xO=e B-.75eH! 
XQ3 = Boe 254H! 
© YEST TO DETERMINE WHICH QUADRATURE RULE IS APPLICABLE 
TF ¢ WHE - P1286 -ROC ) 11001100111 
110 IFC WHE - PI256 #ROC ) 200+200+201 
311 TFC wHt - P12 “ROC 4 2202220+230 
¢ ESTIMATE BY SIMPSON RULE 
200 FQs FCEXQ)eCOS( wexd) 
FO3FFCOKXQ3)#COS (wexQ3) 
VNEWL® HI®(Fleae#FQeF 21/126 
VNEW2= HI®(F2 +4 o*FQOI+F 3/126 
VNEW= VNEWL*VNEW2 
ERR= (PVALINI-VNEWI/15~ 
GO TO 300 
© SwiTCH FROM FILON TO SIMPSON RULE 
201 Fl = Fl® COStWeA) 
F2 = F2@ COStwe i B-eS#HI}) 
FR = F3e COS(weB) 
PVALIND= HIO(F 1) +40 #F 24F 3) /66 
GO TO 200 
Cc ESTIMATE BY FILON2 
220 He. 2758HI 
Foe FCUXKQ) 
FOI: FCIXKO4) 
NHe TF IXCALOGIOL2sweHl) /ALN2¢ROC D6] 
Wie WIC(NH) 
W2e-W2C (NH) 
W3= wWw3C (NH) 
WAeweAa 
wAl=twe(R-.S#H!) 
wReweB 
COs COS(WAL) 
Stl© SIN(WAL) 
VNEWL = Hei wWlECOS(wAleWw2eSIN(wADIOF] + w38#CUSt(wexQ)@ 


n” ANnAN 


-™ 


$ FQetwl*COl-w2*Si)])eF 2) 
VNEW2 = HOC CWLECOL *a2*STLIOF2 © w3IFCOS(wexQsdeF Os 
$ +CwleCOSt(wB) -w2"eSIN( wh) )@F 3) 


VNEWEVNEWlOVNE Ww 
FRrs FRENH) 
FRR « ERT@(PVALINI-VNFwWI/( 1 e-ERY) 
¢ SKIP CONVERGENCE TEST TF INTERVAL= ONE PERIOD 
IFiwHI- P12+ ROC ) 30013002400 
C ESTIMATE BY FILON) 
230 FO=sFC(xKO) 
FO3BFC(XQ3? 
W2eW*wW 
CONSTa#Be/(W2EHE) 
VNEWle CONST@(F 1=-2¢8FQ+F2) 
VNEW2= CONST#(F 2-264FO3+F 3) 
VNEW= VNEWL+VNEW2 
W226. /W2 
W3eHI eH] 
ERT (w3/32e-w2)/(W3/8e-w2). 
ERRe ERT#CPVAL(NI-VNEW)/ (1 0¢-ERT) 
C CONVERGENCE TEST 
C SKIP CONVERGENCE TEST IF HI eGTetHl, 
300 IFIHI= HL) 3013012400 
30] TFLABS(ERRI-EPS*HI/B) 50065002400 
C CONVERGENCE NOT OBTAINED -SPLIT INTERVAL ANO AOD 10 LIST 
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C TEST FOR POSSIBLE LIST OVERFLOW 
400 IFO(N-30) 4012600600 
a0) FSON2¢3)8 FA 
FS(N2¢2)= FQ3 
FSIN24+1)= F2 
FStN2)= FO 
AS(UN4] ) a At,5eH] 
PVAL (CN) a VNEW] 
PVALUN®] De vNEW2 
Nene] 
GO TO 105 
CONVERGENCE OBTAINED -ADD FXYRAPOLATEOD PARTIAL SuM TO 
TOTAL=-ADJUST ERROR AND INTERVAL 
500 VAL® VAL *¢VNEW-ERR 
EPS = EPS-ABSIERR) 
NeN-] 
Baa 
TFN) 70027000105 
CONVERGENCE FAILURE -ROUTINE RETURNS ERC 816E 430 
OPTIONAL ERROR MESSAGE MAY BE INSERTED HERE 
600 FRCOSHERC 
RE TURN : 
C COMPUTATIONS COMPLETED SUCCESSFULLY 
TOO FRCOOS® VAL 
RETURN 
Eno 
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Algorithm 428 


Hu-Tucker Minimum 
Redundancy Alphabetic 


Coding Method [Z] 


J.M. Yohe* [Recd. 2 January 1970, 12 February 1971, 
and 21 June 1971] 

Mathematics Research Center, University of 
Wisconsin, Madison, WI 53706 


Key Words and Phrases: information theory, coding theory, 
Hu-Tucker method, minimum redundancy coding 
CR Categories: 5.6 


Description 

This algorithm implements the Hu-Tucker method of variable 
length, minimum redundancy alphabetic binary encoding [1]. The 
symbols of the alphabet are considered to be an ordered forest of 
terminal nodes. Two nodes in an ordered forest are said to be 
tentative-connecting if the sequence of nodes between the two given 
nodes is either empty or consists entirely of nonterminal nodes. 

An interval of nodes each pair of which is a tentative-connecting 
pair is called a tentative connecting string. 

Given an ordered forest, we create a new ordered forest with one 
less tree by combining a pair of tentative-connecting nodes f,, fs 
such that Q{f:] + Q[iz] is minimal. Such a pair is said to have mini- 
mal weight sum. The old nodes /; and /, are eliminated, and the new 
node replaces the first of the former nodes in the ordering. Its weight 
is the sum of the weights of the former nodes. 

The original forest will, after a finite number of steps, be con- 
nected into a single tree. This tree will not, in general, satisfy the 
order-preserving requirement. However, it is shown in [1| that the 
path lengths are feasible for the construction of a tree which does 
satisfy this requirement and is, moreover, minimal in cost. 

The present procedure finds a minimal cost tree whose longest 
path length and total path length are minimal. This was done for the 
nonalphabetic case by Schwartz [3], and his work carries over 
directly to the alphabetic case by virtue of the fact that any optimal 
alphabetic encoding can be constructed by the Hu-Tucker method, 
simply by modifying the choice of which tentative-connecting nodes 
are combined. This procedure therefore represents a modification of 
the Hu-Tucker algorithm to incorporate these ideas of Schwartz. 

During the procedure, the array L is used to determine which 
roots are tentative-connecting. If L is initially filled with 1’s instead 
of 0’s, any pair of nodes will be considered tentative-connecting, and 
the procedure will implement Huffman’s method [2], giving the 
same results as the “bottom merging” method of Schwartz and 


* Sponsored by the United States Army under Contract No.: 
DA-31-124-ARO-D-462. 
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Kallick [4]. This is because this procedure picks, among those pairs 
with minimal weight sum, the first- pair having minimal length sum. 

Mcdifying the procedure to pick the first pair having maximal 
length sum would be equivalent to the “top merging’’ method of 
Schwartz and Kallick, and would maximize the total number of 
digits and the maximal length of the code in alphabetic case (and in 
the nonalphabetic case, if the L-array is initially filled with 1’s). 

The decision tree may be obtained from the branch lengths by 
combining the first node of maximal path length with the second 
node of maximal path length to form a new node with path length 
one less than that of the original nodes, iterating the procedure until 
only one node (the root) remains. The code can then be constructed 
by assigning the value 0 to the first node on the next level from the 
root and 1 to the second node, appending 0 or respectively 1 to the 
ith level encoding of a node to obtain the encoding for the first or 
second son on the (i + 1)-th level. 
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Algorithm 

procedure Hutree (n, Q, L); 
value n; integer n; integer array Q, L; 

comment 7 is the number of symbols in the alphabet, and Q is a 
vector of length 1. Q[i| is the weight to be attached to the ith 
symbol] in the alphabet. 

The output of the procedure is the vector L of length n. L{i] is 
the length of the path to the ith symbol of the alphabet in a tree of 
minimal cost (i.e. the sum of the Q[i] < L[é] is minimal) which has 
the further property that, subject to minimality of cost, the sum of 
the L[/] and max Li] are minimal; 

begin 

integer maxn, m, i; 

integer array P[{1 : 7], s[l:n — 1],d[1:n— 1]; 

comment P is used to hold the weights of the trees in the ordered 
forest, beginning with the alphabet at the start of the procedure 
and ending with the tree at the ccnclusion of the procedure. L is 
used during the procedure to hold information relating to the 
length sums. At the conclusion of the procedure, L is used to 
return the path lengths. 

If (1 < i2 and nodes /1 and 72 are connected on the mth pass 
through the body of the algorithm, then P[/1] will be set equal 
to P[ii] + P[i2], which is the weight of the new node, and P{i2] 
will be set to zero to indicate that node /2 is no longer a partici- 
pating node. L|/1] is set equal to L[‘1] ++ L[i2] + 1, which is one 
less than the number of terminal nodes which are descended 
from the new node. This number is also one less than the incre- 
ment to the total path length which would result from connect- 
ing the new node il in a subsequent pass through the body of 
the algorithm. The value of L{i2] is irrelevant during the re- 
mainder of the procedure. The s and d vectors are used to 
record connections of tentative-connecting nodes. s[7] is set to 
i1 , which is both the ordered position of the leftmost node and 
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the ordered position of the new node, and d{m] is set to i2, 
which is the ordered position of the rightmost node. 

The variable maxn is set to a number which is larger than the 
sum of the elements of Q. 

The following simple example should be of some assistance 
in understanding the procedure. Assume the procedure is called 
with a = 5and Q = (3, 1, 1, 1. 3). The evolution of the vectors 
P, L, s, and d is shown in the following table. Values which are 
not relevant are indicated by dashes. 


4 


So 
_ 
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m 
Pit] 
P(2] 
P(3] 
P|4] 
P(5) 
L{1] 
L{2] 
L[3] 
L{4) 
L{5] 
s[m] 
d|m] 
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maxn := 1; 
for i := 1 step 1 until x do 
begin 


L{i] := 0; Plé] := Qi]; 
maxn := maxn + Qfi]; 


end 
comment Since there are 7 terminal nodes in the original forest, we 


must make exactly 7 — 1 connections. On each pass through 
the body of this procedure we will determine the next optimal 
connection. We initialize by setting the minimum weight to a 
large value to insure that any valid connection chosen will 
replace the bogus connection initially indicated; 


for m := 1 step 1 untiln — 1 do 
begin 


Bl: 


B2: 


integer j, /1, min1, minL1, 72, min2, minL2, pt, pmin, sumLt, 
sum, ii, i2; 

i:=0; 

pmin := maxn; 


i:= i+; 

comment At B2 we begin our scan of the next tentative-connect- 
ing string to find the most desirable pair in the string. If 
necessary, we skip over any previously absorbed nodes. We 
initialize the most desirable node to the first in the tentative- 
connecting string, and the record of the second most desirable 
node is initialized to reflect a very large minimum. This 
insures that any participating node will be more desirable and 
that valid information will replace the bogus information as 
soon as the next participating node is encountered. If the 
first participating node is the last node in the forest, or if no 
further nodes are participating nodes, then we have com- 
pleted our scan for the next tentative-connecting pair and we 
go to El to make the optimal connection; 


if i1 > 1 then go to £1 else 

if P[/] = Othen go to Bl; 

min2 := maxn; 

Al := 3; 

min£l := Lif]; mini := Pi; 

comment We now begin our scan of all remaining nodes in the 
current tentative-connecting string. The string will end as 
soon as we have examined a participating node which has not 
previously been combined. The purpose of this scan is to 
locate the optimal tentative-connecting pair in the tentative- 
connecting string. The optimal pair is defined to be that pair 


E2: 
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with minimal weight and minimal length sum which occurs 
first in the tentative-connecting string; 
for j := i + 1 step 1 until 1 do 
begin 
comment We check for P[j] > 0 to see whether the jth node is a 
participating node. If P[j] = 0, the node has previously been 
absorbed and we pass over the empty space; 
if P[j] > O then 
begin 
if P[j] < mini \VV (PlL/] = mini A Li] < minZ1) then 
begin 
comment If the jth node is “more desirable” than either of 
the previously most desirable tentative-connecting nodes, 
we record the previous most desirable node as the second 
most desirable node and record the jth node as being 
most desirable; 
min2 := mini; j2:= jl; minL2 := minL1; 
min\ := Pj]; fl := /; minL1 := Lj); 
end 
else if PLj] < min2 \/ (P[j] = 
begin 
comment If the jth node was not more desirable than the 
previous most desirable node, but is more desirable 
than the previous second most desirable node, we record 
the jth node as being second most desirable; 
min2 := P{j|;j2:= 7; minL2 := L{j]; 
end; 
if Z[/] = O then go to £2; 
comment If Z[j] = 0 then we have reached the end of the 
current tentative-connecting string, and we have found 
the most desirable pair in that string. We now go to 
compare it with the previous most desirable pair in the 
forest; 
end 
end; 


min2 /\ L{j] < minL2) then 


pt:= P[fl] + Plj2]; 

sumLt := L{jl] + L{j2]; 

comment We have now found the next tentative-connecting 
pair, namely the j1 and /2 nodes. Here, we test this new pair 
against the previous minimal pair to see whether the new pair 
is more desirable. The new pair is more desirable if its weight is 
less than that of the previous pair, or if its weight is equal to 
that of the previous pair and its length sum is smaller; 

if pt < pmin \/ (pt = pmin./\ sumLt < sumL) then 

begin 
pmin := pt; 
il := j1; 12 := j2; 
sumL := sumLt; 

end; — 

comment The next tentative-connecting string begins with the 
last participating node in the current tentative-connecting 
string. Hence we replace i by j and return to B2 to begin 
processing the next tentative-connecting string; 

b= J; 

go to B2; 

comment Upon reaching E1 the procedure has scanned all 
tentative-connecting pairs and the decision has been made to 
connect nodes in order positions 1 and i2. We switch i1 and 
i2 if necessary to insure that {1 < i2. We record the connec- 
tion by setting s[m] := il and d[m] := i2. The weight of the 
new node is placed in the weight table in position /1 (the 
order position of the new node). The weight in the order 
position of the second combined node is set to zero to indi- 
cate that the node has now been absorbed and no longer 
participates in the scan. L[{/1] is set to one less than the incre- 
ment to the path length sum which would result from con- 
necting the new node; 


COLLECTED ALGORITHMS (cont.) 


EF}: 
if i] > (2 then 
begin 
Jl := tt, il := i2;12:= /1; 
end; 


S[m] := il; d[m] := i2; 


Plil| := pmin; Pli2| := 0; 
Liil} := sumL + 1; 
end; 


comment s{ — 1] gives the ordered location of the root of the . 


tentative tree. We now generate the path lengths as follows: 
the path length to the root is zero, and if the path length to any 
node ts /, then the path length to each of its sons is ¢ + 1. The 
two sons of the node whose order position is given in s{m] lie in 
the order positions given in s{m]| and d[n]. Moreover, if an 
order position is given in s[} for nt < 2 — 1 then that order 
position must be listed in s{j] or d[j| for some j > m, so the path 
lengths obtained by this algorithm are well defined. 

Returning to our example, we now trace the construction of 
the vector of path lengths. This is shown in the following table. 
For the sake of clarity, the vectors s and d are now shown in 
reverse Order. 


m 43 2 1 
LUA] 122 2 
Li2} .- 1 1 2 3 
LI] - - - - 3 
Lia] ee = 22 
Lis] - - 2 2 2 
$[m] 112 2 
d[m] 2 5 4+ 3 


Thus the final value of the vector L is (2, 3, 3, 2, 2); 
L{s{a — 1]] := 0; 
form := n — 1 step —1 until 1 do 
L{s(m]]':= Lid{m]] := Lis(m]] + 1; 
end; 


Remark. on Algorithm 428 {Z| 

Hu-Tucker Minimum Redundancy Alphabetic Coding 
Method [J.M. Yohe, Comm. ACM 15 (May 1972), 
360-362] 


J.G. Byrne [Recd. 26 June 1972] Department of Com- 
puter Science, Trinity College, Dublin 2, Ireland 


Algorithm 428 was translated into Basic Fortran IV and run 
on IBM System 360/44 running under RAX. When the line just 
after the label B2: 


if il > » then goto E1 else 
was changed to 
if i > + then go to E) else 
the algorithm gave correct results for the example given and for the 
example in Gilbert and Moore [1]. In the latter case the cost 
defined as 
ia OL) 
Dh OW) 

and code: lengths were correct. 

When the ZL array was Set to 1’s on entry, the optimum (Huff- 
man) codes were obtained, and they were the same as those given 


by the Schwartz and Kallick [2] method as claimed in the author’s 
description. 
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Table I. 
Size of alphabet 10 27 60 


Time to find optimum alphabetic codes. 0-02 0-14 0-62 
(secs) 


Time to find optimum codes (secs) 0-02 0-08 0-34 


Table I, which gives the cpu time required, shows that the 
algorithm is very fast for small alphabets and that the time is 
approximately proportional to ?, as expected. 
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Algorithm 429 


Localization of the Roots of 
a Polynomial [C2] 


W. Squire (Recd. 16 Mar. 1970, 2 June 1971, and 
4 Oct. 1971) 

College of Engineering, Dept. of Aerospace 
Engineering, West Virginia University, Morgantown, 
WV 26506 


Key Words and Phrases: polynomials, roots of polynonials, 
theory of equations, Routh-Hurwitz criterion 

CR Categories: 5.15 

Language: Fortran 


Description 
This algorithm provides information about the roots of the 
polynomial 


2 spect) ett * ee Pee (1) 


The theorem [1] that the roots of (1) are all inside a ring of 
radius 
1+ max|C;| 
1<icn 
is embodied in the Fortran function RADIUS. By applying this to 
the original polynomial and to the polynomial 


n Cn-t ni Cn-2 n-2 od L 
ee Sip ae a tie (2) 


the inner and outer radii of an annulus containing all the roots are 
determined. 

The theorem [1] that the positive real roots of (1) are less than 
1+ fea ea 

l<gi¢n 

where mm is the subscript of the first negative coefficient is embodied 
in RADIUS. If there are no negative coefficients there cannot be any 
positive roots and RADIUS returns zero in this case. By applying 
RADIUS to both (1) and (2) upper and lower bounds are obtained 
for the positive roots. In some cases (all coefficients positive) it is 
possible to say that there are no real positive roots, but the converse 
does not hold so that the determination of bounds does not guaran- 
tee the existence of a real root between those bounds. RADIUS is 
also applied to the equations whose roots are the negatives and nega- 
tive reciprocal of the roots of (1) to obtain similar results for the 
negative real roots. 

The Fortran function HRWTZR employs a modification of the 
Routh-Hurwitz criterion [2] to determine whether (1) and the equa- 
tion whose roots are the negatives of those of (1) have any roots with 
positive real parts. Unfortunately a zero real part is considered 
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positive so that this test will not determine if an equation nas purely 
imaginary roots. 

The subroutine POLYAN, which computes the coefficients for 
the modified polynomials, calls the functions, and prints out suit- 
able messages, has for its arguments: 

1. An N element array C which contains the coefficients of the 
polynomial except for the leading 1. 

2. An auxiliary N element array CM in which the coefficients of 
the modified polynomials are stored as needed. 

3. N is equal to the degree of the polynomial. 

If desired the argument list can be extended to include the 
various bounds so that they can be transmitted back to the main 
program for use. 


References 

1. Berezin, I.S., and Zhidkov, N.P. Computing Methods. Vol. 
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Algorithm 


SUBROUTINE PGLYAN(Cs CMs ND 

POLYAN @BTAINS INFORMATIGN ABOUT THE LOCATION 

OF THE ROOTS GF A P@LYNGMIAL BY USING 

BOUNDs RADIUSs AND HRWTZR 

C ES AN ELEMENT ARRAY CONTAINING THE CGEFICIENTS 
N@RMALIZED S@ THAT THE LEADING COEFFICIENTC#HICH 
IS NOT INCLUDED IN C) IS +1-0 

CM Is A WEKKING ARRAY THE SAME SIZE AS C 

N=DEGREE Gr POLYNOMIAL 


aagaaaaanaan 


DIMENSIGN CCN), CMCND 
LOGICAL HRWTZK 
Cc TEST FOR ZERG ROOT 
IFCCON) «EG-060) GN TG 50 
Cc COEFFICIENTS FOR KECIPKGCAL PGLYNOMIAL AKE PUT IN CM 
CMON) =1.6/C00N) 
NML=N-1 
b@ 5S l=1.NM)2 
NISN-1 
5 CMCID=CMCN) *KCONID 
RGUT=RADIUSCC2N) 
KINE 1+ ZRADIUSCOMS ND 
WKITEC6s201) RIN» ROUT 
201 FORMATC 40K ROOTS ARE IN AN ANNULUS CF INNER KADIUS» 
1E10+63517H AND CUTER KADIUSs E1023) 
RPU=BOUND(CsN) 
IFCRPU-NE-O0-0) GO TO 10 
WVRITEC 6,202) 
202 FGORMATC33H THERE ARE N@ nEAL POSITIVE xO0TS) 
Go TO 20 
10 KPL=1+/BOUNDCCMsN) 
WRETEC6s203) RPL» RPU 
203 FOKMAT 
1€40H THE P@SITIVE RGOTSCIF ANY) ARE BETWEEN, 
2E1063s 4H ANDs E10.3) 
Cc COEFFICIENTS FOR NEGATIVE mECIPKOCAL AKE PUT IN CM 
20 DO 25 I=1lsNs2 
25 CMCI>=-CMCI) 
RNU=BQUND(CMsN) 
IFCRNU.NE-0-0)GO TB 30 
WRITEC 6,204) 
204 FORMAT 
1¢33H THERE ARE NO NEGATIVE REAL ROOTS) 
GO T@ 40 
Cc COEFFICIENTS Fek NEGATIVE «Q@@TS ARE FUT IN CM 
30 X=-1.0 
DG 35 I=lsN 
CMCT)SX*CCI) 
35 X=<xX 
RNUS-1-/RNU 
RNL=-BOUNDCCMsN) 
WRITEC6s205) KNUs KNL 
205 FORMAT 
1¢44H THE REAL NEGATIVE ROGTSCIF ANYOAKE BETWEEN, 
2E10¢3s 4H ANDsE10+3) 
40 IFCHKWTZK (CoN)) WRITEC 6,206) 
206 FORMAT 
1€44H THERE ARE NO KOOTS WITH POSITIVE KEAL PARTS) 
IFCHRWTZR (CCM»N)) WKETEC 62207) 
207 FGORMAT 
1 €44H THERE ARE NG ROOTS WITH NEGATIVE KEAL PARTS) 
RETURN 
50 WRITEC 62208) 
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202 FORMAT C41H POLYNOMIAL HAS A ZEKO ROOT-REDUCE DEGKEED 
RETURN 
END: 
FUNCTION RADIUS(CsN) 
RADIUS RETURNS AN UPPER LIMIT FO THE M@DULUS 
GF THE RGGTS OF AN N DEGREE POLYNOMIAL. 


aIO4 


DIMENSIGN CCN) 
RADIUS=ABS(CC1)) 
D@ 10 1=2sN 
10 IFCABS(CCL)) ¢GT«RKADIUS) RADIUS=ABS(CCI)) 
RADIUS=S1.+RADIUS 
RETURN 
END 
FUNCTION BOUND(C.N) 
BOUND RETURNS AN UPPER LIMIT FOR THE 
POSITIVE REAL K@GTS OF AN N DEGREE POLYNOMIAL 


aaa 


DIMENSION CCW) 
M=Q° 
BOUND=0.0 
09 10 I=tsN 
IFCM-GT-O) GO TO 10 
IFCCCI)«LT«O0O) Mel 
10 IFCCC1)+«LTeBOUND) BOUUND=C¢(1) 
IFCMeEQ+0) RETURN 
BOUND=1¢+#€-BOUND)**C16/FLCATCM)) 
RETURN 
END 
LOGICAL FUNCTION HRWTZRCCoN) 
HRWTZK RETURNS «TRUE. IF ALL THE ROOTS HAVE 
NEGATIVE REAL PARTS» @THERWISEsFALSE*«IS RETURNED. 
IF A REAL PART IS ZERO,THEN «FALSE~s IS RETURNED. 


agaaa 


DIMENSIGN CCN) 
HRWTZR=. FALSE. 
Ci=C¢l)> 
TFCCL+LE20+¢O)RETURN 
M2N-1 
DO 30 1=1.M 
KM=N-1I 
DO 20 K=1,KM 
CCKI=CL¥CCK4HLD 
TFCK eEQsKMeGRe2*(K/2)-EO-K) GG TO 20 
CCKI=CCKI-CCK42) 
20 CCKY=CCK)/C1 
Cl=CC1) 
IFCCLeLEO.0) ®ETURN 
30 CONTINUE 
HRWIZR=. TRUE» 
RETURN 
END 


Remark on Algorithm 429 [C2| 
Localization of the Roots of a Polynomial [W. Squire, 
Comm. ACM 15 (Aug. 1972), 776| 


Edward J. Williams {Recd. 15 Sept. 1972] Computer 
Science Department, Ford Motor Company, P.O. 
Box 2053, Dearborn, MI 48121 


Corrections are needed in the third paragraph. The theorem 
that the positive real roots of (1) are less than 


1 -+ [maxicign | Ci|]™... should read 
1+ [maxisi<ncico|Ci|]'™ 


Further, the four words“ RAD/US” in this paragraph should be 
replaced by “BOUND”. 


References 
t. Zaguskin, O.O. Solution of Algebraic and Transcendental 
Equations, Pergamon Press, New York, 1961, p. 21. 
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Remark on Algorithm 429 [C2] 


Localization of the Roots of a Polynomial [C2] 
[W. Squire, Comm. ACM 15 (Aug. 1972), 776-777] 


H.B. Driessen and E.W. LeM. Hunt [Recd. 13 Oct. 
1972, 29 Jan. 1973] 


Supreme Headquarters Allied Powers of Europe, 
Technical Center, P.O. Box 174, The Hague, The 
Netherlands 


There seems to be an error in this algorithm. If we take the 
polynomial: 


Z4 + az? + az? + az + as = 0, 


then after the second pass through the K-loop of the logical func- 

tion HRWTZR(C, N), the term (a2a3--a4)d4 — asa, is tested for a 

minus sign. However, the term which should be tested according 

to the Routh-Hurwitz criterion is (a2,a;—as)a, — asa”. If this 

term is negative then there are no roots with positive real parts. 
As an example, if the polynomial 


z4 + 5.6562 z3 + 5.8854 z? + 7.3646 z + 6.1354 = 0 
is studied with the help of Algorithm 429 one will find as output: 


Roots are in an annulus of inner radius .454 E + 00 and 
outer radius .836 E + 01; 

There are no real positive roots; 

The negative roots (if any) are between —.454 £ + 00 and 
— .836 E+ 01; 

There are no roots with positive real parts. 


However, if one calculates the roots of this equation, one will 
tind approximately: 


Z) = — 1.0001 
Zz, = —4.7741 
z34 = +0.0089 + 1.1457 i 


Statement 20 + 1 in the logical function HRWTZR(C,N), 
which was originally ‘““Cl = C(1)”, should be amended to read 
“Cl = C(1)/C1”. 

As a by-product of our investigation, it turns out that the 
structure of the logical function HRWTZR can be simplified by 
abandoning the logically redundant steps C(K) = C(K-+1). 

The following listing incorporates both the correction and 
the simplifications. The function has been parameter tested on a 
CDC-6400. 


LOGICAL FUNCTION HRWTZR (C,N) 
DIMENSION C(N) 

HRWTZR = .FALSE. 

IF (C(1) .LE.0..OR.C(N).LE.0.) RETURN 


Cl = C(1) 
M=N-1 
DO 301 = 2,M 


DO 20K = I,M,2 
20 C(K) = C(K) — C(K+1)/Cl 
Cl = C(1)/C1 
IF (C1.LE.0.) RETURN 
30 CONTINUE 
HRWTZR = .TRUE. 
RETURN 
END 
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Algorithm 430 


Immediate Predominators in a 
Directed Graph [H] 


Paul W. Purdom Jr.* and Edward F. Moore [Recd. 
14 Aug. 1970 and 13 July 1971] 

Computer Sciences Department, University of 
Wisconsin, Madison, WI 53706 


Key Words and Phrases: predominator, immediate predomina- 
tor, graph theory, directed graph, shortest path, articulation, 
connectivity, program optimization, optimizing compiler 

CR Categories: 4.12, 5.32 

Language: Algol 


Description 

We assume a directed graph whose nodes are labeled by integers 
between 1 and n. The arcs of this graph correspond to the flow of 
control between blocks of a computer program. The initial node of 
this graph (corresponding to the entry point of the program) is 
labeled by the integer 1. For optimizing the object code generated 
by a compiler, the relationship of immediate predominator has been 
used by Lowry and Medlock [3]. We say that node / predominates 
node & if every path from node | to node & passes through (i.e. both 
into and out of) node i. Node / is an immediate predominator of 
node k if node / predominates node & and if every other--node i 
which predominates node k also predominates node j. It can 
easily be proved that if k ~ 1 and node k is reachable from node It 
hen node & has exactly one immediate predominator. In case k = 1, 
or node k is not reachable from node 1, the immediate predominator 
of node k is undefined, and the value 0 will be given by the procedure 

‘ PREDOMINATOR. 

The input to this procedure is described for clarity of exposition 
as the adjacency matrix M of the directed graph. 

It is assumed that there is a known bound a such that the 
number g of arcs in the directed graph satisfies g < a. 

Both the machine time and the memory required to perform this 
procedure are related in a simple way to the number 77 of nodes and 
the number g of arcs of the given graph. If 7 is the length of time 
required to perform the procedure PREDOMINATOR, then T is 
bounded by 


T < kin? + kang + kan + kag + ks, 


where the k; are constants depending on the machine used for the 
procedure. If S is the memory required to perform the procedure 
PREDOMINATOR, then S is given exactly by 


S = ken? + koa + ken + ko. 


The ken? term is merely the memory required to store the ad- 
jacency matrix M which is used to give the input description of the 
graph. The description of the graph is first transformed into a linked 
list, and no further use is made of the Boolean array M. If this 
procedure were incorporated into an optimizing compiler, the 
adjacency matrix should be eliminated, going directly from the 


*Present address: University of Indiana, Department of 
Computer Science, Bloomington, IN 47401. 
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source program into the list form, saving the memory used to store 
the adjacency matrix M, which would remove the Xe? term from the 
memory required, as well as decreasing the computing time re- 
quired. The precise details of the list representation can be ex- 
pressed in a more brief and unambiguous manner by a few lines of 
Algol than by an English description. The predominators of any 
given node can be computed as in [3] from the immediate predomi- 
nators, and the articulation points of a graph are the predominators 
of the exit node. 

In an article on program optimization, Allen [1] gives an algo- 
rithm for computing articulation points (which are the predomina- 
tors of the exit node). To test if node / is an articulation point, he 
removes node i, from the graph, and computes the transitive closure 
to see if the exit node is connected to the entry node. By successively 
considering each node as an exit node, his algorithm can be adopted 
to computing the predominators (from which immediate predomi- 
nators can be quickly computed) in a time proportional to n? times 
the time required to compute the transitive closure. Since the transi- 
tive closure takes between n? and 7? operations to compute [4, 5, and 
6], Allen’s algorithm would be slower than the one presented here 
by at least a factor of n for large problems. 

The procedure PREDOMINATOR depends for its speed on the 
use of an algorithm first proposed by Dijkstra [2] for finding the 
shortest path between two points in a graph. The basic idea of the 
method is that a tree is found which is rooted on the entry node and 
which includes each node in the graph which can be reached from 
the entry node. Any node which cannot be reached from the entry 
node does not have an immediate predominator. Each node which 
can be reached from the entry node has the entry node as a predomi- 
nator. It is the immediate predominator unless the node has a pre- 
dominator which is closer to it along the path which was used to 
reach it. To test if a node i, other than the entry node, is a predomin- 


ator of some nodes, a test is made to see which nodes below (further 


from the root) i cannot be reached from the remaining nodes in the 
tree without going through i. The nodes which cannot be reached 
without going through ij have jas a predominator. Using this method 
the entry Immediate{i] is set to the various predominators of node /. 
The calculation is, however, organized to start at the root of the tree 
and proceed to the leaves, so that the last value of Jmmediate|j| con- 
tains the immediate predominator of /. 

The program was tested on 38 graphs including one with 36 
nodes and 49 arcs which represents the flowchart of the algorithm 
and one with 82 nodes and 125 arcs which represents the flowchart 
of a Fortran program. The running time of the program on a Bur- 
roughs B5500 was 0.6 sec for the 36 node graph and 3.8 sec for the 
82 node graph. The longest time for the remaining graphs was 0.5 
sec for a graph with 18 nodes and 48 arcs. The shortest time was 0.07 
sec for graphs with two nodes and one arc, five nodes and 25 arcs, 
and five nodes and 21 arcs. While these numbers are useful for 
estimating the average running time of the program, they are of 
limited use in calculating the constants in the formula for the run- 
ning time, because the formula gives only an upper limit on the 
running time. 
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Algorithm 
procedure PREDOMINATOR( Immediate, M, n, a); 
value 7, a; integer n, a; 
integer array Immediate; Boolean array M, 
comment The procedure sets Jmmediate|i] to theimmediate predomi- 
nator of i or to 0 if i has no immediate predominator. The inci- 
dence matrix of the graph is given by M, where Mii, /] is true if 
there is an arc from node i to node j. The number of nodes in the 
graph, which must be at least 1, is n, and a is (an upper limit on) 
the number of arcs in the graph. The start node is assumed to be 
node 1; 
begin 
integer node, j, avail, k, stp, new, oldnode, down; 
integer array First, Last, St[.:n|, Next{\:n+a], Suc{a+1:a+a); 
Boolean array Mark(1:n); 
comment This section initializes various variables and forms a 
linked list representation of the graph. The head of the list of 
arcs out of node iis Nex¢li| (for 1<i<n). The arcs are put on a 
list linked by the array Next where the corresponding entry in 
the array Suc gives the node to which the arc goes. In the array 
Nex? 0 indicates the end of the list. For most uses of the proce- 
dure the graph will already be available as a linked list and in 
such cases the procedure should be modified so that it starts 
from the list and does not use the array M.,; 


avail := n, 
for j := 1 step 1 until 1 do 
begin 


Mark\|jj := false; Nextli] := Immediate|j] := 0; 
for k':= step 1 until 1 do if M[/, k] then 


begin 
avail := avail+1; Suclavail] := k; 
Nextlavail} := Next{j|; Next|j] := avail, 
end; . 
end; 


down := Last(\| := 0; St{1] := stp .= oldnode := 1; 
Mark'}] := true; new := Nexi{1]; 
comment newp! is the start of Dijkstra’s{2| algorithm for the 
shortest path, modified for the case where all distances are 0 or 
infinity. In addition the array First is set to link the nodes in the 
order they are traversed by Dijkstra’s algorithm. Last|i| is set to 
the next node after node i on the list First which cannot be 
reached from node i by those arcs of the graph which are tra- 
versed by Dijkstra’s algorithm. Node | is set as the tentative im- 
mediate predominator of each node that can be reached from 
node 1; 
newp!: 
if new > O then 
begin 
node := Suc\new); 
if — Mark|node]then 
begin 
for j := 1 step 1 until down do LastSuc|Stsip+]]] : 
down := 0; stp := stp+1; Stlstp| := new; 


node; 


Mark{node] := true; 
Immediate|node] := 1; Firstloldnode| := node ; 
oldnode := node; new := Next\nodel; 
go to newp1; 
end; 
new := Next(new}, 
go to newp!l; 


end; 


down := down+1; new := Next|Stistpll; stp := stp—1; 
if stp0 then go to newp]; 
for j := 2 step 1 until down do Last|Suc{St{j]]] := 0; 
Firstloldnode\ := 0; j := 1; 
if First(1]=0 then go to exit; 

nextdom: 
oldnode := j, j := First\j); k := Firstlj); 


if k=O then go to exit; 
comment The nodes that the above version of Dijkstra’s algorithm 
reached by going through node j will now be unmarked; 
unmark: 
if k~Last(j] then 
begin Mark|k| := false; k := First{|k]; go to unmark; end; 
Firstloldnode) := Last{j); k := 1; 


trace: 
if k~0 then 
begin 
new := Nextlk]; stp := 1; 


comment vewp2 starts a second modification of Dijkstra’s 
algorithm to find which unmarked nodes can be reached 
from the marked nodes without using node /; 


newp2: 
if new +0 then 
begin 
node := Suc|new]; . 
if — Mark\|node) then 
begin 
stp := stp+1; St'stp) := new; Mark(node] := true; 
new := Next|node]; 
go to newp2; 
end; 
new := Next|new]; 
go to newp2; 
end; 


new := Next(St|stp\]; stp := stp-—1; 
if stp~+0O then go to newp2; 


k := First\k),; 
go to trace; 
end; 


k := First\j|; Firstloldnode\ := j; 
comment Each unmarked node will now be remarked and have 
j set to be its tentative immediate predominator, The last 
tentative immediate predominator is the actual one; 
marker: 
if k~Last(j) then 
begin 
if — Mark [k] then 
begin Immediate[k| := j; Mark|k; := true; end; 
k := First(k]; 
go to marker; 
end; 
go to nextdom; 
exit: 
end of PREDOMINATOR; 
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Abstract. A computer program based on Lemke’s 
complementary pivot algorithm is presented. This can be used 
to solve linear and quadratic programming problems. The 
program has been extensively tested on a wide range of problems 
and the results have been extremely satisfactory. 
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Description 

Introduction. The computer routine given below is based on 
Lemke’s complementary pivot algorithm [2] to solve the com- 
plementary problem of the form: 


Find w,z = 0 
such that w = Mz+q (1) 
w’z = 0 


where M is an (N X N) square matrix; w, z and g are (N X 1) 
column vectors. (‘Prime” denotes the transpose of a vector or 
matrix.) 

A solution to the above problem will be called a complementary 
solution, and Lemke’s algorithm is guaranteed to find a comple- 
mentary solution to system (1) only if the matrix M satisfies one of 
the following: 

1. M has all positive elements. 
2. Misa positive semidefinite matrix or x‘Mx 2 0 for all x. 
3. M has positive principal determinants. 

Applications. The two important applications of the comple- 
mentary problem (1) are to solve linear and quadratic programming 
problems by converting them to an equivalent complementary 
problem. 

Quadratic Programming. Consider the quadratic program: 


Minimize Z = c’x + x’Qx 
subject to Ax 2 b 
x20 


where A is an (m X n) matrix, Q is an (7 X n) matrix of the quadra- 
tic form, c and x are (x X 1) column vectors, and b is an (m X 1) 
column vector. 
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An optimum solution to the above problem may be obtained 
by solving a complementary problem of the form: 


RC <a.) 2 


u,v,x,y 20 
vx +u'y =0 
where u denotes the slack variables of the given quadratic program 
and (y, v) denotes the variables of the dual problem. Comparing 
the above system (2) with the original complementary problem 
(1), we note that 


wo (Dorn (oar C42" Santon (5). 


System (2) can be solved by the given computer routine and then 
an optimum solution to the given quadratic program may be 
obtained by reading off the values of (21,22, ..- Zn) Watts +++ 
Wnim) from the complementary solution. It should be remarked 
here that the matrix M in this case is positive semidefinite if and only 
if the matrix Q is positive semidefinite. Hence, the computer routine 
is guaranteed to find an optimum solution to the given quadratic 
program only if the objective function Z is a convex function. 
Linear Programming. Consider the linear program: 
Minimize Z = c’x 

subject to Ax = b 

x 20. 


IV Il 


The only difference between a linear program and a quadratic 
program is in the objective function. Hence, by setting Q = 0 in 
system (2), we get the equivalent complementary problem for a 
linear program. 

Program. A detailed description of Lemke’s algorithm to 
solve the complementary problem, on which the computer routine 
is based, is given in [3]. The program consists of six subroutines and 
a main program which calls these subroutines in proper order. The 
various input data to the program are the number of problems to be 
solved in succession, the size of the problem and the elements of 
matrix M and vector g. The original Lemke’s algorithm [2] was 
modified by the author along the lines of the revised simplex 
method [1] fora linear program to take advantage of the fact that 
for solving linear and quadratic programs, the M matrix in system 
(1) has many zero entries. This led to a greater efficiency of the 
computer routine. 

In an experimental study conducted by the author [4], this 
computer routine was extensively used to compare the relative 
efficiencies of the simplex method [1] and Lemke’s algorithm to 
solve linear programs. The study revealed the superiority of Lemke’s 
algorithm over the simplex method in a number of problems both 
with regard to the number of iterations and computation time. Also 
in [3], another modification of Lemke’s algorithm for solving linear 
programs has been proposed which may save a considerable 
storage and computation time. 
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complementary pivot methods for linear programming. Rep. Js2 Bae 
s . ‘ eLE-QC€J)) 
No. 70-9 (July 1970), School of Industrial Engineering, Purdue , ea a 
2 JsJel 
U,, Lafayette, Ind. IF (CJeLEeN) GO TO] 
C UPDATE Q VECTOR 
4 Ike 
Algorithm Ti=-OCInD 
IF (T1eLE-0-0) GO 10 ¥ 
C x«EMARKS De 3 1l=1lsN 
C SINCE [THIS PKCGRAM [5 COMPLETE IN ALL RESPECTSsIT CAN BE OCI HeGCL1dI+TI 
C KUN AS IT IS WITHOUT ANY ADDITIONAL MODIFICATION On 3 CONTINUE 
C INSTRUCTICN-IN SUCH CASE FOLLOW THE [NPUT FCKMAT AS GIVEN QCIRD=TI 
Cc C UPDATE BASIS INVEKSE AND INDICATOR VECTOK 
C PROGKAM FORK SOLVING LINEAK AND QUADKATIC PRCGKAMMING C OF BASIC VARIABLES. 
C PROBLEMS. IN THE FORM WaM#*Z+0e WeZ=02 WwW AND Z NONNEGATIVE DO 4 J=lsN 
C BY LEMKE/S ALGCRITHMe BCQs Tk a-beO 
¢ : : we Jd=OCJ) 
C MAIN PROGRAM WHICH CALLS THE SIX SUBROUTINES-MATKIX» Z(€ 5220-0 
C INITIAs NEWBASsSOciT»PLVOT AND PPRINT IN PROPEK OKDEK. MBASISCJ)=t 
Cc L=NtJ 
COMMON AMsUsL le Be NLIsNL2s As NEL os NEQs Iie MBASIT Ss we Z MBASIS(LI2J 
DIMENSION AMC50250), Q¢50)5 6¢50s50)» ACSD? 4 CONTINUE 
DIMENSION &€50)5 Z¢€50)2 MBASTIS(100) NL121 
C DESCKIPTION @F PARAMETERS IN COMMEN LeN+lnx 
Cc AM - A Tw DIMENSIONAL ARKKAY CONTAINING THE NL2sIn 
Cc . ELEMENTS OF MATHIX Me MBASISCIRKD=3 
Cc 10) A SINGLY SUBSCRKIPTED AnwAY CONTAINING THE MBASIS(L)20 
Cc ELEMENTS OF VECTOR QO. wCIK2=0.0 
C Ll AN INTEGER VARIABLE INDICATING THE NUMBEK OF ZOzOCI RD 
Cc ITERATIONS fFAKEN FOK EACKd PROBLEM. Lizt 
Cc B A TWO DIMENSIGNAL AKRAY CONTAINING THE C PRINT THE INITIAL ALMOST COMPLEMENTARY SOLUTION 
Cc ELEMENTS GF THE INVERSE OF THE CURRENT BASIS» WwxI TEC 6,5) 
Cc w A SINGLY SUBSCRIPTED AKRAY CENTAINING THE VALUES 5 FOWKMAT €3€7)55Xs29HINITIAL ALMOST COMPLEMENTARY » 
Cc OF W VAKIABLES IN EACH SOLUTION. 1 BHS@LUTIO@N) 
Cc Zz A SINGLY SUBSCRIPTED AnrKAY CONTAINING THE VALUES D@7 I=lseN 
Cc CF Z VAKIABLES IN EACH SOLUTION. WKITEC6,6) Ie,WCI)- 
Cc NL 1 AN INTEGER VARIABLE TAKING VALUE 1 On 2 DEPEND=- 6 FORMAT (10X%s2HW(s14s2H) =e F1925) 
Cc ING ON WHETHER VARIABLE W Gre Z LEAVES THE BASIS 7 CONTINUE 
Cc NEI SIMILAR TO NUI BUT INDICATES VARIABLE ENTERING WRITEC 628) ZO 
Cc NL2 AN INTEGEK VARIABLE INDICATING wHAT COMPONENT & FCRMAT €10X%s 3HZG=5F 1529) 
Cc > OF W OR Z VARIABLE LEAVES THE BASIS. KETURN 
C NE2@ SIMILAR TG NL2 BUT INDICATES VAKIABLE ENTERING 9 WKITE (©6210) 
Cc A A SINGLY SUBSCRKIPTED ARRAY CONTAINING THE 10 FORMAT (€5As36HPROBLEM HAS A TKIVIAL COMPLEMENTARY » 
Cc ELEMENTS OF THE TKANSFOKMED COLUMN THAT IS 1 23HSOLUTION WITH W=Gs, 220.) 
Cc : ENTERING THE BASIS. IR=1000 
Cc In AN INTEGEK VAKIABLE DEN@TING THE PIVOT KOW AT RETURN 
Cc EACH ITERATION. ALSO USED TO IENDICATE TEXMINA- END 
Cc TION OF A PROBLEM BY GIVING IT A VALUE OF 1000. SUBRCUTINE NEWBAS (ND . 
Cc MBASIS. A SINGLY SUBSCKIPTEB AKKAY~INDICATOR FOr THE C PUKPGSE - TO FIND THE NEw BASIS COLUMN TO ENTEn IN 
Cc BASIC VARIABLES. TwG INDICATORS ARE USED FOR Cc TERMS OF THE CURKENT BASIS. 
Cc EACH BASIC VAKLABLE-@ONE INDICATING WHETHEK c 
Cc IT IS A W OR Z AND ANOTHER INDICATING WHAT COMMON AMsOsL1sBsNLLeNL2sAsNELsNE2s ine MBASI Ss wo Z 
c COMPONENT @F W OR Ze DIMENSION AMC50s50)5 9¢50)2 BC350,50)2 AC 50) 
Cc DIMENSIGN W(50)s Z¢€50)s MBASISC100) 
C WKEAD IN THE VALUE OF VAKIABLE IP INDICATING THE C IF NL1 IS NEITHEK ! NOK 2 THEN THE VAKIABLE ZO LEAVES THE 
C NUMBER QF. PK@BLEMS TO BE SOLVED. C BASIS INDICATING TERMINATION WETH A COMPLEMENTAKY S@LUTION 
NEADC5s3) IP IF ¢€NL1-E04-.1) GO TE 2 
C VARIABLE NO INDICATES THE CUNKENT PROBLEM BEING SOLVED IF (NL1-EQ-2) GO TO 5 
Ne@=0 WRITEC 621) 
1 NO=NO+1 1 FORMAT ©5X%s22HCOMPLEMENTARKY SOLUTION) 
IF (NO«GTeIP) GO TO 5 CALL PPRINT (N) 
WRITEC6,2) NO IK=1000 
2 FORMAT (1HIs1OX,11HPROBLEM NQes212) RETURN 
Cc 2 NE1=2 
C READ IN THE SIZE GF THE MATRIX M NE24NL2 
READC5s3) N C UPDATE NEW BASIC COLUMN BY MULTIPLYING BY BASIS INVEKSEs 
3 FORMAT (12) DO 4 I=1sN 
C PROGRAM CALLING SEQUENCE T1=0.0 
CALL MATRIA (N) DG 3 J=i,N 
C PAKAMETER N INDICATES THE PRKOBLEM SIZE 3 Tl=TI-BCI> J) *®4M6 Us NEQ) 
CALL INITIA CN) ACI)=TI 
C SINCE FOR ANY PROBLEM TERMINATION CAN BCCURK IN INITIA, 4 CONTINUE 
CG NEWBAS @rR SORT SUBKOUTINEs THE VALUE OF IK IS MATCHED WITH SETURKN 
C t000 TO CHECK WHETHER T@ CONTINUE Ok GO TO NEAT PROBLEM. S NEL#1 
IF CIR-EQ-1000) GO TO 1 NE2aNL2 
4 CALL NEWBAS (N) DO 6 T=1.N 
IF CIK-E@-1000) GQ TO } ACI)=BCI,NE2) 
CALE SORT CN) 6 CONTINUE 
IF CIR-EQ-1000) GO TO 1 RETURN 
CALL PIVOT CN) END 
GG TO 4 SUBROUTINE SGRT CN) 
8S STOP C PURPOSE - TG FIND THE PIVOT KOW FORK WEXT ITERALTICN BY THE 
END Cc USE @F CSIMPLEX<TYPE) MINIMUM KATIO KULEs 
SUBROUTINE MATRIX CN) Cc 
C PURP@GSE - T@ INITIALIZE AND READ IN THE VAKIOUS INPUT DATA COMMON AMsOsL 1s Bs NLIsNL2e As NEL2NESs TKoMBASI Ss Wo Z 
Cc DIMENSION AMC(50250). &C50)s 3030750) ACSO) 
COMMON AMs QoL ls Bs NLisNlL2sAsNEls NEQs Inks MBASI Ss WsZ DIMENSION WOS5S0)s 2€90)5 MBASISC100) 
DIMENSION AMC 50.50)s €¢50)2 BC5S0.50)s ACSO) 131 
DIMENSION WC50)5 2050)» MBASISC100) 1 IF CACI)-GT-0-0) GO TO 2 
C KEAD THE ELEMENTS GF M MATRIX COLUMN BY COLUMN l=I +1 
DO t J2lsN IF CI1«GT«N) GO TO 6 
1 READC522) CAMCLs Je Tete ND GO TO 1 
2 FORMAT C7F10.5) 2 TrsQ¢cI)ZACI) 
C READ THE ELEMENTS OF 9 VECTOK Iks1 
KEADCS5s2) COCI>s T=baN) 3 Islet! 
C IN ITERATION Lt»BASIS INVERSE IS AN IDENTITY MATRIX. IF ¢CIeGTeN) GO TO 5 
D@ 5S JelseN IF CACTI) -GT-0-0) GO TO 4 
vO 4 [31tsN GO T® 3 
IF CIeEQ-J) GO TO 3 4 TezQ¢CI)sACI) 
BCLlseJ)20¢0 IF (€T2.GE-T1I) GO TE 3 
GO Te 4 Tk=1 
3 BCI» J)F160 Ti2T2 
4 CONTINUE GO TO 3 
5 CONTINUE 5S RETURN 
RETURN C FAILUKE GF THE KATI@G KULE INDICATES TERMINATION WITH 
END C NO COMPLEMENTARY SOLUTICN. 
SUBROUTINE INITIA ¢N) 6 WRITEC 6,7) 
C PURPOSE-TO FIND THE INITIAL ALMOST COMPLEMENTAKY SOLUTION 7 FOKMAT €5Xs37HPROBLEM HAS NO COMPLEMENTARY S@LUTIGN) 
c BY ADDING AN ARTIFICIAL VARIABLE ZO. WRITEC6s3) LI 
c 8 FORMAT CLOXsISHITERATION N@ee 14) 
COMMON AMsQeL1sBeNLiaNL2sAsNElsNE2s IKsMBASE SsWsZ IR*1000 
DIMENSION AMC 50550)» Q¢50)5 BC50550)s ACSO) RETURN 


DIMENSION wC50)s Z¢€50)s MBASIS(C1I00) END 
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SUBR@UTINE PIVOT C¢N) 
C PURPOSE - T®@ FPEKFORM THE PIVOGI] OPEKATION BY UPDATING THE 
Cc INVERSE @F THE BASIS AND Q VECTOR. 
c 
CEMMON AMs QoL ts Bs NL1 es NL2s As NEL» NE2s Ika MBASISs Wo Z 
DIMENSTON AMC50s50)2 UC50)> BC50259)s ACSO) 
DIMENSION WC50)» 2¢50)» MBASIS(C100) 
D@ t FStsn 
BCIR» TL) sBC IRs 1) /ACIRD 
QCTIRIZFQCIRDZACIRD 
D@ 3 I21sN 
IF C1-EQ-IR) GO T@ 3 
QOC1I=QC1T) “QC IK) ¥ACI) 
D@ 2 Jaton 
BOTs JBC Ie J “BC IR» JACI) 
2 CONTINUE 
3 CONTINUE 
C UPDATE THE INDICATOK VECTOR OF BASIC VARIABLES 
NLI=MBASISC IK) 
LAN¢+IR 
NL2=MBASISC(L) 
MBASISCIR)=NE1 
MBASIS(L)=NE2 
LisLiel 
RETURN 
END 
SUBROUTINE PPRINT ¢N) 
C PURP@SE - TO PRINT THE CURRENT SOLUTION TO COMPLEMENTARY 
PROBLEM AND THE ITERATION NUMBER. 


aa 


COMMON AMsQsL 1s Bs NL Io NL2s As NEL sNEQs IitsMBASISs Ws Z 
DIMENSION AMC50s50)s @¢50)% BC50250)s ASO) 
DIMENSION WC50). Z¢50)2 MBASIS(100) 
WRITEC6e1) LI 
FORMAT (C1LOXstESHITERATION N@es1 4) 
T=N#)1 
J=) 
2 Ki=MBASISCI) 
K2=MBASISCJ) 
IF €QCJ)-GE-0-0) GO T2 3 
Q¢J220-0 
3. IF (K2eEQ-1) GO TO S 
WRITEC624) K1sQCJ) 
4 FORMAT ©10X%s2HZ(51422H) 25 F155) 
G@ T@ 7 


- 


S WRITEC 6.6) K1»QCJ) 
6 FORMAT (10Xs2HWCs 1422H)=2F 1525) 
7 [set 

JeJel 

IF CUJ-LEeN) GEO TO 2 

RETURN 

END 


Editor’s note: Algorithm 432 described here is available on magnetic 
tape from the Department of Computer Science, University of 
Colorado, Boulder, CO 80302. The cost for the tape is $16.00 (U.S. 
and Canada) or $18.00 (elsewhere). If the user sends a small tape 
(wt. less than I 1b.) the algorithm will be copied on it and returned 
to him at a charge of $10.00 (U.S. only). All orders are to be prepaid 
with checks payable to ACM Algorithms. The algorithm is re corded 
as one file of BCD 80 character card images at 556 B.P.1-, even 
parity, on seven track tape. We will supply the algorithm at a 
density of 800 B.P.I. if requested. The cards for the algorithm are 
sequenced starting at 10 and incremented by 10. The sequence number 
is right justified in colums 80. Although we will make every attempt 
to insure that the algorithm conforms to the description printed here, 
we cannot guarantee it, nor can we guarantee that the algorithm is 
correct —L.D.F. 
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Remark on Algorithm 431 [H] 
A Computer Routine for Quadratic and Linear Pro- 


gramming Problems {H] [Arunachalam Ravindran, 
Comm. ACM 15 (Sept., 1972), 818] 


Arunachalam Ravindran [Recd. 12 Mar. 1973] 
School of Industrial Engineering, Purdue University, 
West Lafayette, IN 47907 


A small error has been brought to my notice in this algorithm. 
The error is in defining the matrix M. It should read as 


We Co, aa 
A 0 


Remark on Algorithm 431 [H] 

A Computer Routine for Quadratic and Linear Pro- 
gramming Problems [A. Ravindran, Comm. ACM 15 
(Sept. 1972), 818-820] 


L.G. Proll (Recd. 13 Aug. 1973) 
Centre for Computer Studies, University of Leeds, 
Leeds LS2 9JT, England 


Algorithm 431 is a Fortran implementation of Lemke’s comple- 
mentary pivot algorithm [1]. This algorithm has recently received a 
considerable amount of attention in the literature; in particular, 
there is some evidence that the algorithm is an attractive means of 
solving linear programs [2, 3] and can readily be modified to find 
stationary points of nonconvex quadratic programs [4]. 

Eaves [5] has shown that, in principle, degeneracy causes no 
problems in Lemke’s algorithm and that it will always be possible 
to pivot the artificial variable out of the basis. In the presence of 
rounding error, however, this may no longer be true, and further 
pivoting may not be possible despite the presence of the artificial 
variable with a value close to zero. In such a case Algorithm 431 
may incorrectly arrive at the conclusion that the problem has no 
complementary solution because it only recognizes a complementary 
solution when the artificial variable leaves the basis. 

The difficulty can be avoided by: (a) testing whether the value 
assumed by the artificial variable is acceptably ‘‘small” if no further 
pivoting is possible; and (b) not pivoting on ‘‘small’’ elements. The 
problem of deciding what is meant by “small” in this context is one 
for which there is no adequate theory. Clasen [6] has, however, pro- 
posed some empirical rules for dealing with similar problems in the 
revised simplex algorithm, and an adaptation of these has proved 
satisfactory. The modifications of Algorithm 431 given below incor- 
porate Clasen’s pivot tolerance to deal with point (b) above and 
also use this value as the upper limit on the acceptable value of the 
artificial variable. 


(i) In the subroutine JNITIA, add IZR to the end of the COMMON 
list and insert after the statement labeled 4, the statement 


IZR = IR 
(ti) In the subroutine SORT, add IZR to the end of the COMMON 
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list and 
(a) after the second DIMENSION statement, insert 


‘-AMAX=ABS(A(I)) 
DO 101=2,N 
IF (AMAX.GE.ABS(A(I))) GOTO 10 
AMAX =ABS(A(f)) 
10 CONTINUE 
TOL = AMAX*2.0** (—NB) 
IN :-ANY ACTUAL IMPLEMENTATION NB SHOULD BE RE- 
PLACED BY B—I!1 WHERE B IS THE NUMBER OF BITS IN 
THE FLOATING POINT MANTISSA AS CLASEN SUGGESTS 


aang 


(b) Replace 0.0 by TOL in the statement labeled 1 and in the 
statement two lines before that labeled 4. 

(c) Replace the label 6, occurring two lines before the statement 
labeled 2, by 9. 

(d) Immediately after RETURN, insert the statements, 


9 IF(Q(IZR).GT.TOL) GOTO 6 
WRITE(6,11) 
11 FORMAT(5X,22HCOMPLEMENTARY SOLUTION) 
CALL PPRINT(N) 
IR = 1000 
RETURN 
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Algorithm 432 


Solution of the Matrix 
Equation AX + XB= C [F4] 


R.H. Bartels and G.W. Stewart [Recd. 21 Oct. 1970 
and 7 March 1971] | 
Center for Numerical Analysis, The University of Texas 
at Austin, Austin, TX 78712 


Key Words and Phrases: linear algebra, matrices, linear equa- 
tions 

CR Categories: 5.14 

Language: Fortran 


Description 
The following programs are a collection of Fortran IV sub- 
routines to solve the matrix equation 


AX + XB =C (1) 


where A, B, and C are real matrices of dimensions m X m,n X n, 
and m X a, respectively. Additional subroutines permit the efficient 
solution of the equation 


ATX + XA = C, (2) 


where C is symmetric. Equation (1) has applications to the direct 
solution of discrete Poisson equations [2]. 

It is well known that (1) has a unique solution if and only if the 
eigenvalues a, , a2 ,...,@m Of A and ~; , Be ,..., Bn of B satisfy 
a; + B; #0 
One proof of the result amounts to constructing the solution from 
complete systems of eigenvalues and eigenvectors of 4 and B, when 
they exist. This technique has been proposed as a computational 
method (e.g. see [1]); however, it is unstable when the eigensystem 
is ill conditioned. The method proposed here is based on the Schur 
reduction to triangular form by orthogonal similarity transforma- 
tions. 

Equation (1) is solved as follows. The matrix A is reduced to 
lower real Schur form A’ by an orthogonal similarity transforma- 
tion U; that is A is reduced to the real, block lower triangular form. 

An 


aie ae O 


(i= 1,2,...,m;j = 1,2, ..,2). 


A pi A pp a App 
where each matrix A;; is of order at most two. Similarly B is reduced 


This research was supported in part by Grant DA-ARO(D)- 
31-124-G721, Army Research Office, Durham, and by National 
Science Foundation Grant GP-5253 awarded to The University 
of Texas at Austin. 
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to upper real Schur form by the orthogonal matrix V: 
Bu Be ++: Bry 
Box ect & Be 


B' = V’BV = 


O 


’ 
Bag 


where again each B;,; is of order at most two. If 
en a a Ci 
Cc’ = UTCV = 
Cx gies 
and 
Xn + +) Xiq 
X' = UTXV = : 
Xo at ae, Xv 


then eq. (1) is equivalent to 
A'X' + XB’ = C’. 


If the partitions of 4’, B’, C’, and X’ are conformal, then 


bei oy 
AinXir + XnBu = Cu — > AuXji — YS XB 


j=l i=l 
(k = 152, vty Dy [= beds es 9). 


These equations may be solved successively for Mis Nei re, Cre 
Xi2, X22 ,... The solution of (1) is then given by ¥ = UX’VT. 

The reduction of A and B to real Schur form is accomplished by 
standard techniques. The matrix B is reduced to upper Hessenberg 
form by Householder’s method [4, p. 34], and the upper Hessenberg 
matrix is in turn reduced to real Schur form by the QR algorithm 
[3]. The product of the transformations used in the reductions is 
accumulated to form the matrix V. The reduction of A to lower real 
Schur form is accomplished by reducing the transpose of A to upper 
real Schur form and transposing back. 

Since the QR algorithm is an iterative method that, as used here, 
reduces the subdiagonal elements of an upper Hessenberg matrix to 
zero, some criterion must be adopted for determining when an ele- 
ment is negligible. In these programs an element of H is considered 
negligible if it is less than or equal to ey || H ||. where ey is a con- 
stant supplied by the user. This criterion is appropriate if the ele- 


(3) 


- ments of H are all of roughly the same size. A different criterion may 


be required if the elements vary widely and the small elements are 
significant, as when the elements decrease greatly in size as one 
passes from the upper left to the lower right corners of A (see, for 
example, the criterion in [3]). 

The solution for X%; in (3) still requires the solution of a matrix 
equation of the form (1). However, in thiscase the matrices 4;,4.and 
By are of order at most two; hence the solution of (3) can be ob- 
tained by solving a linear system of order at most four. For example, 
if Aj, and Br are both of order two, then 


é e , , ‘ 
au + bn / an boy 0 Xu di 
t a ‘, ’ 
a2 a2 + an 0 bar X21 du |, 
/ = 
bis 0 


‘ ’ ’ ’ 

au + bee aie X12 dy 
, a ’ ’ é 

0 bie Q21 a2. + bre X22 dr 
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where aj; , b;;, and x;; denote the elements of Au, Bi, and Xx 
and d;; denotes the elements of the right-hand side of (3). The sys- 
tems arising from (3) are solved using the Crout reduction. Once 
calculated, the soluticn Xi may be stored in the locations occupied 
by Cy: , which is no longer needed. 

The programs contain provisions for skipping the reduction of 
A to real Schur form, so that once A’ and U have been calculated 
they may be used to solve new systems with different matrices B and 
C. Likewise, the reduction of B may be skipped. These provisions 
may be used to advantage in the iterative refinement of the com- 
puted solution X, of (1). Namely, let the residual matrix Ri = C — 
AX, — X:B be computed in double precision and rounded to single 
precision (On many computers this may be done with single-pre- 
cision multiplications and double-precision additions). Use the pro- 
grams to solve the system AY, + YiB = R,. Then X, = %1+ % 
will in general be a more accurate approximate solution. This 
process may be iterated, no step after the computation of X, re- 
quiring reductions of A and B. This iteration is perfectly analogous 
to the iterative refinement of approximate solutions of linear sys- 
tems described by Wilkinson [4, p. 255]. 

The following trick enables one to use an upper rather than a 
lower real Schur form of A in the solution of (1). Let D be the ma- 
trix with ones on the secondary diagonal and zeros elsewhere. Then 


(DAD)DX + DXB = DC. (4) 


Moreover, if 4’ = UTAU is an upper real Schur form for A, then 
DA'D = (DUD)?T(DAD)(DUD) is a lower real Schur form for 
DAD. Hence to calculate DX, which is X with its rows written in re- 
verse order, one may use the above algorithm with DA’D and DUD 
to solve (4). A similar trick enables one to use a lower real Schur 
form for B. 

In principle, the algorithm described abcve can be used to solve 
the symmetric problem (2). However, it is pcssible to take advantage 
of the symmetry. Let U be orthogonal and 4’ = UTAU be in upper 
real Schur form. Partition 4’, C’ = UTCU, and X’ = UTXU in the 


form 
An ae) 
A : ? 
1s ie ae 
' T 
; ae Xi Xo1 | : 
Xa Xo 
C= Cu a ’ 
Cr Coe 


where Ay; , X11 , and Ci: are at most of order 2. Then from the equa- 
tion A’TX’ + X'A’ = C’, it follows that 

Aya Xn + XnAve = Cx — Xndv — Aa Xn. 

Hence, once X;, and Xo: have been calculated, the size of the prob- 
lem can be reduced. 


The matrix Xo: is computed as described above for the general 
case. The matrix Xi: satisfies the symmetric equation 


An Xu + XudAn = Cu : (5) 


whose solution is trivial when Aj, is of order unity. When An is of 
order two, equation (5) gives a new linear system of order three for 
the three distinct elements of Xi: . 

A mild saving in operations may be realized in the computation 
of C’ = UTCU and X = UX'UT. Let C = T + TT, where T is 
upper triangular. Then 
Cc! = UTCU = UTTU + (UTTU)’. 

Thus one need calculate only UTTU, and, since T is upper tri- 
angular, the product TU can be computed with about half the 
operations required for the computation of CU. 

The number of multiplications required for the solution of (1) 
is probably overestimated by 


5 
(2 + 40)(m3 + n°) + , (mn? + nm?) 


432-P 2—- 0 


where a is the average number of QR steps required to make a sub- 
diagonal element negligible. The first term is due to the reduction of 
A and B to real Schur form. A like estimate for the solution of (2) is 
given by 


(2 + 40)n? + . ws 


the first term is again due to the reduction of A to real Schur form. 

To solve the nonsymmetric problem, the user must furnish 
2m? + 2n® + mn storage locations to hold the matrices A, U, B, V, 
and C. If A, B, and C are required for later use, they must be stored 
elsewhere, since the programs overwrite A and 8B with their real 
Schur forms and C with the solution. The symmetric problem re- 
quires 377? locations to hold A, U, and C. 

In assessing the effects of rounding error on the algorithm, we 
should consider the algorithm stable if the computed solution were 
near a matrix X that satisfied 


(A+E)X + X(B+F) =C+G 


for some small E, F, and G. We are unable to establish such a result. 
However, an-elementary rounding error analysis, combined with 
the known properties of the other algorithms used in the method, 
shows that the residual matrix is small compared with the larger of 
| A | | Xl and || Bll X |. 

Here follows a brief description of the programs listed below. 
Detailed information on their use will be found in the program list- 
ings themselves. The casual user need only familiarize himself with 
the programs AXPXB and ATXPXA, which coordinate the other 
programs for the solutions of (1) and (2), respectively. 

AX PXB. The coordinating prograrn for the solution of (1). 
Given A, B and C the program overwrites C with the solution X. 
The lower real Schur form of A overwrites A, and the upper real 
Schur form of B overwrites B. The user may furnish the real Schur 
forms and skip the reductions. The subroutine requires the subrou- 
tines HSHLDR, BCKMLT, SCHUR, SHRSLY, and SYSSLV. 

ATXPXA. The coordinating program for the solution of (2). 
Given A and C the program overwrites C with the solution XY. The 
upper real Schur form of A overwrites A. The user may furnish the 
real Schur form and skip the reduction. The subroutine requires 
the subroutines HSHLDR, BCKMLT, SCHUR, SYMSLY, and 
SYSSLV. 

HSHLDR. Reduces a matrix A to upper Hessenberg form. The 
upper Hessenberg form and a history of the transformations over- 
write A. 

BCKMLT. Takes the output A of HSHLDR and computes 
the orthogonal matrix U that reduces the original matrix A to 
upper Hessenberg form. At the user’s option the elements of U 
can overwrite A. 

SCHUR. Computes an upper real Schur form of an upper 
Hessenberg matrix A. SCHUR is an adaptation of thel Agol pro- 
cedure Agr by Martin, Peters, and Wilkinson [1]. The product of 
the transformations used in the reduction is accumulated. SCHUR 
leaves undisturbed the elements below the third subdiagonal of the 
array containing A. (N.b. The modifications made in Agr to find a 
real Schur form make SCHUR an ineflicient program for calculat- 
ing the eigenvalues of an upper Hessenberg matrix.) 

SHRSLV. Solves an equation of the form (1), where A is in 
lower real Schur form and Bis in upper real Schur form. 

SYMSLY. Solves an equation of the form (2), where A is in 
upper real Schur form. 

SYSSLV. Solves a system of linear equations. 

When m > n, AXPXB can be modified so that the real Schur 
forms of A and B share the storage originally allocated to A and the 
matrix V occupies the locations occupied by B. The modifications 
are as follows. Replace the section labeled “JF REQUIRED, 
REDUCE B TO UPPER REAL SCHUR FORM” with 


35 IF(EPSB .LT. 0.) GO TO 45 


CALL HSHLDR (B, N, NB) 
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C CONTAIN THE SCHUR F@RM AND ACCOMPANYING ORTHOGONAL MATRIX. 
C THIS PERMITS THE EFFICIENT S@LUTI@N GF SEVERAL EQUATIONS 


IF UI .NE. 1) A(l, [+4) = BUI—1, N}) C OF THE FORM AX + BX = C WHEN A D@ES NOT CHANGE. LIKEWISE, 
ae C IF EPSB IS NEGATIVE. THE REDUCTION @F 8 TO REAL SCHUR FORM 


DO4J=1,N C IS SKIPPED. 
A(L, J+5) = BU, J) ¢ 


DO 401=1,N 


40 CONTINUE Ve ds i isu RULSA UB ewaR ASEAN Al ener Ion ReaaSw eaaaes 
CALL BCKMLT(B,B,N,NB,NB) ia, NAc ABH NUSNES PALL M MoMA sale MTS 19 
CALL SCHURA(1,6),B,N,NA,NB,EPSB,FAIL) Baie ce 
FAIL = —FAIL Nl = Nel 
IF(FAIL .NE.0) RETURN ‘ ee 


In the sections labeled “TRANSFORM C” and “TRANSFORM 
C BACK TO THE SOLUTION” replace all occurrences of the 
variable V with B and all references to A(J,M1) with A(M1,J). 

Change the call to SHRSLV to ars OG 


CALL SHRSLV(A,A(1,6),C,M,N,NA,NA,NC). . so cout? = TEMP 


CALL HSHLDR(AsM,NA) 
CALL BCKMLTCAs UsMsNAs NU) 
IFCMM1 «EQ. 0) GO TO 25 
D@ 20 I=1sMM1 
ACI+1s1) = ACIsM1)? 
20 CONTINUE 


C IF REQUIREDs REDUCE A T@ UPPER REAL SCHUR FORM. 
Cc 


IFCEPSA -LTe O-) GO TO 35 
D@ 10 I=15™ 
DO 10 JFIem 


Note that in this modification the reduction of B to real Schur 
form cannot be skipped without also skipping the reduction of A. 
When m < va similar modification can be made to store the Schur 


form of A and B together in B. 


CALL SCHURCAs Us Ms NA» NUs EPSA, FAIL) 
IFCFAIL «NE« 0) RETURN 
25 D@ 30 I=1toM 
D@ 30 J=I5M 
TEMP = ACIsJ) 


References “ACIe J) = ACdsTD 
: ; : ACJoI> = TEMP 
1. Bickley, W.G. and McNamee, J. Matrix and other direct 30 CONTINUE 


2 : ie : c 
methods for the solution of systems of linear difference equations. C IF REQUIRED» KEDUCE B TO UPPER REAL SCHUR FORM. 
; 5 Cc 
Philos. Trans. Roy. Soc. (London) Ser. A, 252 (1960), 69- 131. 35 IFCEPSB «LT. 0.3 60 Ta 49 
2. Dorr, Fred W. The direct solution of the discrete Poisson CALL HSHLOR(BsN»NBD 
: CALL BCKMLT(BsV,N,NB» NV) 
equation on a rectangle. SIAM Rev. 12 (1970), 248-263. IF(NMI »EQ. 0) 60 TO 45 
3. Martin, R.S., Peters, G., and Wilkinson, J.H. The QR A ahs 
algorithm for real Hessenberg matrices. (Handbook series 40 CONTINUE 
linear algebra.) Numer. Math. 14 (1970), 219-231. poric Se oe ete 
4. Wilkinson, J.H. The Algebraic Eigenvalue Problem, Clarendon, é IFCFAIL «NE+ 0) RETURN 
Oxford, 1965. C TRANSFORM C. 
c 
45 DO 60 J=1sN 
DO SO I=1.M 
ACIsM1) = 0. 
: DQ SO K=1.M 
Algorithm ACIOM1) = ACIOM1) + UCKs1)*CCKs J) 
50 CONTINUE 
SUBROUTINE AXPXBCAs UsMs NAsNUs Ba Ve Ns NBo NVe Co NCo EPSAs D@ 60 I=1s™ 
1EPSBs FAIL) CcIeJ5)>) = ACISMID 
c 60 CONTINUE 
C AXPXB IS A FORTRAN IV SUBROUTINE T@ SOLVE THE REAL MATRIX D@ 80 I=1sM 
C EQUATION AX + XB - Ce THE MATRICES A AND B ARE TRANS- D@ 70 J=isN 
C FORMED INT@ REAL SCHUK FORMs AND THE TRANSFORMED SYSTEM IS BCN1sJ) = O- 
C S@LVEO BY BACK SUBSTITUTION. THE PROGRAM REQUIRES THE D@ 70 K=14N 
C AUXILIARY SUBR@UTINES HSHLDks BCKMLTs SCHURs AND SHRSLV. BONI»J) = BCNIsJ) + CCIsK*VCKs J) 
C THE PARAMETERS IN THE ARGUMENT LIST ARE 70 CONTINUE 
C D@ 80 J=IsN 
c A A DGUBLY SUBSCRIPTED AKRAY CONTAINING THE CCleJ) = BCNEsJ> 
c MATRIX Ae ON RETURN» THE LOWER TRIANGLE 80 C@NTINUE 
c AND SUPERDIAG@NAL @F THE ARRAY A C@NTAIN c 7 
Cc A L@WER REAL SCHUK FORM OF As THE ARRAY C SOLVE THE TRANSFORMED SYSTEM. 
Cc A MUST BE DIMENSI@NED AT LEAST M+1 BY c 
Cc M+ile CALL SHRSLVCAsBsCoMeNeNAsNBeNC) 
Cc U A DOUBLY SUBSCRIPTED ARRAY THAT» ON Cc 
c RETURNs CONTAINS THE ORTHOGONAL MATRIX C TRANSFORM C BACK T@ THE SOLUTION. 
c THAT REDUCES A T®@ REAL SCHUR FORM. c 
Cc M THE QRDER OF THE MATKIA As D@ 100 J=IsN 
Cc NA THE FIRST DIMENSION @F THE ARRAY A. DO 90 I=1.M 
Cc NU THE FIRST DIMENSION @F THE ARRAY U- ACIOM1) = O« 
Cc B A DOUBLY SUBSCRIPTED ARKAY CONTAINING THE D@ 90 K=1.M 
c MATRIX Be ON RETUXNs THE UPPER TRIANGLE ACIsM1) = ACISME) + UCTs KI #CCKs JD 
c AND SUBDIAGONAL OF THE ARRAY B CONTAIN AN 90 CONTINUE 
c UPPER REAL SCHUR FORM OF Be. THE ARRAY B D@ 100 I=15™ 
Cc MUST BE DIMENSIGNED AT LEAST M+1 BY “+1. CCIsJ) = ACISMID 
Cc v A D@UBLY SUBSCRIPTED ARRAY THATs @N 100 CONTINUE 
Cc RETURNs CONTAINS THE @RTHOGONAL MATRIX DQ 120 I=lom 
c THAT REDUCES B T@ REAL SCHUR FORM. D@ 110 J#leNn 
Cc N THE ORDER OF THE MATRIX Be BCNIsJ) = O- 
c NB THE FIRST DIMENSION GF THE ARRAY B. D@ 110 K=1.N 
c NV THE FIRST DIMENSION @F THE AKRAY V. BCN12J) = BONIs J) + CCIsKI#VCJoK) 
c Cc A D@UBLY SUBSCKIPTED ARRAY CONTAINING THE 110 CONTINUE 
Cc MATRIX Ceo @N RETURN» C CONTAINS THE D@ 120 Jalen 
c SOLUTI@N MATRIX Xe CClsJ) = BCNLsJ) 
Cc NC THE FIRST DIMENSION OF THE ARRAY Ce 120 CONTINUE 
c EPSA A CONVERGENCE CRITERION FOR THE REDUCTION RETURN 
Cc OF A T@ SCHUK FORM. EPSA SH@ULD BE SET END 
c SLIGHTLY SMALLEK THAN 10.%*(-N)» WHEKE N SUBROUTINE SHRSLVCAsBs Co Ms No NA» NB NC) 
Cc 1S THE NUMBER @F SIGNIFICANT DIGITS IN C SHRSLV IS A FORTRAN IV SUBROUTINE TO SOLVE THE REAL MATRIX 
Cc THE ELEMENTS @F THE MATRIX A. C EQATIGN AX + XB = Cy WHERE A IS IN LOWEK REAL SCHUR F@RM 
Cc EPSB A C@NVERGENCE CRITERION FOK THE REDUCTION C AND B IS IN UPPER REAL SCHUR FORM. SHRSLV USES THE AUX- 
Cc OF B T@ REAL SCHUR FORM. C ILIARY SUBROUTINE SYSSLVs WHICH IT COMMUNICATES WITH 
Cc FAIL AN INTEGER VARIABLE THATs ON RETUKN, C THR@UGH THE C@MMON BLOCK SLVBLK. THE PARAMETERS IN THE 
Cc CONTAINS AN ERROR SIGNAL. IF FAIL IS C ARGUMENT LIST ARE 
c POSITIVE (NEGATIVE) THEN THE PROGRAM WAS Cc r A D@UBLY SUBSCRIPTED ARRAY CONTAINING THE 
Cc UNABLE T@ REDUCE A (8) T@ REAL SCHURK c MATRIX A IN LOWER REAL SCHUR FORM. 
Cc FORM. IF FAIL IS ZER@s» THE REDUCTIONS c 8 A DOUBLY SUBSCRIPTED ARRAY C@NTAINING THE 
Cc PROCEEDED WITHOUT MISHAP. c _ MATRIX B IN UPPER REAL SCHUR FORM. 
c c c A D@UBLY SUBSCRIPTED ARRAY CONTAINING THE 
C WHEN EPSA IS NEGATIVE THE REDUCTION OF A T@ REAL SCHUK c MATRIX Ce 
C F@RM IS SKIPPED AND THE ARRAYS A AND U ARE ASSUMED TO c M THE @RDER OF THE MATRIX Ae 


0 
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c N THE @RDER OF THE MATRIX Bo c THAT REDUCES A T@ UPPER REAL SCHUR FORM. 
c NA THE FIRST DIMENSION OF THE ARKAY A- c c A DQUBLY SUBSCRIPTED ARRAY CGNTALNING THE 
c NB THE FIRST DIMENSION OF THE ARRAY B- c MATRIX Ce ON RETURN» C CONTAINS THE 
c NC THE FIRST DIMENSIG@N @F THE ARRAY C. Cc S@LUTION MATRIX X- 
c c N THE ORDER OF THE MATRIX A. 
REAL c NA THE FIRST DIMENSION OF THE ARRAY A« 
TACNAs 1) sBONBs 1) eCONCo 1) 2 TOP c NU THE FIRST DIMENSION @F THE ARKAY Us 
INTEGER c NC THE FIRST DIMENSI@N @* THE ARRAY Ce 
IMs Ns NA» NBs NCo Ks KM s DKe KKe lo LM 1s OL ebb» 1, 1Bs Js JAsNSYS c EPS A CONVERGENCE CRITERION FOR THE REDUCTION 
COMM@N/SLVBLK/TC 5s 5)9PC5),NSYS c OF A T@ REAL SCHUR FORM. EPS SHOULD BE 
L=4 c SET SLIGHTLY SMALLER THAN 10¢*«(-N)> 
10. LMI = Let c WHERE N IS THE NUMBER @F SIGNIFICANT 
DL = 1 c DIGITS IN THE ELEMENTS @F THE MATRIX A- 
IFCL «EQ. N) GO TO 15 c FAIL AN INTEGER VARIABLE THATs @N RETUKNS 
IFCB(L+1sL) «NE- 0+) DL = 2 c CONTAINS AN ERROR SIGNAL. IF FAIL IS 
15 LL = L+DL-1 c NONZER@» THEN THE PROGRAM WAS UNABLE TO 
IFCL «EQ. 1) GO 18 30 c REDUCE A T®@ REAL SCHUR FORM. IF FAIL IS 
D@ 20 Jabs LL Cc ZERO» THE REDUCTION PROCEEDED WITHOUT 
DB 20 I=1sM c MISHAP.» 
DO 20 IB=1sLM1 c 
CClsJ) = CCIsJ) = COIs IB) *BCIBs J) C WHEN EPS IS NEGATIVEs THE KEDUCTION OF A T@ REAL SCHUR 
20 CONTINUE C FORM IS SKIPPED AND THE ARRAYS A AND U ARE ASSUMED TO 
30 Kd C CONTAIN THE SCHUR FORM AND ACCOMPANYING ORTHOGONAL MATRIX. 
40 KML = K-1 C THIS PERMITS THE EFFICIENT SOLUTION OF SEVERAL EQUATIONS 
DK = 1 C WITH DIFFERENT RIGHT HAND SIDES.« 
LFCK «EQ. M) GO TO 45 c 
IFCACKsK+1) *NEe O-) DK = 2 REAL 
45 KK = K+DK=-1 LACNAs 1) eUCNUs E>» CONC? 1) 2EPS 
IFCK »EQ@e 1) GO T@ 60 INTEGER 
DO 50 I=KsKK INsNAsNUs NC» FAIL es NI sNM1s 1s Je K 
D@ 50 JsLoLh Nl = Nel 
D@ 50 JA=1sKMI NM1 = Ne} 
CCls I) = CCled) = ACTS JAIRO CAs J) c ; 
50 CONTINUE C IF REQUIREDs REDUCE A T®@ LOWER REAL SCHUR FORM. 
60 IFCDL «EQ. 2) GO T@ 80 c 
IFCDK »EQ@.e 2) GO TY 70 IFCEPS «LTe 04.) GO TO 15 
TO1s1) = ACKsK) + BCLsL) CALL HSHLDRCA, Ns NAD 
IFCTC1s1) «EQ@e OO) STOP CALL. BCKMLT(As Us N» NAs NU) 
C(KsL) = CCKsLO/TCts LD DG 10 I=LsNMI 
G@ Te 100 ACI+1,1) = ACIZNI)> 
70 TCls1) = ACKsK) + BCLoL) 10 CONTINUE 
TC1s2) = ACKsKK) CALL SCHURCAs Us Ns NAs NU, EPS» FAIL) 
TC2s1) = ACKKeK) IFCFAIL «NE+ 0) RETURN 
T(2,2) = ACKKsKK) + BCLoL) Cc 
POl) = CCKsL) C TRANSFORM C. 
PC2) = CCKKsL)D c : 
NSYS = 2 15 D@ 20 I=1t.N 
CALL SYSSLV CCls I) = COIs I 726 
C(KeL) = POL) 20 CONTINUE 
CCKK,L) = PC2) D@ 40 I=1.N 
Go Te 100 D@ 30 J=1sN 
80 IFCDK «EQ. 2) GO Te 90 BEN add! 20's 
TClel) = ACKsK) + BCLsLd D@ 30 K=I>N 
TC1e2) = BCLLsL) ACN12J) = ACNIsJ) + CCTs K*UCKs J) 
TC2s1) = BCLsLL) 30 CONTINUE 
T(202) = ACKsK) + BCLLsLL) DQ 40 J=1sN 
POL) = C€KsL) CCIl,J)> = ACNIs J) 
PC2) = CCKSLL) 40 CONTINUE 
NSYS = 2 DO 60 J=lsN 
CALL SYSSLV D@ SO I=1sN 
CCKsL) = PCED ACI,NIL) = O« 
C(KsLL) = PC2> D@ SO K=15N 
Go To 100 ACIsNID = ACISNED * UCKs TL) ¥CCKs JD 
90 TCls1) = ACKsK) + BCLoLD 50 CONTINUE 
TC1,2) = ACKsKK) D@ 60 I=15N 
TO123) = BCLLs«LD CCIsJ)> = ACISNI)D 
TC1,4) = O« 60 CONTINUE 
TC2s1) = ACKKsK) D6 70 I=tsN 
T( 222) = ACKKsKK) +* BCLoL) D@ 70 J=lsN 
T(253) = QO. Ceol,J) = CCIsJ)D + ClCIeld 
T(2,4) = TC1s3) CCJst) = CCIseJ) 
TC3.1) = BCL»LL) 70 CONTINUE 
T(322) = 0. c 
T(03,3) = ACKsK) + BCLL»LL) C SOLVE THE TRANSFORMED SYSTEM. 
T0354) = TO1s2) c 
TC 401) = O. CALL SYMSLVCAs Cs Ns NA» NC) 
T0452) = TOC3s1) c . 
T0423) = TC2s1) C TRANSF@RM C BACK T@ THE SGLUTION. 
C4 = | Ce 
aan) EAeenIO + BCLL>LL) D@ 80 I=1.N 
& CCIsI) = CCL»II/2¢ 
P(2) = CCKKsL) 
. 80 CONTINUE 
PC3) = CCKsLLD 
a D@ 100 I=1sN 
P€4) = CCKKsLL) 
NSYS = 4 D@ 90 J=1sN 
CALL SYSSLV ACNIsJ) = Oo 
C(KsL) = Pod) D@ 90 K=IsN 
CCKKsL) = PC2) ACNIsd) 2 ACNIs J) * COLsK) UC Je KD 
C{KsLL) = PC) 90 CONTINUE 
= DO 100 J=15N 
CCKKsLL) = PC4) 
pd COI, J) = ACNIsJ) 
100 K = K + DK ros ceaNdiNO 
IFCK «LE* M) G@ TO 40 OBEN TA WE 
D@ 120 J=lsN 
Be eae oe D@ 110 I=1.N 
IF(L -LE. N) GO To 10 ee 
RETURN ACIZN1) = O- 
END D@ 110 K=15N 
F ACIsNL) = ACIONE) + UCT sKI*CCKs J) 
SUBROUTINE ATXPXACAs Us CoN» NAsNUs NCo EPS» FAIL) 110 CONTINUE © 
c D®@ 120 I121,N 
C ATXPXA IS A FORTKAN IV SUBROUTINE T@ SOLVE THE REAL MATRIX CCIe J) = ACIZNID 
C EQUATION TRANSCA)*X + X*A * Ceo WHERE C IS SYMMETRIC AND 120 CONTINUE 
C TRANSCA). DEN@TES THE TRANSP@SE GF As .THE EQUATION IS D@ 130 I=1,N 
C TRANSFORMED S@ THAT A IS IN UPPER REAL SCHUR FOKMs AND THE D@ 130 JaI.N 
C TRANSFORMED EQUATICN IS SOLVED BY A RECURSIVE PROCEDURE. CCls,u) = COLLIS) + COJeID 
C THE PK@GRAM REQUIRES THE AUXILLARY SUBR@UTINES HSHLDR,» COJol) = CClsu 
C BCKMLTs SCHUR, AND SYMSLV.- THE PARAMETERS IN THE ARGUMENT 130 CONTINUE 
C LIST ARE RETURN 
c A A D@UBLY SUSSCKIPTED ARRAY CONTAINING THE END 
Cc MATRIX Ae ON RETURN» THE UPPEK TRIANGLE SUBR@UTINE SYMSLVCAsCsNsNAs NC? 
Cc AND THE FIRST SUBDIAG@NAL @F THE AKRAY A c 
c CGNTAIN AN UPPER REAL SCHUR FORM OF As C SYMSLV 1S A FORTRAN IV SUBROUTINE TO SOLVE THE REAL MATRIX 
Cc THE ARRAY A MUST BE DIMENSIGNED AT LEAST C EQUATION TRANSCA)*X + X*A = C» WHERE C IS SYMMETRIC, A 1S 
c N+] BY Ntle C IN UPPER REAL SCHUR FORM» AND TRANSCA) DEN@TES THE TRANS- 
Cc U A DOUBLY SUBSCRIPTED ARRAY THATs ON C P@SE OF Ae SYMSLV USES THE AUXILIARY SUBROUTINE SYSSLV> 
c RETURNs CONTAINS THE ORTHOGONAL MATRIX C WHICH IT COMMUNICATES WITH THROUGH THE COMM@N BLOCK 


COLLECTED ALGORITHMS (cont.) 


C SLVBLK. THE PARAMETERS IN THE ARGUMENT LIST ARE 
Cc A A DOUBLY SUBSCRIPTED ARRAY CONTAINING THE 
Cc MATRIX A IN UPPER REAL SCHUR FORM. 
Cc Cc A DOUBLY SUBSCRIPTED ARRAY CONTAINING THE 
Cc MATRIX Ce 
Cc N THE ORDER OF THE MATRIX A-w 
Cc NA THE FIRST DIMENSION OF THE AKRAY Ae 
Cc NC THE FIRST DIMENSION OF THE AKKAY Ce 
Cc 

REAL 

TACNAs 1>9CONC21)9ToP 

INTEGER 


INs NAsNCoKs KKs DKs KMD Le Lb» DLs LDL» Is LAs Js NSYS 
COMMON/SLVBLK/T¢ 5s 5)2PC5)eNSYS 
tL = 1 
10 DL = 1 
IF(L «EQe N) GO TO 20 
IFCACL+#ieolL) sNE~« Oo) DL = 2 
20 LL = L+DbL-1 
KeL 
30 KM1 = K-11 
DK = 1 
IFCK .«EQ- N) G@ TO 35 
IFCACK+15K) «NE. O-)} DK = 2 
35 KK = K+DK-1 
IFCK «EQ. L) GQ TO 45 
D@ 40 I=Ks KK 
D@ 40 JeSLeLL 
D@ 40 IA=Ly»KMI1 
CCIoJ> = CCIsd) = ACIAs I *#CCIAs JD 
40 CONTINUE 
45 IFCDL -&Q- 2) GO TO 60 
IFCDK -EQ.s 2 ) GB TB 50 
TCho 1) = ACKsK) + ACLoL) 
IFCTCtso tl) «EQ- Of) STOP 
CCKsbL> = CCKsLd/TC1 st) 


G@ Te 90 
50 TCs dd = ACKsK) + ACLoL)d 
TC1s2) ACKKsK) 


TC2shd = ACKs KK) 

T(2,2) = ACKKsKK) + ACLsL) 
PC1) = CCKsL) 

PC2) = CCKKsL) 

NSYS = 2 

CALL SYSSLV 

CCKeL) = PCI) 

CCKK»L) = PC2) 


GQ TO 90 

60 IFCDK «EQ. 2) GO TO 70 
TChs1)> = ACKsK) + ACLsL) 
TC1e2) = ACLLsLD 
TC2e1) = ACLsLL) 
TC2,2) = ACKsK) + ACLLsLL? 
PCL) = CCKsL) 
PC2) = CCKsLL> 
NSYS = 2 


CALL SYSSLV 
CCKslL) = PCI) 
C(KsLL) = PC2) 
GO Te 90 

70 IFCK .NE- L) GO TO 80 
TCts1) AtLseL) 
TO122) 


ACLsLL)> 


T¢els2) 


=~ 
na 
Le 
. 
- 
wa 
tuu dota 


TC33) 2 ACLLsLL) 
C(LoL 726 
CCLLsbL) 
CC(LL»LL) 726 
3 

CALL SYSSLV 


al 
na 
nN 
“ 

uuna 


CtLsLL> = PC(2) 
C(LLsLL) = PC3) 


GQ Te 90 
80 TChs 1) = ACKsK) + ACLeL)D 
TC1,2) = ACKKsK) 
T6123) = ACLLsL? 
TOs 4) = Oe 
TC2,1) = ACKs KK) 
T(2s2) = ACKKsKK) + ACLoL) 
TC2s3) = Ov 
T(2s4) = TIS 3) 
TC3,1) = ACLsLL)D 
TC3,2) = O- 
TO353) = ACKsK) + ACLLsLL? 
T6354) = TC192) 
TC 4s1) 2 Oo 
TC 422) = TCO301) 
T6493) = TC2,1) 
TC4s4) = ACKKs KK) + ACLLsLL) 
PC) = CCKsL>d : 
P(2) = CCKKsL) 
PC3) = CCKseLL) 
PC4) = CCKKsLL) 
NSYS = 4 


CALL SYSSLV 
C(KselL) = PC1> 
CCKKseL) = PC2) 
CCKsLL) = PC3) 
CCKKsLL) = PC4) 
90 K = K + DK 
JFCK eLE- N) GO TO 30 
LOL = & + DL 
IFC(LDOL »GT- N) RETURN 
D@ 120 J=LDL»N 
D@ 100 I=LeLL 
CClsd) = CCJsT) 


aaAgNgAgAaNggNAAaAaAaaANAN 


agaannanaaanaaaaann 
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100 CONTINUE 
D@ 120 I=JsN 
D@ 110 KSLsLb 
CCIlsJ>) = COIs) - COISKI¥ACKs JD - ACKs TD *CC Ks JD 
110 CONTINUE 
CiJoTd) = CCIoW) 
120 CONTINUE 


L = LDL 
GO Te 10 
END 


SUBROUTINE HSHLUDR(A» Ns NAD 


HSHLDR IS A FORTRAN IV SUBROUTINE TO REDUCE A MATRIX TG 
UPPER HESSENBERG F@RM BY ELEMENTARY HERMITIAN TRANSFORMA- 
TIONS (THE METH@D @F HOUSEHOLDER). THE PARAMETERS IN THE 
ARGUMENT LIST ARE 
A A DOUBLY SUBSCRIPTED AKRAY CONTAINING THE 
MATRIX Ase ON RETURNs THE UPPER TRIANGLE 
@F THE ARRAY A MATRIX AND THE (N+1)-TH 
COLUMN C@NTAIN THE SUBDIAG@NAL ELEMENTS 
OF THE TRANSFORMED MATRIX. ON RETURNs 
THE LOWER TRIANGLE AND THE CN+1)-TH KOW 
OF THE ARRAY A CONTAIN A HISTORY OF THE 
- TRANSFORMATIONS.» 
N THE ORDER GF THE MATRIX Aw 
NA THE FIRST DIMENSION OF THE ARRAY As 
REAL 
LACNAs 1) sMAXs SUMs SaP 
INTEGER 
IN» NAsNM25N1,LeL1o1oJ 
NM2 = N-2 
Nil = N+#1 


IFCN +«EQ-+ 1) RETURN 
IFCN «GT. 2) GO TO S 
ACIsNI> = AC2s1) 
RETURN 

5 D@ 80 L=isNM2 


Li = L+t 
MAX = QO. 
D@ 10 I=LIsN 
MAX = AMAX1CMAX»ABSCACIsL))) 
10 CONTINUE 
IFCMAX »«NE- 0+) GG TO 20 
ACLsN1) = 0. 
ACNIsL) = O- 
GO TO 30 
20 SUM = 0. 
DQ 30 I=L1sN 
ACIsL) = ACIsL)/MAX 
SUM = SUM + ACIsL)**2 
30 CONTINUE 
S = SIGNCSORTCSUM)»ACLIsLD) 
ACLs NL) = -MAX*S 
ACLIsL) S + ACLIsL> 
ACNIsL) S¥ACLIsL) 
D@ 50 J=L1isNn 
SUM = O-+ 
DQ 40 [=L15N 
SUM = SUM + ACIsL)*ACI, J) 
40 CONTINUE 
P = SUM/ACNIsL) 
DO SO I=L1sN 
ACIsJ?2 = ACIZI) - ACTsL2*P 
50 CONTINUE 
D@ 70 I=isN 
SUM = O- 
D@ 60 J=LIsN 
SUM = SUM + ACIsJ)*ACJsL) 
60 CONTINUE 
P = SUM/ACNIsL) 
D@ 70 J=LisN 
ACIsJ) = ACIsd) - P#ACIeL)D 
70 CONTINUE 
80 CONTINUE 
ACN-15N1) = ACNsN=1) 
RETURN 
END 
SUBROUTINE BCKMLTCAsUsNsNAs NUD 


BCKMLT IS A FORTRAN IV SUBROUTINE THATs GIVEN THE OUTPUT 
@F THE SUBKQUTINE HSHLDR» COMPUTES THE ORTHOGONAL MATRIX 
THAT REDUCES A T@ UPPER HESSENBERG FORM. THE PARAMETERS 
IN THE ARGUMENT LIST ARE 

A A DGUBLY SUBSCRIPTED ARRAY C@NTAINING THE 

Q@UTPUT FROM HSHLDK. 
U A D@UBLY SUBSCRIPTED ARRAY THAT» ON 
; RETURN» CONTAINS THE ORTHOGONAL MATRIX. 

N THE @RDER OF THE MATRIX A IN HSHLDR. 

NA THE FIRST DIMENSION @F THE ARRAY Ae 

NU THE FIRST DIMENSION @F THE ARRAY U- 


THE ARRAYS A AND U MAY BE IDENTIFIED IN THE CALLING 
SEQUENCE+ IF THIS IS DONEs THE ELEMENTS OF THE ORTHOGONAL 
MATRIX WILL OVERWRITE THE OUTPUT @F HSHLDR- 


REAL 

TACNAs 1) 2 UCNUs 1)5SUMsP 
INTEGER 

INeNAsNIsNMILs NM2sLLoLoLls load 
Ni = N+! 


UCNsN) = Le 
IFCNMI »2EQ-e 0) RETURN 
UCNMIsN) = O- 
UCNsNM1) = Of 
UCNMISNMID = Le 
IFCNM2 «EQ. 0) KETURN 
be 40 LL=1i,NM2 
L = NM2-LL+1 
LI = L+l 
IFCACNIsL)> «EQs 0.) GO TO 25 


0 
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D@ 20 J2LIsN 
SUM = 0. 
D@ 10 I=L1.N 
“SUM = SUM + ACIsL)*UCTI,ZJ) 
10 CONTINUE 
Po = SUM/ACNIsL) 
DB 20 I=L1.N 
UCIs J) = UCIsJ) - ACIsL) *P 
20 CONTINUE 
es D@ 30 IsL1.N 
UCIsL) = Of 
UCLaI) = O06 
30 CONTINUE 
UCLsL? = be 
40 CONTINUE 
RETURN 
END | 
SUBROUTINE SCHURCHs Us NN» NHs NU» EPS» FAIL) 


SCHUR. IS 7A FORTRAN IV SUBROUTINE T@ KEDUCE AN UPPEK 
HESSENBERG MATRIX T@ REAL SCHUR FORM BY THE QR METHOD WITH 
IMPLICIT ORIGIN SHIFTS. THE PRODUCT OF THE TRANSFORMA- 
TIONS, USED IN THE REDUCTI@N IS ACCUMULATED. SCHUR IS AN 
ADAPTATION @F THE ALG@L PROGRAM HOR BY MARTINs PETERS» AND 
WILKINSON (NUMER. MATHe 14 €1970) 219-231)+- THE PARA- 
METERS IN THE ARGUMENT LIST ARE 
H A DOUBLY SUBSCRIPTED ARRAY CONTAINING THE 
UPPER HESSENBERG MATRIX He ON KETURNs H 
CONTAINS AN UPPER REAL SCHUR FORM OF He 
THE ELEMENTS @F THE ARRAY H BELOW THE 
THIRD SUBDIAGONAL ARE UNDISTURBED. 
U A DOUBLY SUBSCRIPTED ARRAY CONTAINING ANY 
MATRIXe @N RETURN» U CONTAINS THE MATRIX 
UFRCL#RC2)¢6¢e, WHERE RCL) ARE THE TRANS- 
FORMATIONS USED IN THE REDUCTION OF He 


NN THE ORDER @F THE MATRICES H AND U- 
NH THE FIRST DIMENSION OF THE ARRAY He 
NU THE FIRST DIMENSION OF THE ARRAY U- 


EPS A NUMBER USED IN DETERMINING WHEN AN 
ELEMENT OF H IS NEGLIGIBLE. H¢IsJ) IS 
NEGLIGIBLE IF ABSCHCIsJ»)») IS LESS THAN @k 
EQUAL TO EPS TIMES THE INFINITY NORM OF 
He 

FAIL AN INTEGER VARIABLE THATs ON RETURNs 
CONTAINS AN ERR@R SIGNAL+ IF FAIL IS 
POSITIVE, THEN THE PROGRAM FAILED TO MAKE 
THE FAIL-t @R FAIL~2 SUBDIAG@NAL ELEMENT 
NEGLIGIBLE AFTEK 30 ITERATIONS~ 


REAL: 
VHONHs 1). UCNUs 1), EPSs HNs RSUMs TEST» Po Os Ra Ss We Xo Yo Z 
INTEGER 
INNsNASNHs FAILS ls LTSs Js JLo Kea LoL be Me MMe M22 M32 Ne NA 
LOGICAL 
ILAST 
N = NN 
HN = Q. 
D@ 20 I=1.N 
JL. = MAXOC1s 1-1) 
RSUM = 0. 
D@ 10 JeJLsN 
RSUM = KSUM + ABSCHCIsJd)) 
10 CENTINUE 
HN 2 AMAX!CHNs»RSUM) 
20 CONTINUE 
TEST = EPS*HN 
IFCHN «EQ. Q-) GO TB 230 
30 IFCN -LE- t) GO TO 230 
ITS = 0 
NA = N-1 
NM2 = N-2 
40 D@ 50 LL=2sN 
L = N-LL+2 
IFCABSCH(L»L<-1)) -eLE. TEST) G@ TO 60 
50. CONTINUE 
Lo=ot 
GO T2 70 
60 H(LslL-t)d = O- 
70 IFCL «LT. NA) GO TO 72 


N= Let 
G@ Te 30 
72 X HON» N) /HN 


Y =-HC¢NAs,NA)/HN 
@R = CHCNs NA) HN) *CHCNAS ND /HN)D 
IFCETS «LT. 30) GO TO 75 
FAIL = N 
RETURN 
75 IFCLITSeEQs10 -OR- ITS-EQ-20) GO TO 80 
S =X #Y 
Y= X*Y - R 
6G@ Ta 90 
80 Y = CABSCHCNs NAD) + ABSCHCNAsNM2)))/HN 
S = 1¢5*Y 
Y 3 Y**2 
90 ITS-2 ITS + 1 
D@ 100 MM=L,NM2 
NM2=MM+L, 
H(MsM)/HN 
HCM+tsM)/HN 
HOM+1sM41)/HN 
X*CX=S) + Y + R&CHCMoM+1)/HND 
R#CX+Z-S) 
ReCHCM+2,M+1)/HN)D 
ABSCP) + ABSCQ@) + ABSCR) 
P/sW 
asw 
= R/W 
IFCM oEQ+ L? G@ TO 110 
IFCABSCHCMsM-1))*CABSC(Q)+ABSCR)) eLEe ABSC(P)*TEST) 
1G@ Te 110 
100 CONTINUE 
110 M2 = M+2 
M3 = M+3 
De 120 I=M2,N 
HC(I,1-2) = O06 


wok uokRusauu 
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120 CONTINUE 
IF(M3 eGTe N) G@ T@ 140 
D@ 130 I=M3.N 
HCI,1I-3) = 0. 
130 CONTINUE 
140 D@ 220 K=M,NA 
LAST = KeEQ-NA 
IFCK -eEQe M) GO TO 150 
P = HCKsK-1) 
@ = HCK+isK-1) 
R= 0. 
IFC eNOTeLAST) R = HCK+2sK-1) 
X = ABSCP) + ABS(Q) + ABSC(R) 
IFCX eEQ-e Of) GO TO 220 
Pa P/X 
@ = O7xX 
R = R/X 
150 S = SQRTCP**2 + Qee2 + REQ) 
IFCP LT. O-) S 3 -S 
IFCK «NEe M) HCKsK-1) = ~S#X 
IFCKeEQeM eANDe LeNEoM) HC Ks K-1) = -HCKsK~-1) 
P=Pe#s 
P/S 
Q/S 
R/S 
Q/P 
R/P 
@ 170 JeKs NN 
P = HCKs J) + OQeHCK+ls JD 
IFCLAST) G@ TO 160 
P = P + RkEHCK*2sJ) 
HCK+2s J) = HCK+2sJ) = P*Z 
160 H(K+ls,J) = HCK+tl» J) = PY 
H(Ks J) = H(€Ks J) - PX 
170 CONTINUE 
J = MINOCK+3,N) 
DO 190 IalsJ 
P = X*HCIsK) + Y#HC Is Kel) 
IFCLAST) G@ To 180 
P= P + Z¥HCI,K+2) 
HCIsK+2) = HC1sK+2) - PR 
180 HCIsK+1) = HCIsK+t>? - P*Q 
HCIsK) = HCIsK) - P 
190 CONTINUE 
DQ 210 I=1sNN 
P = X#¥UCIsK) + YeUCESK+1) 
IFCLAST) G@ Te 200 
P= P + ZeUCIsK+2) 
UCIsK*+2) = UCIeK#2) - PR 
200 UCIsK4+1) = UCIsK+1) - PQ 
UCIsK) = UCIsK) = P 
210 CONTINUE 
220 CONTINUE 


x 
Y 
z 
a 
R 
D 


Ge Te 40 
230 FAIL = 0 
RETURN 
END 
SUBROUTINE SYSSLV 
Cc 
C€ SYSSLV IS A FORTRAN IV SUBROUTINE THAT S@LVES THE LINEAR 
C SYSTEM AX = B OF ORDER N LESS THAN 5 BY CROUT REDUCTION 
C F@LL@WED BY BACK SUBSTITUTION». THE MATRIX As THE VECTOR 
C Bs AND THE @RDER N ARE CONTAINED IN THE ARRAYS AsBs AND 
C THE VARIABLE N OF THE COMMON BLOCK SLVBLK+ THE SOLUTION 
C IS RETURNED IN THE ARRAY Be 
c 
COMMON/SLVBLK/ACSs 5)sBC5)2N 
REAL MAX 
1 NMI = N-1 
Nt = N+ 
Cc 
C COMPUTE THE LU FACTORIZATION OF A- 
Cc 
DO 80 K=14N 
KME = K-1 
IFCK«EQet)> GG T@ 20 
D@ 10 I=KsN 
D@ 10 J=1sKM1 
ACIsK) = ACIsK) - ACTs J*¥ACIOK) 
10 CONTINUE 
20 IFCKeEQ«N) GO TB 100 
KPL = K+l 
MAX = ABSCACKsK)) 
INTR = K 
DO 30 I=KP1sN 
AA = ABSCACIsK)) 
IFCAA «LE» MAX) GO T@ 30 
MAX = AA 
INTR = I 
30 CONTINUE 
IFCMAX -EQ- O-) STOP 
ACNIZK) = INTR 
IFCINTR eEQ- K) GO TO SO 
D@ 40 J=1,N 
TEMP = ACKsJ) 
ACKs J) = ACINTRs J) 
ACINTR»,J) = TEMP 
ao CONTINUE 
SO DS 80 J=KP1lsN 
IFCKeEQ-1) GO TO 70 
DG 60 I=1,sKM1 
ACKs J) = ACKs J) - ACKs LD #ACIs JD 
60 CONTINUE 
70 ACKs J) = ACKs JOZACKS KD 
80 CONTINUE 
c 
C INTERCHANGE THE COMPONENTS OF B. 
Cc 


100 DO 110 J=1,NM1 
INTR = ACNIsJ) 
IFCINTR «EQ. J) GO TO 110 
TEMP = BCJ) 
BCJ) = BCINTR) 
BCINTR> = TEMP 
110 CONTINUE 


0 
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SOLVE LX = Be 


aaa 


200 BC1) = BCLI/AC1s 1) 
D@ 220 1=2sN 
IMi = I-1 
D@ 210 J=te-IMi 
BCI) = BCI) = ACI»J)*BCJ) 

210 CONTINUE 

BCI) = BCIDZACI,1) 
220 CONTINUE 


SOLVE UX = Be 


aaa 


300 .D@ 310 IIl=15NM1 
I= NMI-II+1 
It = I+1 
D@ 310 J=116N 
BCI) = BCI) - ACI» J *BCUD 
310 CONTINUE 
RETURN 
END 


COLLECTED ALGORITHMS FROM CACM 


Editor’s note: Algorithm 433 described here is available on magnetic 
tape from the Department of Computer Science, University of 
Colorado, Boulder, CO 80302. The cost for the tape is $16.00 (U.S. 
and Canada) of $18.00 (elsewhere). If the user sends a small tape 
(wit. less than 1 lb.) the algorithm will be copied on it and returned 
to him at a charge of $10.00 (U.S. only). All orders are to be prepaid 
with checks payable to ACM Algorithms. The algorithm is recorded 
as one file of BCD 80 character card images at 556 B.P.1., even 
parity, on seven track tape. We will supply the algorithm at a 
density of 800 B.P.I. if requested. The cards for the algorithm are 
sequenced starting at 10 and incremented by 10. The sequence number 
is right justified in columns 80. Although we will make every attempt 
to insure that the algorithm conforms to the description printed here, 
we cannot guarantee it, nor can we guarantee that the algorithm is 
correct.—L.D.F. 


Algorithm 433 


Interpolation and Smooth 
Curve Fitting Based on Local 
Procedures (E2) 


Hiroshi Akima [3 Nov. 1970, 9 Apr. 1971, and 1 Mar. 
1972] 

U.S. Department of Commerce, Office of Telecom- 
munications, Institute for Telecommunication Sciences, 
Boulder, CO 80302 


Key Words and Phrases: interpolation, polynomial, slope of 
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Description 


Introduction. User information and Fortran listings are given 
on two subroutines, IVN7TRPL and CRVFIT. Each subroutine im- 
plements the method of interpolation and smooth curve fitting 
based on local procedures [1]. These subroutines are written in 
ANSI Standard Fortran [2]. 

Outline of the Method. The method is devised in such a way 
that the resulting curve will pass through all the given data points 
and appear smooth and natural. It is based on a piecewise function; 
a portion of the curve between a pair of given points is represented 
by a third-degree polynomial for a single-valued function and by 
two third-degree polynomials for a multiple-valued function. In 
this method, the slope of the curve is determined at each given 
data point locally by the coordinates of five data points, with the 
data point in question as a center point and two data points on 
each side of it. Each piece of the function representing a portion 
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of the curve between a pair of given data points is determined by 
the coordinates of and the slopes at the points. 

When interpolation is made near the end points of the curve, 
two more points estimated at each end point are used to determine 
the slope of the curve. In this method, this estimation is based 
on three data points, the end point in question and two adjacent 
given data points. 

The resulting curve of this method for a single-valued function 
is invariant under a linear-scale transformation of the coordinate 
system; different scalings of the coordinates result in equivalent 
curves. The resulting curve of this method for a multiple-valued 
function, on the other hand, is variant under a linear-scale trans- 
formation of the coordinate system; both the abscissa and the 
ordinate should be scaled with their respective units having an 
equal length on the graph. 

This method requires only straightforward procedures, not 
iterative solutions of equations with preassigned error tolerances, 
which are required by some methods. No problem concerning 
computational stability or convergence exists in application of 
this method. 

The INTRPL Subroutine. This subroutine interpolates, from 
values of the function given as ordinates of input data points in 
an x-y plane and for a given set of x values (abscissas of desired 
points), the values of a single-valued function y = y(x). 

The entrance to this subroutine is achieved by 


CALL INTRPL(VU,L,X,Y,N,U,V) 
where the input parameters are 


IU = logical unit number of standard output unit, 

number of input data points (must be two or greater), 

array of dimension Z storing the x values (abscissas) of 
input data points in ascending order, 

array of dimension Z storing the y values (ordinates) of 
input data points, 

number of points at which interpolation of the y value 
(ordinate) is desired (must be one or greater), 

= array of dimension N storing the x values (abscissas) of 


desired points, 


is 
tot tl 


xX 
Y 
N 
U 


and the output parameter is 


V -=array of dimension N where the interpolated y» values 
(ordinates) are to be displayed. 


This subroutine occupies 515 locations on the CDC-3800 
computer. Computation time required for this subroutine on the 
same computer is approximately equal to 


1 + 0.2 N msec for L 10, 
3 + 0.5N msec for ZL = 100, 


when the elements of the U array are given in ascending order; and 


1 + 0.5 N msec for L 10, 
3 + 0.7 N msec for L = 100, 


when they are given in random order. 

When the function to be interpolated represents a periodic 
function and a set of Z, data points covers a whole period, two 
additional data points should be added at each end and a set of 
L, + 4 data points should be given as the input data points to 
this subroutine. 

The CRVFIT Subroutine. This subroutine fits a smooth curve 
to a given set of input data points in an x-y plane. It interpolates 
points in each interval between a pair of data points and generates 
a set of output points consisting of the input data points and the 
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Fig. 1. Curve fitted to the input data points given in Table I (a). 
(Encircled points are given data points.) 
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Table I. An Example of CRVFIT (MD = 1) 


(a) Input data points 
I XD) Y() I XW YU) I x@ b 449) 


I 0.000 0.000 4 3.000 0.000 7 6.000 10.000 
2 1.000 0.000 5 4.000 0.000 8 7.000 80.000 
3 2.000 0.000 6 5.000 1.000 9 8.000 100.000 
4 3.000 0.000 7 6.000 10.000 10 9.000 150.000 


(b) Output points 
K  U(K) V(K)  K .U(K) V(K)  K_ U(K) V(K) 


1 0.000 0.000 16 3.000 0.000 31 6.000 10.000 
2 0.200 0.000 17 3.200 0.000 32 6.200 18,341 
3 0.400 0.000 18 3.400 0.000 33 6.400 33.645 
4 0.600 0.000 19 3.600 0.000 34 6.600 51.778 
5 0.800 0,000 20 3.800 0.000 35 6.800 68.607 
6 1.000 0.000 21 4.000 0.000 36 7.000 80.000 
7 1.200 0.000 22 4.200 0.068 37 7.200 85.510 
8 1.400 0.000 23 4.400 0.244 38 7.400 88.574 
9 1.600 0.000 24 4.600 0.485 39 =7.600 90.882 
10 1.800 0.000 25 4.800 0.751 40 7.800 94.127 
11 2.000 0.000 26 5.000 1.000 41 8.000 100.000 
12 2.200 0.000 27 5.200 1.523 42 8.200 108.080 
13 2.400 0.000 28 5.400 2.659 43 8.400 116.940 
14 2.600 0.000 29 5.600 4.433 44 8.600 126.760 
15 2.800 0.000 30 5.800 6.871 45 8.800 137.720 
16 3.000 0.000 31 6.000 10.000 46 9.000 150.000 
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Fig. 2. Curve fitted to the input data points given in Table II (a). 
(Encircled points are given data points.) 
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Table II. An Example of CRVFIT (MD = 2) 
(a) Input data points 


I X(1) Y() I X() YU) 1 xm YW 
1 — 30.000 70.000 4 —18.000 4.000 7 30.000 20.000 
2 —30.000 40.000 5 0.000 0.000 8 30.000 40.000 
3 —30.000 20.000 6 18.000 4.000 9 30.000 50.000 
4 —18.000 4.000 7 30.000 20.000 10 30.000 70.000 
(b) Output points 
K U(K) V(K) K  U(K) V(K)  K  U(K) V(K) 
1 —30.000 70.000 16 —18.000 4.000 31 30.000 20.000 
2 —30.000 64.000 17 —14.641 2.463 32 30.000 24.000 
3 —30.000 58.000 18 —11.097 1.331 33 30.000 28.000 
4 —30.000 52.000 19 —7.433 0.567 34 30.000 32.000 
5 —30.000 46.000 20 —3.713 0.136 35 30.000 36.000 
6 —30.000 40.000 21 0.000 0.000 36 30.000 40.000 
7 —30.000 36.000 22 3.713 0.136 37 30.000 42.000 
8 —30.000 32.000 23 7.433 0.567 38 30.000 44.000 
9 —30.000 28.000 24 11.097 1.331 39 30.000 46.000 
10 —30.000 24.000 25 14.641 2.463 40 30.000 48.000 
11 —30.000 20.000 26 18.000 4.000 41 30.000 50.000 
—29.315 16.080 27 21.501 6.240 42 30.000 54.000 
—27.466 12.400 28 24.758 9.080 43 30.000 58.000 


—24.758 9.080 29 27.466 12.400 44 30.000 62.000 
—21.501 6.240 30 29.315 16.080 45 30.000 66.000 
16 —18.000 4.000 31 30.000 20.000 46 30.000 70.000 
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interpolated points. It can handle either a single-valued function 
or a multiple-valued function. 
The entrance to this subroutine is achieved by 


CALL CRVFITUIU,MD,L,X,Y,M,N,U,V) 
where the input parameters are 


IU = logical unit number of standard output unit, 
MD = mode of the curve (must be 1 or 2) 
= 1 for a single-valued function 
= 2 for a multiple-valued function, 
L = number of input data points (must be two or greater), 
X = array of dimension Z storing the abscissas of input data 
points (in ascending or descending order for MD = 1), 
Y = array of dimension L storing the ordinates of input data 
points, 
M_ = number of subintervals between each pair of input data 
points (must be two or greater), 
N = number of output points 


= (L-1)M + 1, 
and the output parameters are 
U_ = array of dimension N where the abscissas of output points 
are to be displayed, 
V = array of dimension N where the ordinates of output points 


are to be displayed. 
This subroutine may also be entered by 
CALL CRVFITUIU,MD,L,X,Y,M,N,X, Y) 


but the input data X and Y are not preserved in this case. 

This subroutine occupies 711 locations on the CDC-3800 
computer. Computation time required for this subroutine on the 
same computer is approximately 


500 + 300 LZ + 50 (ZL—1) (M—1) usec for MD 1, 
500 + 600 L + 75 (E—1) (M—1) usec for MD = 2. 


When: the curve exhibits periodicity (that includes a closed 
curve) and a set of L, data points covers a whole period, two ad- 
ditional data points should be added at each end, a set of L, + 
4 data points be given as the input data points to this subrou- 
tine, and two intervals on each side be discarded from the set of 
output points. 

Test Results. Ali tests were performed on a CDC-3800 com- 
puter. An example of smooth curve fitting by the CRVFIT sub- 
routine for a single-valued function (MD =1) is shown in Table I, 
and for a multiple-valued function (MD =2) in Table II. In each 
table, input data shown in (a) were given to CRVFIT with L = 10, 
M = 5, and N = 46, and values shown in (b) were obtained. 
Also, the data in Table I (a) together with the U values in Table I 
(b) were given to the INTRPL subroutine with L = 10 and N = 
46, and the V values in Table I (b) were obtained. Figure 1 depicts 
the curve fitted to the input data points given in Table I (a) by 
the CRVFIT subroutine with MD = 1, and Figure 2, Table II 
(a) with MD = 2; both curves are fitted with L = 10, M = 20, 
and N = 181. These examples demonstrate one of the properties of 
this method, that the resulting curves are free from unnatural 
wiggles. © 
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Algorithm 


SUBROUTINE INTRPLCIUsL»XsY¥sNoUs V> 
INTERPOLATION OF A SINGLE-VALUED FUNCTION 


THIS SUBROUTINE INTERP@LATESs FROM VALUES OF THE FUNCTION 
GIVEN AS ORDINATES @F INPUT DATA POINTS IN AN X-Y¥ PLANE 
AND FOR A GIVEN SET @F X VALUES CABSCISSAS)» THE VALUES OF 
A SINGLE-VALUED FUNCTION Y 2 YCX)- 


aaQaana a 


THE INPUT PARAMETERS ARE 


IU = LOGICAL UNIT NUMBER @F STANDARD OUTPUT UNIT 

L = NUMBER @F INPUT DATA POINTS 
(MUST BE 2 OR GREATER) 

X = ARRAY @F DIMENSION L ST@RING THE X VALUES 
CABSCISSAS) OF INPUT DATA F@GINTS 
CIN ASCENDING ORDER) 

Y = ARRAY @F DIMENSION L STORING THE Y VALUES 
CORDINATES) OF INPUT DATA FOINTS 

N = NUMBER GF P@INTS AT WHICH INTERP@LATI@N OF THE 
Y VALUE C(ORDINATE) IS DESIRED 
(MUST BE 1 @R GREATER) 

U = ARRAY @F DIMENSION N STORING THE X VALUES 
CABSCISSAS) @F DESIRED POINTS 


THE @UTPUT PARAMETER IS 


V = ARRAY OF DIMENSION N WHERE THE INTERP@LATED Y 
VALUES (ORDINATES) AKE T@ BE DISPLAYED 


aga aA AMANANAAARAANRAA A 


C DECLARATION STATEMENTS 


DIMENSION 
EQUIVALENCE 


XCLISYCLIZUCN) » VON) 
€PO02X%3)s C805 Y3)s2 COls T3) 

REAL MlsM2sM3eM4M5 

EQUIVALENCE CUKsDX)sCIMNsX2sALeM1l>s CIMXs X52 AS9M5)» 
1 C Ja SWs SA)» CYS 2 W202 W 4202) 2 CY Ss W399 O3) 


C PRELIMINARY PROCESSING 


10 LO=L 
LM1=LO-1 
LM2=LM1-1 
LPI=LO+1 
NO=N 
IFCLM2eLT.0) 
IFCNO-LE-0) 
De 11 [=2sL0 
IFCACT=1)-xXC1)) 
11 CONTINUE 
IPV=0 


GQ T@ 90 
GO TO 91 


11595296 


C MAIN D@-LOoP 


D@ 80 K=1sNO0 
UK=UCK) 


C ROUTINE T@ LOCATE THE DESIRED POINT 


20 IFCLM2-EQe-0) GQ T@ 27 
IFCUK-GEeXC(LO)) G@ TO 26 
IFCUK«eLT-XC1)> G@ T@ 25 
IMN=2 
IMX=LO 

21 I=CIMN+IMX)72 
IFCUK-GEeXC1I)) GO T@ 23 

22 IMX=1 
GO Ta 24 

23 IMN=I +1 

24 IFC IMXeGTeIMN) GO TO 2h 
I=IMX 
69 Te 30 

25 I=1 
G@ To 30 

26 IsLPi 
G® T@® 30 

27 «12 

© CHECK IF I = IPV 
30. I FCI.EQsIPV) 68 Te 70 


IPvslL 


C ROUTINES T@ PICK UP NECESSARY X AND Y VALUES AND 
Cc TO ESTIMATE THEM IF NECESSARY 


40 J=I 
IFC JeEQ.1) J=2 
IFC JeEQ-LPI) J= 
AXAI=XCJ-1) 
Y3=YCJ-1) 
X4=XCJ) 
Y4=YCJ) 
A3=X4-X3 
M32 CY 4-Y3) /A3 
IFCLM2-EGQ-0) 
IFC JeEQ02) 
X2=X(€J-2) 
Y2=Y¥CJ-2) 
A2=X3-K2S 
M23 CY3-Y2)/A2 
IFC JeEQ.LO> 


GO TS 43 
G@ TO 41 


G@ Te 42 
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c 


Cc 


c 


Cc 


Cc 


41 X5=XCJ+h> 
YS2YCJ+1) 
A42X5-X4 
M4=CY5-Y4)/A4 
IFC JeEQ.2) M2"M34M3-M4 
G@ TO 45 

42 M 4=M34M3-M2 
GO To 45 

43 M23M3 
143M3 

45 IFC J-LE-3) GO T® 46 
ALSX2-XC€ 5-3) 
Ml®CY2-YCJ-3))/A1 


GO T@ 47 
46 MLaM2+M2-M3 
47 KFC J-GEeLM1 > G6 TO 48 


AS#XCJ+2)°XS 
MS=CYCJ+29-°YS)7A5S 
G@ Te 50 

48 MS=M 44M 4-M3 


NUMERICAL DIFFERENTIATION 


50 TFCIeEQeLPI) G@ T@ $2 
W22ABS(M4-M3) 
W33ABS(M2=-M1) 

SW2W2+W3 

ITF CSWeNE +020) GO T@ 5} 
W220.5 

W320-5 

SwWalte0 

51 T3=CW2*XM2+W34M3) /SW 
IFC I «EQ-1) G@ T@ S4 

se W32ABS(M5-M 4) 

W 42ABS(M3-M2) 

SW2W3+W4 

IFC SWeNE0-0) G@ TO 53 
W3=0-5 

W420-5 

Sweted0 

33 T4= CW3*M3+W 4%M4)/SW 
IFCIeNEeLP1) G@ T@ 60 
T3sT4 
SAZA2+A3 
T 420 ¢ S# 0M 44M S-A2*(AZ-A3) *C(M2-M3)/CSA*SA) > 
X3=X4 
Y32Y4 
A3=A2 
M3=M4 
G@ Te 60 

54 T4=T3 
SAZA3+A4 
T3=0 6 SHCM14M2 -A4*CA3-A4) *CMI9-M4) 76 SA*SA) ) 
X32X3-A4 
Y32Y3-M2*A4 
A32A4 
M3amMe 


DETERMINATION OF THE COEFFICIENTS 


60 O23 (2 60#(M3-T3) #M3-T4)7A3 
Q3=¢-M3-M34+T34+T 4) /CA3*A9) 


COMPUTATION OF THE POLYNOMIAL 


70 DX=UK-PO 
80 VCK )2Q0+DX*(Q1+DX¥*(Q2+DX*O3) > 
RETURN 


ERROR EXIT 


90 WRITE (1Us2090) 
G@ TO 99 
91 WRITE €1Us,2091) 
G@ TS 99 
95 WRITE ¢1Us2095) 
GO TO 97 
96 WRITE (€1Us2096) 
97 WRITE €1Us2097) IsXC1) 
99 WRITE €1Us2099) LO»NO 
RETURN 


F@RMAT STATEMENTS 


2090 FORMATCIX/22H ee . =: 1 OR LESS-/) 

2091 FORMATCIX/22H ee N = O OR LESS-/) 

2095 FORMATCIX/27TH = #e% IDENTICAL X VALUES-/) 

2096 FORMATCIX/33H eke X VALUES OUT OF SEQUENCE.-/) 

2097 FORMATC 6H IT w517910Xs GHAXACID =e E1203) 

2099 FORMATC 6H L 3537, 10X%s3HN Ss 17/ 
1 36H ERR@R DETECTED IN ROUTINE INTRPL) 
END 


SUBROUTINE CRVFITCIUsMDs Lo XsYoMs No Us Vd) 
SMOOTH CURVE FITTING 


THIS SUBROUTINE FITS A SMOOTH CURVE TO A GIVEN SET OF IN- 
PUT DATA P@BINTS IN AN X-Y¥ PLANE+ IT INTERP@LATES POINTS 
IN EACH INTERVAL BETWEEN A PAIR OF DATA POINTS AND GENER- 
ATES A SET OF OUTPUT POINTS CONSISTING OF THE INPUT DATA 
PBINTS AND THE INTERP@LATED POINTS. IT CAN PROCESS EITHER 
A SINGLE-VALUED FUNCTION OR A MULTIPLE-VALUED FUNCTION. 


aanganaaaaanaaana an 
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THE INPUT PARAMETERS ARE 


IU = LOGICAL UNIT NUMBEK OF STANDARD @UTPUT UNIT 
MD = MODE OF THE CURVE CMUST BE 1 OK 2) 

= 1 FOR A SINGLE-VALUED FUNCTION 

= 2 FOR A MULTIPLE-VALUED FUNCTION 
L == NUMBER @F INPUT DATA POINTS 


(MUST BE 2 OR GREATER) 

X = ARRAY @F DIMENSION L STORING THE ABSCISSAS OF 
INPUT DATA P@INTS CIN ASCENDING @R DESCENDING 
@RDER FOR MD = 1) 

Y # ARRAY OF DIMENSIGN L STORING THE ORDINATES @F 
INPUT DATA POINTS 


M = NUMBER @F SUBINTERVALS BETWEEN EACH PAIR QF 
INPUT DATA POINTS (MUST BE 2 OR GREATER) 
N NUMBER @F OUTPUT P@INTS 


= (L=-1)«M+) 


THE @UTPUT PARAMETERS ARE 


U = ARRAY @F DIMENSION N WHERE THE ABSCISSAS OF 
@UTPUT POINTS ARE TQ BE DISPLAYED 

V = ARRAY OF DIMENSION N WHERE THE @RDINATES OF 
@UTPUT POINTS ARE TO BE DISPLAYED 


C DECLARATION STATEMENTS 


1 


DIMENSION XCLI*YCLI2UCND SVEN) 

EQUIVALENCE ¢(M1sB1)s(M2sB2)s(M3sB3)9(M42B4)> 
CX2eP0)5CY2s00)5¢(T2.Q1) 

REAL MisM2sM3eM4 

EQUIVALENCE (€W2sQ2)5(W3s03)5 CALs P2)2 (Blea PI)» 
€CA22DZ) eC SWeRs Z) 


C PRELIMINARY PROCESSING 


10 MDO=MD 


ll 


12 


19 


MDM1L=MDO-1 

LO=L 

LM1=LO-1 

MOsM™ 

MM12M0-1 

NO=N 

IFCMDO-LE.O) Ge Te 90 
IFCMDO-GE-3) GO Te 90 
IFCLM1-LE-0) Ge Te 91 
IFCMM1-LE20) GO TO 92 
IFCNOeNEsLMI*MO+1) GO TO 93 


GO TO (11216)5 MDO 

132 

IFCXC1)-XC2)) 12295214 

D9 13° IT=3eL0 
IFCXCI-19-XC1)) 13295296 
CONTINUE 

G@ Te 18 

D@ :5 1=3sL0 
IFCXCI“21)-XC3)) 96095015 
CONTINUE 

G@ T@ 18 

D@ 17 1=2s,L0 
IFCXCI“1)¢NEeXCI)) GO TO 17 
IFCYCI-1)-EQeYCID) GO TO 97 
CONTINUE 


K=NO+MO 
I=L0+1 
D@ 19 J=1eL0 
K=K=-MO_ 
IsI-1 
UCK) #XC1) 
VCK)aYCT) 
RM=MO 
RM=1-0/RM 


C MAIN DO-LOOP 


20 KS=M0+1 


D@ 80) I=1sLO 


C ROUTINES T@ PICK UP NECESSARY X AND Y VALUES AND 


Cc 


30 


3t 
32 


33 


TO ESTIMATE THEM IF NECESSARY 


IFC eGTel) G@ T@ 40 
X3BUCE)D 

Y3=VC1> 

X43UCMO+1) 

Y 42VCMO4+1) 

AI#X4-X3 

B32zY 4-73 

I1FC(MDM1 -EQ~0) M32B3/A3 
IFCLONEs2) G@ Te 41 
A42A3 

B42B3 

G2 TO €33932)2 MDO 
A22A3+A3-A4 

Al#A2+A2-A3 

B22B3+B3-B4 

B1=B2+B2-B3 

GO T@ €51t+56)s5 MDO 


COLLECTED ALGORITHMS (cont.) 


40 X22X3 
Y2zY3 
X3axX4 
Y32Y4 
Kae2xS 
Yaszy5 
AlzA2 
Bl=B2 
A22A3 
B2=B3 
A32A4 
B3=B4 
IFCIeGE«LM1) G@ Te 42 
4l K5e8K5+M0 
X5e2UCK5S) 
YS5S=2vcKS) 
A4=X5-X4 
BazyS-Y4 
IFCMDM1 ~-£Q-9) M43B4/A4 
G8 TO 43 
42 IFCMDM1 .NE-O) A 42A3+A3-A2 
B4=B3+B3-B2 
43 IFCIL-EQe1) G@ Te 3i 
G@ T@ (50.55). MDO 


C NUMERICAL DIFFERENTIATION 


50 Te2T3 

Si W2=ABS(M4=M3) 
W3=ABS(M2-M1) 
SW2W2 +W3 
IFCSWeNE 000) G@ Te S52 
W2=0-5 
W3=0-5 
SW=1.0 

52 T3=CW2*M24+W9eM3)/5W 
IFCI-1) 80280560 


55 C9S$2=C8S3 
SIN2=SIN3 

56 W2=ABS(A3*B 4-A4*B3) 
W3=ABSCAL*B2-A2*Bt) 
IFCW2+W3.NE.0-0) GO TO 57 
W2=SQRTCA3*A3+4B3*«83) 
W3=SQRTCA2*A2+B2«4B2 ) 

S57 C8S3=W2*A2+W3*A3 
SIN3=W2 *B2+W3*B3 
R=COS3*COS3+SINI*SING 
IFCReEQ-0-0) Ge Te 58 
R=S@RTCR) 
COS3=CBS3/R 
SIN3=S1N3/R 

58 IFCI<1) 8080.65 


C DETERMINATION OF THE COEFFICIENTS 


60 Q2=(2.0*(M2-T2)+M2-T3) /A2 
Q35(-M2-M2+T2+T3) /CA2*A2] ) 
G@ Te 70 


65 R=SORTCA2Z*A2+B2*B2) 
P1I=R*COS2 
P2=3.0*A2-R*(COS2+C8S2+C9S3) 


433-P 5- R1 


P3=A2-P1-P2 
Q1=R*SIN2 
Q2=3.04*B2-R¥(SIN2+SIN2+SIN3) 
93=B2-01-02 

6@ To 75 


C COMPUTATION OF THE POLYNOMIALS 


70 DZ2A2*RM 
Z20.0 
D@ 71 Ji» MMi 
K2K+1 
Z2=Z+DZ 
UCK) sP0+Z 
71 VCK > 2Q04Z*(0142%(024+Z*03)) 
G@ Te 79 


7s Z20.0 
DO 76 Ji» MM1 
K=sK+1 
Z3Z+RM 
UCK) sPO4Z#CP1+Z*CP24+Z#P3)) 
76 VCK) #Q04+Z*(O14+Z2%( 0242403) } 


19 K2K+l 
80 CONTINUE 
RETURN 


C ERROR EXIT 


90 WRITE (1Us2090) 
G@ TO 99 
91 WRITE (CIUs2091) 
G8 T8 99 
92 WRITE (1Us2092) 
G@ TO 99 
93 WRITE (1Us2093) 
GO Te 99 
95 WRITE (€1Us2095) 
G@ Te 98 
96 WRITE (1Us2096) 
G@ Te 98 
97 WRITE (1Us2097) 
98 WRITE (1Us2098) IeXCI>sYCI> 
99 WRITE €1Us2099) MDOs»LO»M0sNO 
RETURN 


C FORMAT STATEMENTS 


2090 FORMATCIX/31H = *¥** MD @UT OF PROPER RANGE-/) 


2091 FORMATCIX/22H = *¥* L = t OR LESS+«/) 
2092 FORMATCIX/22H ex M = 1 OR LESS-/) 
2093 FORMATCIX/25H =e IMPROPER N VALUE-/) 


2095 FORMATCIX/27H 3 ***% IDENTICAL X VALUES-/) 
2096 FERMATCIX/33H ee X VALUES QUT OF SEQUENCE./) 
2097 FORMATCIX/33H = eK IDENTICAL X AND Y VALUES-/) 


2098 FORMATC7H I =e 149 10Xe26HACI) 32 E120 3s 


Hy 10X,6HYCI)> =sE12-3) 
2099 FORMATC7H MD 251498Xs3HL =+1598X>s 
1 3HM =515e8Xs3HN 25 15/ 
2 36H ERR@R DETECTED IN ROUTINE 
END 
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REMARK ON ALGORITHM 433 


Interpolation and Smooth Curve Fitting Based on Local Procedures [E2] 
[H. Akima, Comm. ACM 15, 10 (Oct. 1972), 914-918] 


Michael R. Anderson [Recd 8 Dec. 1975] 
Gettysburg College, Gettysburg, PA 17325 


Subroutine CRVFIT is not written in ANSI Standard Fortran as referenced in [27]. 
In particular, [2, 7.1.2.8] states that the initial value of a DO statement must be 
less than or equal to the value represented by the terminal parameter. DO state- 
ments numbered 12 and 14 violate this rule when L is input as 2, which the limita- 
tions of the program allow. Error conditions of IDENTICAL X VALUES or 
X VALUES OUT OF SEQUENCE may improperly result from the IF tests within 


these two DO statement loops. 


The subroutine may be corrected as follows. Delete the statement numbered 12 
and replace it with the following two statements: 


12 IF(LO.EQ.2) GOTO18 


DO 13 I=3,L0 
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Delete the statement numbered 14 and replace it with the following two statements: 


14 IF (LO.EQ.2) GOTO 18 
DO 15 I=3,L0 


The subroutine, if tested for the case L = 2, would have performed correctly 
because of the implementation of DO statements in Fortran for the CDC-3800, 
which would not have executed the range if L < 3. However, for the IBM System/ 
360 Fortran compilers, the subroutine produces the erroneous messages mentioned. 


With the preceding corrections, the subroutine has been used with much success 
on a wide variety of problems. 
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Algorithm 434 


Exact Probabilities for R x C 
Contingency Tables [G2] 


David L. March [Recd. 24 Nov. 1970 and 7 Mar. 1971] 
School of Education, Lehigh University, 
Bethlehem, PA. 18015 


Key Words and Phrases: probability, contingency table, test of 
significance 

CR Categories: 3.5, 5.5 

Language: Fortran 


Description 

Freeman and Halton [1] derive a general method for computing 
exact probabilities for contingency tables that result if a sample is 
subjected to & different and independent classifications. The follow- 
ing algorithm is limited to the case where k = 2. 

If a sample of size N is subjected to two different and inde- 
pendent classifications, A and B, with R and C classes respectively, 
the probability P, of obtaining the observed array of cell frequencies 
X(x:i;), under the conditions imposed by the arrays of marginal 
totals A(r;) and B(c;) is given by 


eG 


R 
TT Go IT Gd 
a 


j=l 


RC 
NII II (xi!) 


t=1 j=1 


P, = (1) 


Expression (1) is exact and holds if (a) the parent population is 
infinite or the sampling is done with replacement of the sampled 
items, (b) the sampling is random, (c) the population is homo- 
geneous, and (d) the marginal totals are considered fixed in re- 
peated sampling. 

To test the null hypothesis that A and B are independent against 
the indefinite two-sided alternative, the probability P, of obtaining 
an array as probable as, or less probable than, the observed array 
is needed. P, is found as follows: (a) the probability P, of the ob- 
served array is computed; (b) the probabilities for all other possible 
arrays of cell frequencies, subject to the conditions imposed by the 
fixed marginal totals, are computed; and (c) P, is then obtained by 
summing all of the probability values found in (b) that are less than, 
or equal to, the probability P, . 

Method. The method of the subroutine uses the fact that 
expression (1) can be rewritten as 


P, = Q./Rz 
where 
R c 
Il @) IL (es) 
_ iat jai 
Q:= N! 
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which is constant for the given set of marginal totals (r;) and (c;) 
and 


which varies depending on the array of cell frequencies (x;;). In 
order to avoid machine overflow and roundoff error, these compu- 
tations are performed using logarithms. 

The observed R X C contingency table is specified by the 
NR X NC matrix which is partitioned as follows: 


Xiu eee eae Xic ry 
XR eae eee XRC rr 
aera ree. ae a 


After computing the constant term QXLOG and the probability 
of the given table PT, the subroutine assigns to each of the lower 
right (R — 1) X (C — 1) cells the minimum of its corresponding 
row and column totals which is the maximum possible number for 
the cell. These cells are then varied in all possible combinations with 
each cell varied between its maximum number and zero. 

Starting with cell (2,2), the variation is accomplished by sub- 
traction of 1. When the subtraction yields a zero or positive result 
the routine goes to compute the remainder of the cell frequencies. 
When a negative result is obtained, the cell in question, say cell 
(i, /), is reset to the minimum of the corresponding row and column 
totals, 1 is subtracted from cell (i, 7 + 1) or, iff + 1 is greater than 
C, cell ( + 1, 2), and the count down resumes at cell (2,2). If none 
of the lower right (R — 1) & (C — 1) cells yield a zero or positive 
result, the computations are complete and the subroutine returns to 
the caller. For example, if the top line (below) is the cell maximum 
ordered left to right from the (2,2) to the (R, C) cell, the combina- 
tions generated will be 


2 1 1 
1 1 1 
0 1 1 
Z 0 1 
1 0 1 
0 0 1 
2 1 0 
0 0 0 


The column 1 and row 1 cells are filled by subtraction of the 
generated cell numbers from the marginal totals. Since the method 
described above yields illegal as well as legal partitions, it is possible 
to obtain a negative result for one of these cells. When this occurs, 
the routine goes back to get a new set of cell frequencies. Otherwise 
RXLOG is computed. Then, the probability PX is computed and 
added to the cumulative sum PC. If PX is less than, equal to, or, 
to avoid missing one due to computational inaccuracy, slightly 
larger than P7, PX is also added to the significance probability PS. 

Since PC is the probability of obtaining some of the tables 
possible within the constraints of the marginal totals, PC should 
equal 1.0.. The accuracy of the result can be estimated from the 
amount of deviation of PC from 1.0. . 

The floating point logarithms (base 10) of the integer factorials 
are obtained from function FACLOG. For arguments less than or 
equal to 100, the result is obtained from a table that is computa- 
tionally filled on the first reference to FACLOG. Stirling’s approxi- 
mation is used for arguments greater than 100. 
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Results. The algorithm was tested on a CDC 6400 (60 bit word) 
using 2 X 3 (N = 30), 2 X 4 (N = 7), and 3 X 3 (N = 7) con- 
tingency tables. Results for the 2 < 3 tables were verified against 
values separately computed using programs developed by March 
(2]. In several cases PC deviated from 1.0. by 1.0 * 10712. Results 
for the 2 K 4 and 3 X 3 tests were verified by hand computation. 

The author is indebted to the referees for their valuable com- 
ments and suggestions. 


References 

1. Freeman, G.H., and Halton, J.H. Note on an exact treatment 
of contingency, goodness of fit, and other problems of significance. 
Biometrika 38 (1951), 141-149. 

2. March, D.L. Accuracy of the chi-square approximation for 

2 X 3 contingency tables with small expectations. An unpublished 
D.Ed. Diss., School of Education, Lehigh U., Bethlehem, Pa., 
1970. 


Algorithm 
SUBROUTINE CONPCMATRIXsNRsNCoPT»PSsPC) 


INPUT ARGUMENTS. 


MATRIX = SPECIFICATION @F THE CONTINGENCY TABLE. 
THIS MATRIX IS PARTITIONED AS FOLLOWS 


XCL1L eee eXC1C) ROL) 


ACRE Dee ee eX CRC) RCR) 
CCl eeeee CCC) N 


WHERE XCIJ) ARE THE @BSERVED CELL FREQUENCIES, 
RCI) ARE THE ROW TOTALS» CCJ)> ARE THE COLUMN 
TOTALSs AND N IS THE TOTAL SAMPLE SIZE. 
NOTE THAT THE BRIGINAL CELL FREQUENCIES. ARE 
DESTROYED BY THIS SUBROUTINE. 

NR = THE NUMBER @F ROWS IN MATRIX CR=NR-1)- 

NC = THE NUMBER @F COLUMNS IN MATRIX (C=NC-1)- 

Q@UTPUT ARGUMENTS. 
PT = THE PROBABILITY OF OBTAINING THE GIVEN TABLE. 


PS = THE PR@BABILITY OF BBTAINING A TABLE AS PROBABLE 
AS» @R LESS PROBABLE THAN, THE GIVEN TABLE. 


PC = THE PROBABILITY @F OBTAINING S@ME OF THE 
TABLES P@SSIBLE WITHIN THE CONSTRAINTS @F THE 
MARGINAL TOTALS. (THIS SHOULD BE 1-0. DEVIATIONS 
FROM 1-0 REFLECT THE ACCURACY OF THE COMPUTATION.) 


EX TERNALS« 


FACLOGCN) = FUNCTION TO RETURN THE FLOATING POINT 
VALUE @F L@G BASE 10 OF N FACTORIAL. 


AaAAIKRaAaAAANANANMAAMAAANNAAAMAAAANANNMNANAMAANAAAAAA 


DIMENSIGN MATRIXCNR»s NC) 
INTEGER R»Cs TEMP 


Cc 
R=NR-1 
C=NC~-1 
Cc 
C COMPUTE LOG BF CONSTANT NUMERAT®OR. 
Cc 


QXLOG=-FACLOGCMATRIXCNKs NC) > 
D@ 10 I=1+>R . 

10 QXLOG=QXLOG+FACLOGCMATRIXCIsNC)) 
D@ 20 J=1sC 

20 QXLOG=OXLOG+FACLO@GCMATRIXCNR» JD) 


c 
C COMPUTE PROBABILITY OF GIVEN TABLE 
Cc 
RXLOG=0-0 
D@ 50 1=1>R 
D@ 30 J=1sC 
50 RXLOG=RXLOG+FACLOGC(MATRIXCIsJ)) 
PT=10 60**CQXLOG~RXALOG) 
Cc 
PS=0+0 
Pc=0.0 
c 
C FILL LOWER KIGHT (R-1) X ¢€C-1) CELLS WITH 
C MINIMUM @F R@W AND C@LUMN TOTALS 
Cc 
DO 100 L=2sr 
DG 100 J=2,C 
100 MATRIXC Es JDSMINOCMATRIXCIs NC) sMATRIXCNRs JD) 
G@ TB 300 
c 
C OBTAIN A NEW SET OF FREQUENCIES IN 
C LGWER RIGHT ¢CR-1) X €C-1) CELLS 
Cc 


200 DO 220 I=2sR 
DO 220 J=2sC 
MATRIXCIs JOSMATRIXC Is J)-1 
IF CMATRIXCI,J)-GE-0) GO TO 300 
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220 MATRIXC Ls J) =MINOCMATRIXCIs NC) sMATRIXCNRe SDD 
RETURN 


FILL REMAINDER OF @BSERVED CELLS 
eeeeeCOMPLETE COLUMN 1 


aaaa 


300 D@ 320 1=2sKn 
TEMP=MATRIXCIsNC) 
D@ 310 J=2sC 

310 TEMP=TEMP-MATRIXCIs JD 
IFCTEMP-LT-0) GO T® 200 

320 MATRIXC€1s1)= TEMP 


veeeeCOMPLETE ROW 1 


aaa 


D®@ 340 J=1,C 
TEMP=MATRKIXCNR» JD 
DO 330 I=2sR 
330 TEMP=TEMP-MATRIXCIsJ) 
IFCTEMP-LT-O) GO T@ 200 
340 MATRIXC1sJ)= TEMP 


COMPUTE LOG OF THE DENOMINATOR 


aaa 


RXL@G=0-0 
D@ 350 I=1sR 
DO 350 J=1sC 
350 RXLOG=RXLOG+FACLOGCMATRIXC Is J)) 


COMPUTE PX» ADD T@ PS IF PX eLEs PT 
CALLOW FOR ROUND-OFF ERROR) 


aqaaaNnN 


PX=10-0*k(QXLOG-RXL@G> 
_PC#PC+PX 
LFCCPT/PX) 6GT20+99999) PS=PS+PX 
FUNCTIGN FACLOGCN) 
INPUT ARGUMENT. 
N = AN INTEGER GREATER THAN @R EQUAL T@ ZERO. 
FUNCTION RESULT. 


FACL@G = THE L@G T@ THE BASE 10 OF N FACT@RIAL.- 


aaaaaagaan 


DIMENSION TABLEC101) 

DATA TPIL@G/0.39908 993427 
DATA EL@G /0-43429 448197 
DATA IFLAG/O/ 


USE STIRLINGS APPROXIMATION IF N GT 100 


aaa 


IFCN-GT-100) GG Ta@ 50 


L@@K UP ANSWER IF TABLE WAS GENERATED 


aaa 


IFCIFLAG.EQ.0) G@ TO 100 
10 FACL@G=TABLECN+1) 
RETURN 


HERE FOR STIRLINGS APPR@XIMATIGN 


ana 


SO X=FL@OATCN) : 
FACLOG=(X+0-5)*ALGGIOCX) - X*¥ELOG + TPIL@G 
1 + ELO@G/€12.0%X) = ELOG/(3606O¥X*X*X) 
RETURN 


HERE T@ GENERATE LOG FACTORIAL TABLE 


aaAg 


100 TABLEC1)=0.0 
D@ 120 I=2,101 
X*FLOATCI-1) 
120 . TABLECI)=TABLECI-1)+AL@G10CX) 
IFLAG=1 
G@ Te 10 
END 


Remark on Algorithm 434 [G2] 
Exact Probabilities for R * C Contingency Tables [D.L. 
March, Comm. ACM 15 (Nov. 1972), 991| 


D.M. Boulton [Recd. 5 Mar. 1973 and 30 July 1973| 
Department of Information Science, Monash Univer- 
sity, Melbourne, Australia 


Algorithm 434 calculates the exact probability of a two-dimen- 
sional contingency table by generating all possible cell frequency 
combinations which satisfy the marginal sum constraints, and 
summing the probabilities of all combinations as likely or less 
likely than the observed combination. The method used to generate 
all the cell frequency combinations is rather inefficient as it operates 
by generating all combinations which satisfy a weakened set of 
constraints and then rejecting those combinations which violate 
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the actual marginal sum constraints..As the number of combina- 
tions rejected very often far exceeds the actual number accepted, 
the process is very wasteful. 

A more efficient combination generating algorithm is described 
in Boulton and Wallace [1]. It generates explicitly only those com- 
binations which satisfy the marginal sum constraints. In addition, 
because the combinations are generated by a set of nested DO loops 
each with a different cell frequency as its controlled variable, the 
order of generation is such that one combination usually only differs 
from the next in the values of a few cell frequencies in the lower 
right corner of the table. This ordering can be used to reduce the 
time taken to obtain the logarithm of the probability of each com- 
bination. Instead of always summing over all cells, an array of 
partial sums of logarithms of cell frequencies is maintained, and 
for.each new combination only that part of the logarithm which 
has changed is evaluated and then added to the relevant partial sum. 

March’s algorithm has been modified to use the combination 
generating algorithm of Boulton and Wallace and to take advantage 
of the order in which the combinations are generated. A series of 
comparison tests were run on a CDC 3200, and the results of a few 
are shown .in Table I. The modified algorithm was always faster, 
and as can be seen in Table I, the speed improvement can be quite 
large. 


Table |. Times for Evaluating Probabilities 


; Time (sec 
Contingency Sia 


table Probability Original Improved 
8 12, (20) .05767116 .026 O13 
8, 2, (0) 
(16) (14) @0) 
a. 3) 3-8.) 35262364 .290 .095 
2. 3.5% 28) 
(7) (6) (4) (2) (19) 
5, 1, 0, 0 6) 
1, 1, 2, 1 6) . 10625089 3.31 .510 
On) he 9... -} 
(6) (3) (@) @) (4) 
Ze) Dy. O- 0 (2) . 12380952 13.9 .693 
0; i O, 2.2) 
Ox. A0,2 2 FOF 7} 
0, 1, 0, 1 (2) 
(2) (2) (2) (2) (8) 


Finally, it is worth noting that the combination generating 
algorithm of Boulton and Wallace can be systematically extended 
for contingency tables of more than two dimensions. It can thus be 
used as the basis of a subroutine for calculating exact probabilities 
in more than two dimensions. 


References 
1. Boulton, D.M., and Wallace, C.S. Occupancy of a rectangular 
array. Comp. J. 16, 1 (1973), 57-63. 
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Remark on Algorithm 434 [G2] 

Exact Probabilities for R xX C Contingency Tables 
[D.L. March, Comm. ACM 15 (Nov. 1972), 991] 
T.W. Hancock [Recd 16 Nov. 1973, 11 Feb 1974] 
Waite Agricultural Research Institute, The University 
of Adelaide, Glen Osmond, South Australia 5064. 


The above algorithm was presented for computing exact 
probabilities of R < C contingency tables by the method described 
by Freeman and Halton [1]. Clearly inefficient for small matrices, 
this algorithm becomes impracticable for 4 < 4 matrices or larger. 
For this reason the subroutine presented below is suggested. Every 
effort has been made to minimize the number of coding changes 
so that (a) the original work of March can be recognized; and 
(b) the important differences are apparent to anyone wishing to 
compare the two approaches. Row and column dimensions have 
been added to the formal parameters, so that the elements of the 
contingency table do not have to be stored in a contiguous manner. 
(Both are included to ensure compatibility with any type of com- 
piler.) Function FACLOG(N) is exactly as presented by March. 

Acknowledgment. 1 thank Dr. O. Mayo, Waite Agricultural 
Research Institute, University of Adelaide, for suggesting that I 
investigate March’s algorithm. 

Differences in Method Comment cards have been included 
in the listing to locate and describe the differences discussed below. 
These can be identified by an asterisk in column three. Also where 
appropriate this is followed by a number which relates to the order 
in the list below. 

1. All cell frequencies are set to zero initially. 

2. The jump indicator KEY is equivalenced to 1, and cell (2,2) 
(MATRIX(2,2) in the subroutine) is set to —1. 

3. The generation process is accomplished by addition of 1 to 
the appropriate (/,J) cell frequency (where 7 and J proceed from 
2,..., Rand 2,..., C respectively). 

4.  Thevalueof row marginal / is checked against > x= MATRIX 
(,K). Similarly column marginal J is checked. If either marginal 
is less than the appropriate sum, control returns via 8 below to 3 
above. 

5. If indicator KEY equals 2 the cell frequencies preceding cell 
(I,J) are set to zero and the addition sequence recommences from 
cell (2,2) (i.e. 2 above). 

6. However, if KEY equals 1, subroutine INIT is called to gen- 
erate the ‘‘next” matrix of cell frequencies satisfying the margin- 
als. INIT first adjusts the marginals for the cell values in MATRIX. 
Then beginning at the lower left hand corner matrix (i.e. cell (R,1)), 
each cell in turn is increased to its maximum value and its marginals 
reduced. Once the row marginal is reduced to zero the sequence 
jumps to the first cell in the row above. Using this process it is 
possible to progress from one valid set of frequencies to the next, 
thus saving considerable time. 

7. After the probability calculations have been computed, for 
the matrix returned from JNIT, a sequence of matrices is generated 
if the frequencies of cells (1,2) and (2,1) are both greater than 
zero. (As explained by Freeman and Halton the probabilities of 
the members of this sequence of matrices are related and recognition 
of this simplifies their calculation.) 

8. KEY is equivalent to 2, and control returns to 3 above via the 
loop terminator causing cell (J,/)’ to be increased by 1. 

Results The two methods were cornpared on a Control Data 
Corporation CYBER 73 using contingency tables over a range of 
sizes and cell frequencies. Table I summarizes the CP times. Clearly 
the original method becomes unquestionably slow; in fact for a 
4 x 4 matrix, with all frequencies one, this method would attempt 
5® = 1,953,125 matrices before it reached a result. For the same 
matrix the revised method calculates probabilities for 10147 
matrices, all of which are compatible with the marginals. Obviously 
this improved method would be impracticable for contingency 
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NEALE 


Table I. Comparison of Subroutines 
(CP time required in seconds to compute exact probabilities 
for RXC contingency tables; where all cell frequencies are 
chosen equal to one. These are presented to illustrate the rela- 
tive improvement of RXC PRB over CON P. Obviously the actual 
times will depend on the machine used.) 


RXC CONP (by March) RXCPRB 


zx 2 .019 (3T) .018 (3T) 
2x3 012 (9) .010 (7) 

3 2 .018 (9) .016 (7) 
2x5 .073 (8) .054 (51) 
3x2 .093 (81) .055 (51) 
3 xX 3 -110 (256) .055 (55) 
3 xX 4 1.279 (4096) .509 (415) 
4X3 1.344 (4096) .514 (415) 
4x4 Unknown* 15.495 (10147) 
t+ Number of matrices attempted in the calculation 
* 


Computation was still incomplete after 500 seconds 


tables with more degrees of freedom and/or larger total sample 
size, but grouping of classes and alternative statistical tests are 
available in this area (see Goodman {2] or Sugiura and Otake 
[3]). Further it is generally trivial to continue once the tail prob- 
ability becomes large, so that insertion of a statement of the form, 
IF(PS. GT. 0.1. AND. PC. LT. 0.9)RETURN 
in subroutine RXYCPROB prior to statement numbered 32 would 
increase efficiency. 

In all cases; RXCPROB and CONP produced correct prob- 
abilities. (For smaller matrices, the computed probabilities were 
checked by hand; for the larger ones, agreement between the 
methods was taken to indicate the correctness of RXCPROB, 
since March had already tested his subroutine.) The maximum 
deviation of PC from 1.0 was 1.0 X 107°. Although slightly larger 
than reported by March this is a direct result of the increased 
complexity of the tables solved, and in fact CONP gave a similar 
deviation. 


References 

1, Freeman, G.H., and Halton, J.H. Note on an exact treatment 
of contingency, goodness of fit, and other problems of significance. 
Biometrika 38 (1951), 141-149. 

2. Goodman, L.A. On methods for comparing contingency tables. 
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3. Sugiura, N., and Otake, M. Numerical comparison of Im- 
proved methods of testing in contingency tables with small fre- 
quencies. Annals of the Institute of Statistical Mathematics 20 
(1968), 505-517. 


Algorithm 


SUBROUTINE RXCPRBC(MATRIXs NRDs NCDs NR» NCo 
* PTs PSs PC) 
* THIS SUBROUTINE COMPUTES EXACT 
* PROBABILITIES FOR R X C CONTINGENCY TABLES 
* INPUT VIA FORMAL PARAMETERS 
* NRD * THE RCW DIMENSION 
* NCD * THE COLUMN DIMENSION 
NR = THE NUMBER OF ROWS IN MATRIX (CRENR-J).- 
NC = THE NUMBER OF COLUMNS IN MATRIX (C#NC-1). 
MATRIX = SPECIFICATION OF THE CONTINGENCY 
TABLE. THIS MATRIX IS PARTITIONED AS 


FOLLOWS 
XCLs lL IXCL sds cccvcvesvaesX(]sC) XChoNC) 
. re ee . . 
* e eeeveeeereeenes e e 
XORs 1s XCRe 2) a ce cere cece es XC Rs C) XCRs NC) 


XCNPS TS XCNRs 2) a ee ee cece es XCNRIC) XCNRs NC) 
WHERE X€IsJd) ARE THE OBSERVED CELL 
FREQUENCIES, XC(IsNC) ARE THE ROW TOTALS, 
XC(NRsJ) ARE THE COLUMN TOTALS, AND X(NRJNC) 
IS THE TOTAL SAMPLE SIZE. 

NOTE THAT THE ORIGINAL CELL FREQUENCIES ARE 
DESTROYED BY TKIS SUBROUTINE. 


aqgQagoaagaaanaaaaaaagqagaaaaan 
HR HEHRKR HEHE 


aagaaraRaaaagaaagaaagaaa 


aq 
* 


eaaga 


aaaan 
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OUTPUT ARGUMENTS. 
PT = THE PROBABILITY OF OBTAINING THE GIVEN 
TABLE. 
PS = THE PROBABILITY OF OBTAINING A TABLE aS 
PROBABLE AS» OR LESS PROBABLE THAN» THE 
GIVEN TABLE. 
* PC = THE PROBABILITY OF OBTAINING ALL OF THE 
TABLES POSSIBLE WITHIN THE CONSTRAINTS OF 
THE MARGINAL TOTALS. (THIS SHOULD BE 1-8. 
DEVIATIONS FROM 1.8 REFLECT THE ACCURACY OF 
THE COMPUTATION.) 


EXTERNAL S. 

* INITCMATRIX.NRDsNCD»NR»NC) = SUBROUTINE WHICH 
* RETURNS THE *&NEXT* MATRIX TO SATISFY 
* THE MARGINALS. 


FACLOS(N) = FUNCTION TO RETURN THE FLOATING 
POINT VALUE OF LOG BASE 1@ OF N FACTORIAL. 
DIMENSICN MATRIX(NRDsNCD) 
INTEGER Rs» C 
R=NR- J 
C=NC - } 
COMPUTE LOG OF CONSTANT NUMERATOR 
QXLOS = -FACLOG(MATRIXC(NRJNC)) 
DO 16 I#),R 
QXLOG = QXLOG + FACLOG(MATRIXCI.NC)> 
18 CONTINUE 
DO 26 JFI,C 
QXLOG = QXLOG + FACLCG(MATRIX(NRasJ)) 
26 CONTINUE 
COMPUTE PROBABILITY OF GIVEN TABLE 
RXLCG = 2.@ 
DO 4@ 1=1,.R 
DO 38 J=16C 
RXLOG = RXLOG + FACLOG(MATRIXCIsJ)) 
3G CONTINUE 
4@ CONTINUE 
PT = 10-@**(QXLOG-RXLOG) 
PS = Go 
PC = 0-8 
* Je ALL CELL VALUES INITIALLY SET TO ZERO 


MATRIXCIsJ) = @ 
58 CONTINUE 
66 CONTINUE 
* @. EACH CYCLE STARTS HERE 
76 KEY = ! 
MATRIX(€2s,2) = -1) 
3- GENERATING SET OF FREQUENCIES PROGRESSIVELY IN 
LOWER RIGHT (R-1) * (€C-1) CELLS. 
DO 1€8 I#2,R 
DO 158 J=2,C 
MATRIX(€I4J) = MATRIXCIsd) + 1 


* 


* 4s CHECKING SUMMATIONS .LE. RESPECTIVE MARGINALS 
* I.E. (SUM OF ELTS. J TO C IN ROW I) -LEe 
* MATRIXCIZNC) AND (SUM OF ELTS. 1 TO R IN COL. 
* J)eLE. MATRIX(NRsJ) 

ISUM = @ 

JSUM = @ 


DO 8B MeJ,C 
ISUM = ISUM + MATRIXC1sM) 
8B CON TINUE 
IF CISUMe-GT.MATRIXCI,NC)) GO TO 138 
DO $8 KeI.R 
JSUM = JSUM + MATRIX(Ksad) 
Sf CONTINUE 
IF (JSUMeGTeMATRIX(NRsJ)) GO TO 138 
* S» JUMP TO STATEMENT 17@ WHERE ALL CELLS PRIOR TO 
* MATRIXCIsJ) ARE SET TO ZERO. 
IF (KEY-EQ-2) GO TO 176 


Ip =! 

JP=zJd 
* 6. CALL SUBROUTINE INIT TO FIND THE NEXT BALANCED 
* MATRIX 


CALL INIT(MATRIXs, NRD» NCDs NR» NC) 
COMPUTE LOG OF THE DENOMINATOR 

RXLOG = @-@ 

BDO 118 Kel.sR 

DO 188 Me}.C 
PXLOG = RXLOG + FACLOGC(MATRIX(KsM)) 
188 CONTINUE 
11g CCNTINUE 
* COMPUTE PX- ADD TO PC AND ALSO PS IF PX -LEs PT 
CALLO’ FOR ROUND-OFF ERROR) 

PX = 16-0" (QXLOG-RXLOG) 

PC = PC + P¥ 

IF €CPT/PX)+GT+GeSSSS9) PS = PS + PX 
* 7. IF PCSSIBLE A SEGUENCE OF MATRICES AND 
* ASSOCIATED PRCBABILITIES (P%sPC ANL PS) ARE 
* GENEPATED BY MANIPULATING CELLS (141). (122),4 
* (221) AND (2,2) CSIMILARLY ALLOWING 
* FOR ROUND-OFF ERROR) 
126 IF (MATRIX(),2)-LTe} .GR- 

* MATRIXC2s})-LT.-1) GC TO 142 
MATRIXCHs1) = MATRIXC}s1) + 3 
MATRI¥ (2,2) = MATRIX(2,2) + I 


PX = PX*FLOATC(MATRIX( 1s 2))*FLOATCMATRIX (251) 


* /FLOGATCMATRIX(}413)9)/FLOATCNATRIX( 2s 2) ) 
PC = PC + PX 
IF CCPT/PX) -GTeBeSSSSS) PS = PS + PX 
MATRIX(€1s2) = MATRIX()].22) - J 
MATRIXC2s1) = MATRIX(2,1) - 1 
GO TO 12e 


0 
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aan 


Q 


aa 


13@ IP = |] 

JP = J 
* 8. KEY SET TO € AS CYCLE COMPLETED 
142 KEY = 2 


159 CONTINUE 
168 CONTINUE 
PETURN 
* ALL CELLS OF MATRIX PRIOR TO THE (1sJ)THe ARE 
* SET TG 7ERO. 
17@ CO 18@ M=2,.uUP 
MATRIXCIPsM) = @ 
18@ CCNTINUE 
IP = IP - ] 
DO 2@2 K=1,1P 
DO 1$@ M=2,€ 
MATRIX(CKsM) = @ 
1S¢ CONTINUE 
206 CONTINUE 
GO TO 78 
END 


SUBROUTINE INITCMATRIXs NRD» NCDs NR, NC) 
* THIS SUBROUTINE RETURNS THE *NEXT* MATRI¥ TO 
* SATISFY (1) THE MAPGINALS AND (2) THE SEQUENCE 
* OF GENERATION DEFINED IN SUBROUTINE RXCPRB. 
DIMENSICN MATRIXCNRD,NCD)s» MROW(58)5 MCOL(5B) 
IMTEGER Ry C 
De NR - ] 
CH= NC - ] 
EQUIVALENCE MROW AND MCOL TO ROW AND COLUMN 
MARGINALS PESPECITVELY. 
DO 1@ K=I.R 
MATRIX(Ks1) = @ 
MROWCK) = MATRIXCKSNC) 
1@ CONTINUE 
DG 26 nN=14C 
MCCL(M) = MATRIX(NRSM) 
28 CONTINUE 
* FOR EACH R80, SUBTRACT ELEMENTS 2 TO C FROM MROW 
DC 40 YH2,R 
DO 38 M=2.C 
MRC CK) = MROWCK) - MATRIX¢KsM) 
38 CON TINUE 
48 CONTINUE 
* FOR EACH COLUMN, SUBTRACT ELEMENTS 2 TO R FROM 
* MCCL 
DO €4 M=2,C 
DC 58 K=2,.R 


% 
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MCOL(M) = MCOL(M) - MATRIXCK,sM) 
58 CCNTINUE 
6@ CONTINUE 


C * FORMING *NCXT BALANCED* ARRAY 


aaana 


DC 98 121,F 
IR = NR - I 
DC 8@ J=1.,C 
MIN = MIN@CMRCWCIRD»MCOLCY) > 
IF (MIN-EQ.@) GO TO 78 
MATRIXCIR,J) = MATRIXCIRsJ) + MIN 
MROYCIR) = MROWCIR) - MIN 
MCOL¢J) = MCOL¢(J) - MIN 
7@ IF (MROWCIR)D.EQ-@: GO TO SO 
344) CONTINUE 
S$@ CONTINUE 
RETURN 
END 


FUNCTICN FACLOG(N) 


INPUT ARGUMENT. 


N = AN INTEGER GREATER THAN OR EQUAL TO ZERO. 


FUNCTION RESULT. 


FACLCG = THE LOG TO THE BASE 1@ OF N FACTORIAL. 
DIMENSICN TABLE(1@1!) 

DATA TPILOG /86-39S08S¢6342/ 

DATA ELOG /@.4342644816/ 

DATA IFLAG /@/ 


USE STIRLINGS APPROXIMATION IF N GT 186 


IF (N-GT.18@) GG TO 22 


LCOK UP ANSWER IF TABLE WAS GENERATED 


H 


IF CIFLAG-EQ.8) GO TO 38 
1@ FACLOG = TABLECi+]) 
PETURN 
ERE FCR STIRLINGS APPROXIMATION 
2@ * = FLOAT(CN) 
FACLOG © (X+0.5)*ALOGI@(X) - X*ELOG + TPILOCG + 
* ELOG/(1260#X) - ELOG/( 368.0%"XeXeX) 
RETURN 


HERE TO GENERATE LOG FACTORIAL TABLE 


3@ TABLE(I]) = 6.8 
DO 4@ 172,18) 
X = FLOATCI-1) 
TABLECI) = TABLECI-1) + ALOGI8¢X) 
4@ CONTINUE 
IFLAG = } 
GC TO 18 
END 
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REMARK ON ALGORITHM 434 


Exact Probabilities for R X C Contingency Tables [G2] 
[D.L. March, Comm. ACM 15, 11 (Nov. 1972), 991 ] 


D.M. Boulton [Reed 25 June 1975 ] 
Department of Computer Science, Monash University, Clayton, 3168, Victoria, 
Australia 


Two previous Remarks, by Boulton (1974) [1] and by Hancock (1975) [2], 
have shown that Algorithm 434 by March (1972) is rather inefficient, especially for 
contingency tables with many degrees of freedom. The inefficiency lies in the method 


Table II. Times in Seconds for the 
Contingency Tables in Hancock{2] 


Table I. Times in Seconds for the 
Contingency Tables in Boulton [1] 


RxXC Hancock Boulton RxXC Hancock Boulton 
2X2 0.024 0.018 2X 2 0.008 0.007 
2X 4 0.16 0.10 2X 3 0.023 0.016 
3x 4 1.37 0.68 3X 2 0.023 0.016 
4X 4 2.21 1.05 2X 5 0.21 0.11 
—— 5X 2 0.21 0.10 
3 xX 3 0.22 0.12 
3 xX 4 2.08 0.98 
4X 3 2.08 0.99 
4x4 63.5 25.5 
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of generating all those cell frequency combinations that satisfy the marginal sum 
constraints. 

The purpose of this remark is to compare directly the speeds of the above two 
more recent algorithms (in the Remarks). The comparisons were carried out on a 
Hewlett-Packard HP2100A computer with fully extended arithmetic and micro- 
programmed array referencing and subroutine entry. In Table I, times are given for 
the four examples originally used in Boulton. In Table II, times are given for the 
examples presented in Hancock. 

The algorithm by Boulton is always faster, and for all but 2 x 2 tables the 
improvement is quite significant, being more than a factor of 2 for contingency 
tables with several degrees of freedom. 

The same set of tests were run again on the HP2100A with standard firmware, 
i.e. without microprogrammed array referencing and subroutine entry. The times 
were then found to be even more in favor of Boulton’s algorithm. The speed ratio 
increased to 3 for Hancock’s 4 X 4 table. 
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Algorithm 435 


Modified Incomplete Gamma 
Function [S 14] 


Wayne Fullerton [Recd. 30 Dec. 1970 and 12 April 
1971] 

Department of Astronomy, University of Michigan, 
Ann Arbor, MI 48104 


Key Words and Phrases: modified incomplete Gamma function, 
incomplete Gamma function, chi-square distribution function, 
Poisson distribution function 

CR Categories: 5.13 

Language: Fortran 


Description 
The incomplete Gamma function is defined by 


y(a, x) -| yo evdy, x20. (1) 
0 


If x is allowed to assume negative values and if the absolute value of 
y is substituted for y in the term y2—!-, then a modified incomplete 
Gamma function may be defined by 


Zz 
y'(a, x) = [ | y |e. e dy, —0o <x < ow, (2) 
0 
Note that if x is less than zero, the above is equivalent to 
|2| 
y'(a,x) = — [ yatetudy, x<0. (3) 
0 


The function subprogram GAMINC given below computes the 
more general function 


zQ 
GAMINC (a, x1 , x2) & e”! [ | y |e) eY dy 
Jay (4) 


a e"[y'(a, x2) a y'(a, x1)). 


For x, equal to zero, GAMINC is just a modified incomplete 
Gamma function. And if x: is also greater than or equal to zero, 
then GAMINC is simply an incomplete Gamma function. 

The need for the function GAMINC arises in the calculation of 


Zz Z 
l= i : et tbz exp 4 — i ext82' _az' | az (5) 
zy ‘ —sin(d) J —sin(3)’ 


The facilities of the University of Michigan Computing 
Center were used for this work and the research was supported by 
the National Science Foundation through a Traineeship granted 
to the author. Author’s present address: Los Alamos Scientific 
Laboratory, Los Alamos, NM 87544. 
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where # is an angle between —7z and +7 not equal to zero. The two 
constants 5 and @ are of the same sign. The integral in the exponent 


can be done explicitly to yield 


et T0tX1 22 — et tBZ ( 
l= OTe OxD a 6 
—sin (8) Jz, |= a ie ) 
where 
at+bZ i 
X= eee 
—b sin (9) 
and 
Zz / 
Tm) = [ ex tB2’ at 
5 —sin (J) 


A change of variables finally reduces the above integral to 


a9 
I = e* 7° |b sin (9) [8/01 -e—a8lb tal yprreray| (7) 


vy, 


The quantity in brackets is GAMINC(6/b, Xi, X2). 

The approximations of +’ (a, x) used in GAMINC are valid only 
for 1. Sa S 2. (See Table I.) The user may compute GAMINC 
for other values of a with the aid of the recurrence relation (mm is a 
positive integer such that 1. Sa < 2). 


GAMINC(m + a, x1, X2) 
= (m + a — 1) GAMINC(m + a — 1, m1, x2) + [| |" (8) 


= | Xe eek a) | 
In general for x1 > O and x, > 0, 


GAMINC(m + a, x1, X2) 
= (m+ a-— 1):(m+a-— 2) --- (a):GAMINC(a, x, x2) 


+ [xa [2 [| on fm +3 (m +a — 1) 

canta fialr] 
— [xale[laale + 2m ta— 1) 

ae ++ (m + a — i) | x2 |e Jerr-=2, 


The recurrence relation should be applied in the other direction 
if m + a is less than 1. 

For large values of a (a > 15.) in the incomplete Gamma 
function, the user is referred to the algorithm by Takenaga [5]. 

In all cases we use approximations which are functions of both 
a and x, so that it is not necessary to compute and store an econo- 
mized polynomial for each value of a. The overhead in execution 
time for doing this is not significant since many-term expressions 
would result anyway. Also exponentiation and real numbers 
raised to a real power require 30 percent of the total computing 
time. Multiplying y’(a, x2) — y'(a, x1) by e71 saves two exponentia- 
tions and greatly extends the range over which the difference can be 
represented without over- or underflows occurring. Four separate 
approximations are used to compute y’ (a, x). 

Region 1. For x > 5.0, the complimentary incomplete Gamma 
function is computed by using a continued fraction approximation 
{1] 


iad ‘ced 


x+T7;’ 


T'(a) — y/(a, x) = (10) 
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where 
eo Sn 
1+ i/(x + Ti+)’ 


and where I'(a) is the complete Gamma function of a. Only terms 
through 73 are used explicitly. 7, is taken into account in an ap- 
proximate way by setting 7, = 1.7, which is its approximate value 
when x ~ 5.0. If both argument values are greater than 5.0, then 
significance is maintained by subtracting the complementary func- 
tions, not the functions themselves. 

Region 2A. For —12. < x < —1. and 1. < x < 5., the con- 
tinued fraction approximation given by Luke [3] is valid. We 
rewrite the approximation in the form 


x-[x |e he 


11 
ati (11) 


y'(a, x) = 


where 


a (a+n—1)-x 
"(a + 2n — 2)-[a + 2n — 1 + (n-x)/((a + 2n)-Trt1)] | 
Only terms through 7; are used explicitly, and 7s is computed by 
using the approximate expression 

Ts = 1.00150 — 8.95-10%-a 4+ x (12) 


- (—0.0337062 + .0004182 “a + x 
- (000999294 — .000104103 -a)). 


Tn =1 


On a computer with 32 bit words, eq. (11) must be evaluated in 
double precision in order to maintain approximately six significant 
figures of accuracy. On an IBM 360 double precision evaluation 
can be forced by including more than seven digitsin a constant as is 
done in eq. (12). Of course, double precision evaluation is unneces- 
sary if there are somewhat more than 32 bits per word. Because 
the calculation of the approximation of y’(a, x) is a rélatively time 
consuming operation, a separate approximation is used when 
[x| <1. 

Region 2B. For —1.0 < x < 1.0, a change of variables is made 
so that . 


x y a1. 
y'(a, x) = laches | (2) e ut dy, (13) 
aN 
or 
1 
y'(a, x) = x-|x Pe (1 — p)*"!-e? dp. (14) 
0 


Because —1.0 < xp < 1.0, e*? may be adequately approximated 
with a polynomial. A Chebyshev approximation of nine terms yields 


a maximum absolute error less than 10-7, which is adequate to — 


insure that the maximum relative error of the integral ordinarily be 
much less than about 10~*. Since the relative error in the single 
precision evaluation of | x |*e-* is usually ~1-10-° for a machine 
with a 32 bit word length, the above error bound seems entirely 
reasonable. Write 


M 
e => b,Zi, -10< Z< 10. (15) 
im 
Then 
tis Died fete ee Mn ee 
EES imo (i + a)Gi +a — 1) +--+ @)’ ea 
Finally we may define 6,’ = 5;-i!, and write 
b;’ x? 
(a, x) = x+|x [91 -e* os (17) 


ino (i + a)-(i+a—1)--- (a) 
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Note that if the series was not economized, all the 5,;’ would be 
unity. But because a finite Chebyshev economized series is em- 
ployed, the 5,’ are only approximately unity. 

Of course, it would be possible to extend the Chebyshev ap- 
proximation to include the entire range —12. < x < 5.0; however 
the many-term result would have to be evaluated in double pre- 
cision in order to insure a relative error <10-®. It would also be 
possible to decrease the range of validity of the ascending continued 
fraction approximation; however the other approximations would 
then have to be more complicated and would require an accordingly 
longer time to evaluate. Such a change was judged inadvisable since 
the function is used predominantly with arguments whose absolute 


Table I. Relative Errors of GAMINC(A,0.,X) in Units of 
the Sixth Decimal Place 


A 
X Sa ce BO hacen 
0.5 0.8 |1.1 [1.4 [1.7 [2.0 [2.3 [2.9 |3.5 
—14. 16.57 |3.37 [0.27 |0.79 |1.11 [0.81 |1.87 |1.35 /1.03 
—12. 1.03 {1.61 [0.42 10.47 |1.23 |0.95 |2.70 |1.69 |2.70 
— 8. 0.09 |1.04 |0.34 |0.69 |1.69 |1.07 |2.01 |2.11 |2.24 
— 4. 0.38 |0.82 |0.15 |0.79 |0.67 (0.19 {0.51 |1.59 [0.75 
— 2. 1.22 |0.78 |0.50 |0.11 |0.07 |0.01 |0.09 (0.11 0.51 
— 0.5 0.65 [0.43 [0.21 |0.32 [0.23 (0.15 |0.18 [0.65 |0.78 
+ 0.5 1.03 {0.42 (0.66 {1.19 {1.25 10.77 [0.29 (0.06 |0.35 
2. 0.53 |1.57 |0.77 |0.26 |0.04 |0.22 |0.05 |0.23 |0.40 
6. 0.44 |0.38 (0.06 |0.01 10.06 |0.06 [1.21 |0.36 |2.21 
10. 0.63 |0.73 |0.03 {0.02 0.08 (0.03 [0.60 |0.04 [0.26 


Table II. Execution Times of GAMINC(A, X, , X2) in Milli- 
seconds 


—12. < %& <5 
X2 < —12. Xe > 5 
[| Xe) <1. ][X|> 1. 
xX < -12. 1.1 1.4 2.0 0.6* 
od [M| <1.]1.4 1.3 2.0 1.4 
nom <s.{iel st 2.0 2.0 2.4 2.0 
mM > 5. 0.6* 1.4 2.0 1.1 
Xi =0. 0.8 0.9 1.4 0.8 


* Only the modified incomplete Gamma function for X = X1 was calculated, 
because | Xz: — Xi | was greater than EXPLIM. 


values are large. Also, the present choice of ranges and approxima- 
tions provides for the accurate representation of y’(a, x) further 
beyond a = 2. than would many other choices. 

Region 3. For x < —12., the asymptotic expansion 


y (a, x) ~w T'(a) 


—1 
— xibe[ ie + ; 
x x 


(a — 1)-(a — 2) (18) 
4... 
is used. Shank’s e; process [4] is applied once to the six-term series 
in order to accelerate convergence. 

The function subprogram is invoked by a reference of the form 


GAMINC(A, X1, X2, GAM), 


where GAM is the user-supplied value of the complete Gamma 
function of A. I'(a) is now commonly a part of the standard Fortran 
library of functions. If it is not, one of the several algorithms de- 
scribed in this department may be used, or GAMMA given in 
IBM’s Scientific Subroutine Package (cf. Hastings [2]) may be 
used. 
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Table I presents the absolute value of the relative errors (multi- 
plied by 10°) of y’(a, x) for selected values of a and x. Because 
| x |*e-* was not calculated in double precision, these errors are the 
total errors and not the errors in the approximations. The “exact” 
values were found by directly summing the series 


N —y)t 
heed cea 


in double precision. N was chosen so that the contribution of the 
Nth term was less than 2- 10~ times the sum of the previous N terms. 
Single precision approximations were used to represent a and x in 
order to insure that the series and the subprogram gave +’'(a, x) 
for the same parameter values. The subroutine has been used 
extensively to compute a three-fold integral which includes numer- 
ous cases of eq. (5) as a part of the integral. Independent numerical 
integration results are in agreement with subroutine results to 
within three significant figures—the accuracy of the numerical 
integration. Table II gives the average execution times in milli- 
seconds of the subroutine for various argument combinations. The 
times are for an IBM 360/67, which, for comparison, exponentiates 
in approximately 0.1 milliseconds. 

Acknowledgments. It is a pleasure to thank Dr. Carl deBoor 
for commenting on a draft of this paper. 
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Algorithm 
FUNCTI@N GAMINC CAs X1sX2s GAM) 


COMPUTE THE DIFFERENCE BETWEEN TWO MODIFIED INCOMPLETE 
GAMMA FUNCTI@NS FOR ¢A»X1> AND (CAsX2@) THEN MULTIPLY BY 
EXPCX1)~. THAT ISs COMPUTE THE INTEGRAL OF ASS(X)**(A-1.) 
*EXPCX1-X) FROM X1 TO X20 1F Xi .GTe X2s THEN X1-X2 MUST 
BE «LE. EXPLIM. 

EXPLIM CAN BE.A MACHINE DEPENDENT CONSTANT WHICH PREVENTS 
EXPONENTIATI@N @VER- AND UNDERFLOWS. IT IS USED HERE TO 
SUPPRESS. THE CALCULATION @F MIGAMCAsX2) WHEN THE VALUE OF 
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MIGAMCAsX2) IS INSIGNIFICANT. THIS USAGE REQUIRES X2 + 
EXPLIM «GE. X1i- CMIGAM IS AN ABBREVIATION FOR MODIFIED IN- 
COMPLETE GAMMA FUNCTION.) 

GAM IS THE COMPLETE GAMMA FUNCTIGN OF A SUPPLIED BY THE 
CALLING PROGRAM. 


FOR X eGT. Se» GAM=MIGAMCAsX) IS COMPUTED WITH A CONTINUED 
FRACTI@N APPROXIMATION» FOR ABS(X) eLEe 160% THE INTEGRAL 
IS TRANSFORMED AND EXP(Q@) IS APPR@XIMATED WITH A CHEBYSHEV 
SERIES S@ THAT THE NEW INTEGRAL MAY BE DONE ANALYTICALLY- 
FOR X eGT. -126 AND X eLTe Se CABSCX) eGTe 16002 A CONTIN- 
UED FRACTIGN APPROXIMATIG@N IS USED. FINALLY FOR X «LEe 
~12es THE ASYMPTOTIC EXPANSION IS USED. 


SGN IS A SWITCH WHICHs IF NONZERO, INDICATES WHETHER GAM 
SH@ULD BE ADDED OR SUBTRACTED FROM AN INTERMEDIATE RESULT. 


AQaAAAARANAAANAANRAAAAA 


DATA EXPLIM/20-/ 
Z=X1 
SGN20. 
TIM=-}. 
EXPDIF=1.0 

5 IF (Z .NE- 0.) GO TO 10 
GAM1=0. 
SGN2SGN+TIM 
G@ Te 40 

10 IF (Z «LE» 5-) GO TO 20 

C USE EQUATION 10. 

GAML=<-EXPDIF&ZHHA/SCZ HCL et AD/ CL et ho ACZ HC 2 e-A/C 1 a t2e 
1 SOZ#CDe“AY/ C1 oe ¢Ze/CZ41079999))? 
G@ TQ 40 

20 AZ=ABS¢Z) 
IF (Z «LE+ -12-6) GO TO 30 
SGN=SGN+TIM 

C USE EQUATION 17- 

IF (AZ eLEs 16) GAMI2EXPDIF*Z/A *AZ¥*CA-1.-) 

*C1e HZ/CAFL o> OC CIIIIIIIFZ/CA42 oD 
#609999999 +Z/CAt3e) #C14000008+Z/( A+ 4.) 
#€16000005 #+Z/CAF5e) €6 0999431 64Z/CAT6e) 
#Co9995587 *Z/CASTe) #010031 68 44Z/ 0 A486) 
*1.028125))9299)) 

C USE EQUATIONS 11 AND 12. EVALUATION MUST BE DONE 

C IN DQUBLE PRECISION IF COMPUTER, HAS 32 OR FEWER BITS 

C PER WORD. ON AN IBM 360s De Pe EVALUATION IS FORCED 

C BY THE D. Pe CONSTANTS IN CONTINUATI@N CARD 9- 

IF CAZ oGTe 15) GAMISEXPDIF&Z/A #¥AZ#*CA-16) 
7Cle> AD 42/0 A *CAt Let Z/CCAt 20) 
HCL CAFL OD BZ/SC CAH Zed RCAF GetQeMZ/C CAF 4s) 

60) 
Be) 


VbOD = 


RCD eH CAFZe HZ/C CAT Fed BCAt So t3oeZ/C CAF 
HCL a= CAFG* DHZ/SCCAt Ged HCAH Tet 4e8Z/ 0 CAF 
RCL CA+ 40 BZ/C CAH Be d*CAt Fet5eoB*Z/C CAFO“) 
HOD CAFS se DHZ/SCCAFN0 0 DHCAFI 1 +60 #Z/ 0 CAF12 6) 
HCL CAG 0 IHZ/SCCAF1L2 0 DRCATTS 0+ 7 0 #Z/ (6 CAFE AS) 
*C1 6001 50-AS8 e9SE“5 +Z%C- -0337062+Ae 0004162 
+Z%( 200099929 4-A%2000104103)>) >22)2 922) 992) 
>>>> >>> 299) 9279 
G8 Te 40 
C USE EQUATION 18 AND SHANK-S E1 PR@CESS ONCE. 
30 GAM 1 R-EXPDIFSAZHECAH1L oe DEC Lo HCA Lede Det CAW2e)% 
LD CDH CAKDe ECL HCA He ECD et Chim Se SC ZOAFGe)) 
@ /2)9/2)9/Z)/2) 
40 IF (TIM -GT» 0+) GO TO S55 
GAMINC#GAM1 
IF CABSCX1-X2) -GT- EXPLIM) G@ TO 50 
C IF TRUEs CONTRIBUTION AT X2 IS eLTe 16E-7 *CCONTR AT X1)s 
C PROVIDED X2 «-GT+ Xie 
Z=x2 
EXPDIFsEXPC(X1-X2) 
TIM=1. 
G8 Te § 
50 GAM1=0. 
$s GAM INC=GAM1~GAMINC 
IF (SGN «NE» 0.) GAMINC=GAMING~SIGN( GAM#EXPC(X1)»sSGN) 
RETURN 
END 


PwYUNQrausBWwNH— 


[Wayne Fullerton, Comm. ACM 15, 11 (Nov. 1972), 993-995] 


Andrew Y. Schoene [Recd 18 May 1977 and 13 October 1977] 
Research Laboratories, General Motors Technical Center, General Motors Cor- 


poration, Warren, MI 48090 


The following changes were made to ACM Algorithm 435: 
(1) .LE. in the line labeled 10 was changed to .LT. to conform with the 


algorithm presented in the text. 


(2) .LE. in the line following the line labeled 20 was changed to .LT. This 
change is recommended because the continued fraction [eq. (11)] is more accurate 
than the asymptotic expansion [eq. (18)] at X = —12.' 


' Equation numbers in this Remark refer to those in ACM Algorithm 435, referenced above. 


COLLECTED ALGORITHMS (cont.) 


Note, also, that the expression for T; following eq. (10) contains a misprint: the 
numerator should read zi — a rather than i + a. 

With changes (1) and (2) the algorithm was executed on an IBM 370/168 using 
the Fortran H extended (Opt = 2) compiler, and Table I of Algorithm 435 was 
approximately reproduced (see Table I of this Remark). 

The proposed method for extending the range of applicability of GAMINC is, 
however, not entirely satisfactory. It is the purpose of this Remark to show how 
Fullerton’s methods may be successfully employed to compute his modified 


incomplete Gamma function for an extended parameter range. A Fortran function. 


subprogram GAMDRYV which accomplishes this is included here; it serves partly 
as a driver for GAMINC and should be a useful companion to it. 

To compute GAMINC(A, X1, X2) for 2 < a < 15, Fullerton suggests the use of 
forward recursion. However, satisfactory accuracy cannot be maintained for all 
values of the parameters due to numerical instability of the recursion. For 
simplicity we consider only the modified incomplete Gamma function defined by 
Fullerton as G(a, x) = f*|y|*"' exp(—y) dy. Using the methods of Gautschi [1], 
forward recursion for G(a, x) can be shown to be numerically unstable for x > 0 
and for x < 0 with a > |x|. For example, computing G(12.5, 2) by double-precision 
forward recursion starting from G(1.5, 2) yields a value with the incorrect sign. 

While recursion cannot be used indiscriminately, it is possible to extend 
G(a, x) to the range 2 < a < 15 while maintaining approximately six-significant- 
digit accuracy. This can be done most simply by dividing the x-axis into three 
regions and using a different extension in each region. This task can be appreciably 
simplified by evaluating the term |x|*~' in eqs. (11) and (17) of Algorithm 435 in 
double precision. The Fortran function subprogram GAMDRV(A, X), when used 
in conjunction with a version of GAMINC modified as suggested above, will 
compute G(a, x) for 1 = a < 15 and —EXPLIM <= x < o to an accuracy of 
approximately six significant digits. EXPLIM is a machine-dependent constant 
(with the value 20. for the IBM 360/370 series) used in GAMINC to prevent 
exponent overflow. The extensions employed by GAMDRYV are sketched by 
region as follows. 

Region 1: x = 5. GAMDRY also makes use of the complementary incomplete 
Gamma function denoted by CG(a, x) =ffy*" exp(—y) dy and its continued 
fraction approximation [eq. (10)] from Algorithm 435. Since up to three digits of 
accuracy may be lost in the subtraction G(a, x) = Gamma(x) — CG(a, x) for 
values of a near 15, it is necessary to use double precision exclusively in this 
region. Five terms of eq. (10) are used, with 7; represented by a linear function 
of a, selected to fit for x = 5,2 = a = 3. After subtraction from Gamma(x) this 
basic approximation yields six-digit accuracy in the region x = 5, a = .5 (x + 4) 
(this bound is slightly conservative to simplify the code). For larger a, the 
recurrence relation CG(a + 1, x) = a-CG(a, x) + x*-exp(—x) is employed after 
first reducing a to get a sufficiently accurate starting value. 

Region 2: -12 = x <5. If |x|*~' in eqs. (11) and (17) of GAMINC is evaluated 
in double precision as suggested above, then GAMINC achieves. approximately 
six-digit accuracy for 2 < a < 15. On the assumption that this has been done, 
GAMDRYV calls GAMINC to obtain the value, If GAMINC is not so modified, 
then as a very rough approximation the relative error increases linearly with a, 
reaching levels of 20 x 107° for a > 10. 

We consider further the evaluation of eqs. (11) and (12). On the IBM 370 series 
the double-precision constants in eq. (12) cause some subexpressions of eq. (11) 
to be evaluated in double precision while others involving only a and z are 
evaluated in single precision and the results subsequently extended to double 
precision. If double precision is used for the entire expression (including |x|*~’), 
then only six terms of the continued fraction are required to achieve six-digit 
accuracy throughout the entire range 1 <= a < 15. A slight complication in the 
coding is that different approximations to Ts must be used for x < 0 and x > 0. 
The following were obtained by a least squares fit to computed values of 7¢ for 
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COLLECTED ALGORITHMS (cont.) 435-P 5- 0 
Table I. Relative Errors of GAMDRV (x 10°) 


a 

x 1.50 5.50 8.00 11.00 14.50 
—18.00 0.61 2.57 0.87 1.19 3.63 
—14.00 0.10 1.40 0.85 1.03 2.79 
—12.00 0.14 0.07 0.12 0.12 0.30 
— 8.00 0.65 0.12 0.01 0.57 0.00 
— 4.00 0.21 0.36 0.12 0.15 0.18 
— 2.00 0.12 0.10 0.05 0.01 0.25 
— 0.50 0.15 0.32 0.12 0.14 0.25 
0.50 0.84 0.75 0.59 0.29 0.63 
2.00 0.17 0.36 0.29 0.15 0.18 
5.00 0.03 0.62 0.18 0.00 0.33 
7.00 0.03 1.64 0.05 0.02 0.02 


Table II. Execution Times of GAMDRYV in Milliseconds on the IBM 370/168. 


(The numbers in parentheses represent the original GAMINC values.) 


x 1.5 5 10. 15 

X<—-12. .21 (.20) 23 36 39 

(-12..5 X<-1L1<X .27 (.28) .28 .28 .28 
< 5.) 

|X| = 1. .20 (.17) 21 21 21 

X=5. .21 (.20) 27 .34 388 

X= 10. 21 25 33 37 


X = 15. 21 25 Jl 34 


the critical regions -12 <x = -10,1lsasx2and45 x55, 1s a6 2, 
respectively: 


Ts = .92391 + x-(—.065094 + .00073933 - x) 
+ a-(.020541 + .0020402-a + .0060327-x), x<0 
= .96410 — x-(.029325 + .0012057 - a) 
+ .0034758 - a, x>0. 


This six-term double-precision approximation executes slightly faster than the 
original eight-term approximation on the 370/168. Double-precision arithmetic is 
only modestly slower than single precision on this computer. 

Region 3: x < —12. GAMINC yields approximately six-digit accuracy for 2 = 
a = 6 and is called by GAMDRYV for such a. Larger a are reduced to the range 
1 < a = 2 (it is necessary to start the recursion with as accurate an a value as 
possible) and the forward recursion relation G(a + 1, x) = —x*-exp(—x) - 
a:-G(a, x) employed. This recursion is essentially stable in the above range, 
although accuracy deteriorates slightly for a near 15 where the maximum ob- 
served relative error of 5 x 10°° occurs. 

Table I presents the absolute value of the relative errors (multiplied by 10°) for 
selected values of a and x using GAMDRV in conjunction with the modified 
version of GAMINC described above. The exact values were found as described 
in Algorithm 435. For x > 8 the observed relative errors were always less than lt. 
Execution times of GAMDRYV for various arguments are given in Table II. 
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COLLECTED ALGORITHMS (cont.) 


ALGORITHM 
C 
C TEST DRIVER FOR FUNCTION SUBPROGRAM GAMDRV 
C FOR -EXPLIM .LE. X .LT. 1@. THE "EXACT" ANSWER IS COMPUTED BY 
C FUNCTION SUBPROGRAM SUMSER. 
C FOR X'.GE. 10. THE "EXACT" ANSWER IS COMPUTED BY. FUNCTION 
C SUBPROGRAM COMGAM. 
C 


Cc 


AAARAANAARAAANAARAAAAA 


DOUBLE PRECISION AA,XX,GAM, SERIES, DELTA 

DOUBLE PRECISION SUMSER,COMGAM 

DIMENSION A(3@) ,X(30) ,T(30, 30) 
5@ READ(5,9) NA,NX 

IF (NA .EQ. 0) GO TO 460 

READ(5,10) (ACI), 1=1,NA) 

READ(5,10) (X(J), J=1,NX) 

WRITE(6,1) NA 

WRITE(6,2) (A(I), I=1,NA) 

WRITE(6,3) NX 

WRITE(6,2) (X(J), J=1,NX) 

WRITE (6,4) 

DO 20@ J=1,NX 

DO 106 I=1,NA 
XX = X(J) 
AA = A(T) 

GAM = GAMDRV(A(1) ,X(J), IER) 
IF (IER .NE. @) WRITE(6,7) IER 
IF (X(J) .LT. 10.) SERIES = SUMSER(A(I) ,X(J)) 
IF (X(J) .GE. 16.) SERIES = COMGAM(A(I) ,X(J)) 
RELDEL = @. 
IF (SERIES .EQ. @.D@) GO TO 2¢ 


DELTA = (GAM-SERIES)/SERIES 
RELDEL = ABS(SNGL(DELTA) ) 
- -'T(I,J) = 1.D6* (DABS (SERIES-GAM) /SERIES) 
20 WRITE(6,5) A(I),X(J),GAM, SERIES, RELDEL 
100 CONTINUE 
206 CONTINUE 
WRITE (6,8) 


WRITE(6,6) (A(I), I=1,NA) 
DO 30@ J=1,NX 
") WRITE(6,2) X(J),(T(1,J), I=1,NA) 
1 FORMAT(///3Q0H NUMBER OF INPUT VALUES OF A =,13) 
2 FORMAT(/16F8. 2) 
3  FORMAT(/3@H NUMBER OF INPUT VALUES OF X =,13) 
4 


FORMAT (/45H A X GAMDRV EXACT, 
* 18H REL ERR/) 
5 FORMAT(2F8. 2, 2D18.8,E12.3) 
6 FORMAT(//9X,15F8.2/) 
7 FORMAT(/5H IER=,13/) 
8 FORMAT(//1@X,41HTABLE OF RELATIVE ERRORS OF GAMDRV X 1.E6) 
9 FORMAT(213) 
1@ FORMAT(12F6. 2) 
GO TO 5@ 
46@ STOP - 3 
LAST CARD OF TEST DRIVER PROGRAM FOR FUNCTION SUBPROGRAM GAMDRV 
END 
FUNCTION GAMDRV (A,X, IER) 
REAL A,X 
INTEGER IER 
PURPOSE: COMPUTES A MODIFIED INCOMPLETE GAMMA FUNCTION DEFINED 
AS THE INTEGRAL OF ABS(Y)**(A-1.) * EXP(-Y) FROM @ TO X, WHERE 
X MAY BE NEGATIVE. GAMDRV IS AN EXTENSION OF GAMINC (ALGORITHM 
435, CACM), AND USES IT AS AN AUXILIARY FUNCTION SUBPROGRAM. 
PRECISION: SINGLE 
ARGUMENT RESTRICTIONS: 1. .LE. A .LT. 15. 
-EXPLIM .LE. X .LT. INFINITY 
EXPLIM IS A MACHINE DEPENDENT CONSTANT USED BY GAMINC TO PREVENT 
EXPONENT OVERFLOW. IT HAS THE VALUE 2@. FOR THE 3606/37@ SERIES. 
ERROR RETURNS: IER =-1l A .LE. @. OR X .LT. -EXPLIM 


SCHOO400 
SCHOO450 
SCHOO50O 
SCHOO55@ 
SCHOO6OO 
SCHOO650@ 
SCHOO700 
SCHOO750 
SCHOO8OO 
SCHOO850 
SCHOO9IGO 
SCHOO950 
SCHO1600 
SCHO1O5@ 
SCHO110¢ 
SCH@115@ 
SCHG1200 
SCHO125@ 
SCHO1300 
SCHO1350@ 
SCHO1400 
SCHO145¢@ 
SCHO15060 
SCH@155@ 
SCHO1600 
SCHO1650 
SCHO176@ 
SCH@175@ 
SCHO1800 
SCHO1850@ 
SCHO1900 
SCHO195¢@ 
SCHO200¢6 
SCHO205@ 
SCHO2100 
SCH@215@ 
SCHO2200 
SCH@2256 
SCHO2300 
SCHG235¢@ 
SCHO2460 
SCHO2450 
SCHO2500 
SCHO2550 
SCHO2600 
SCHO265¢ 
SCHO2700 
SCHO275@ 
SCHO28060 
SCHO2856 
SCHO2900 
SCHO295@ 
SCHO3 O00 
SCHO3050 


SCHO3450 
SCHO3 560 
SCHO355@ 
SCHO3600 
SCHQ365@ 
SCHO370@ 
SCHO375@ 
SCHO3 800 
SCHO3850 
SCHO3900 
SCHO3950 
SCHO4000 
SCHO4050 
SCHO410¢6 
SCHO4150 
SCHO4200 
SCHO4250 
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COLLECTED ALGORITHMS (cont.) 


IER @ NORMAL RETURN 

IER 1 @. .LT. A .LT 1. 

IER = 15 A .GE. 15. 
FOR IER = -1 GAMDRV RETURNS THE VALUE @., WHILE FOR @. 
OR A .GE. 15. THE (INACCURATE) APPROXIMATION IS RETURNED. 


SUBROUTINES REQUIRED: DGAMMA (DOUBLE PRECISION GAMMA FUNCTION) 
AND GAMINC, WHICH IN TURN REQUIRES GAMMA (SINGLE PRECISION GAMMA 
FUNCTION). BOTH ARE COMMONLY INCLUDED IN THE FORTRAN LIBRARY OF 
FUNCTIONS. OTHER SOURCES ARE THE IMSL (INTERNATIONAL MATHEMATICAL 


«LT. A .UTs. 1, 


AND STATISTICAL LIBRARIES, INC.) AND NAG (NOTTINGHAM ALGORITHMS 
GROUP) FORTRAN LIBRARIES. 
P. 18@, IS ALSO SATISFACTORY FOR THE SINGLE PRECISION GAMMA 
FUNCTION. ALGORITHM 221, COMM. ACM 7 (MARCH 1964), P.143, 
ACHIEVES 1@ SIGNIFICANT DIGIT ACCURACY WHICH IS SUFFICIENT FOR 
THE DGAMMA REQUIRED BY GAMDRV. 


QAMQRAANAANRANRANANANAAANAAAN 


DOUBLE PRECISION DA,DX,DEXPXA,DGAM1 
DATA EXPLIM/20.0/ 
IER=0 
ASAVE = A 
IF (X .NE. 6.) GO TO 10 
GAMDRV = @. 
RETURN 
C 
16 IF (X .LT. -EXPLIM) GO TO 5@ 
Cc 
IF (A .GT. @.) GO TO 16¢ 
C A IS NOT POSITIVE OR X IS LESS THAN -EXPLIM 
56 TER = -1 
GAMDRV = @. 
RETURN 
Cc 
100 IF (A .GT. 2.) GO TO 110 
C A IS LESS THAN OR EQUAL TO 2. 
IF (A .LT. 1.) IER = 1 
GAMDRV = GAMINC(A,@. ,X) 
RETURN 


116 IF (A .LT. 15.) GO TO 21¢ 
-GE. 15. SET IER = 15 AND CONTINUE 
TER = 15 


21@ IF ((X .LT. -12.) .AND. 
IF (X .GE. 5.) GO TO 236 
C -12 .LE. X .LT. 5. GET REQUIRED VALUE FROM GAMINC. 
GAMDRV = GAMINC(A,®@. ,X) 


(A .GT. 6.)) GO TO 220 


RETURN 
C 
C REDUCE A TO RANGE 1. .LT. A .LE. 2. AND USE FORWARD RECURSION 
Cc 


22@ NRECUR = INT(A)-2 
C IF A IS NOT INTEGRAL, ONE MORE RECURSION WILL BE NEEDED 
IF (A-FLOAT(NRECUR) .GT. 2.) NRECUR = NRECUR+1 
A = A-FLOAT (NRECUR) 


SIGNX = SIGN(1.@,X) 

EXPXA = EXP(-X) * ABS(X)**A 
c 
C CALL GAMINC TO GET INITIAL VALUE FOR RECURSION 
c 


GAM1 = GAMINC(A,@. ,X) 
DO 225 K = 1,NRECUR 
GAM1 = SIGNX * (EXPXA + GAML*A 
A = At1.0 
EXPXA = EXPXA*ABS(X) 
225 CONTINUE 
GAMDRV = GAM1 
A = ASAVE 
RETURN 
CALCULATE THE COMPLEMENTARY INCOMPLETE GAMMA FUNCTION. IF A IS 
TOO LARGE, REDUCE A AND USE FORWARD RECURSION. 
IS REQUIRED SINCE SIGNIFICANT CANCELLATION OF LEADING DIGITS 
MAY OCCUR IN SUBTRACTION FROM GAMMA(A) WHEN A IS LARGE. 


aaAaAaANn 


ALGORITHM 54, COMM. ACM 4 (APRIL 1961), 


DOUBLE PRECISION 


SCHO4300 
SCHO4350 
SCHO4400 
SCHO4450 
SCHO4560 
SCHO4550 
SCHO4600 
SCHO4650 
SCHO4700 
SCHQ@4750 
SCHO4800 
SCHO4850 
SCHO4900 
SCHO4950 
SCHO5 000 
SCHO505@ 
SCHO5100 


SCH@515@ 
SCHO5200 
SCHO5250@ 
SCHO5300 
SCHO535@ 
SCHO5 400 
SCHO5450@ 
SCHO5500 
SCHO@555@ 
SCHO5600 
SCHO5650 
SCHO5 700 
SCH@575@ 
SCHO5 800 
SCHO5850 
SCHO5900 
SCHO595@ 
SCHOG6OOO 
SCHO6O5¢ 
SCHO6100 
SCHO6 15 
SCHO6 200 
SCHO6 25 
SCHO6300 
SCHO6350 
SCHO6400 
SCHO6450 
SCHG6560 
SCHO6550 
SCHO6600 
SCHO6656 
SCHO6 700 
SCHQ6750 
SCHO68O0 
SCHO685@ 
SCHG690O 
SCHQ695@ 
SCHO7 OOO 
SCHO705¢@ 
SCHO7160 
SCHO715@ 
SCHQ7 200 
SCHO7 25 
SCHO7 300 
SCHO7 350 
SCHO7 4060 
SCHO7 450 
SCHO7 5060 
SCHO7 550 
SCHO7 6900 
SCHQO765@ 
SCHO770@ 
SCHO775@ 
SCHO7 800 
SCHO7 856 
SCHO79060 
$CHO795@ 
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COLLECTED ALGORITHMS (cont.) 


Cc 


C 


236 


RANGE = 


NRECUR = @ 
~5* (X44. ) 


C TEST TO SEE IF FORWARD RECURSION IS.NECESSARY. 


C 


C 
C 


QAANRQAA 


Q 


235° 


PwWNH 


IF (A .LE. RANGE .OR. X .GE. 22.) GO TO 235 


NRECUR = INT(A-RANGE) + 2 

A = A-FLOAT(NRECUR) 

DA=A 

DX = X 

DEXPXA = DEXP(=DX) * DABS(DX)**DA 

DGAM1 = DEXPXA/(DX+(1.D@—DA) / 
(1.D@ + 1.D6/(DX+(2.D@-DA) / 
(1.D@ + 2.D@/ (DX+(3.D@-DA) / 
(1.D6 + 3.D6/(DX+(4.DO-DA) / 
(1.D@ + 4.D@/(DX+1.78D@ - .64D0* (DA-2.D0)))))))))) 

IF (NRECUR .EQ. @) GO TO 25@ 


DO FORWARD RECURSION FOR COMPLEMENTARY INCOMPLETE GAMMA 


240 
250 


LAS 


DO 24@ K=1,NRECUR : 
DGAM1 = DGAM1*DA + DEXPXA 
DA = DA+1.D¢ 
DEXPXA = DEXPXA*DX 

CONTINUE 

GAMDRV = DGAMMA(DA) - DGAMI 

A = ASAVE 

RETURN 

T CARD OF FUNCTION SUBPROGRAM GAMDRV 

END 


FUNCTION. GAMINC (A, X1, X2) 


MODIFIED VERSION OF ALGORITHM 435, MODIFIED INCOMPLETE GAMMA 


FUN 
MOD 


CTION, TO BE USED WITH FUNCTION SUBPROGRAM GAMDRV. THE 


IFICATIONS ARE DESCRIBED IN THE ACCOMPANYING TEXT. 


DOUBLE PRECISION DA,DZ,T6 

DATA EXPLIM/2@. / 

DATA ZERO/@./,ONE/1./,FIVE/5./,TWELVE/12./ 
Z = Xl 

SGN = @. 

TIM = -1. 


EXPDIF = 1. 


19 
USE 


1 


20 


USE 


1 + DA*(.26541D-1 + .204@62D-2*DA + .60327D-2*DZ) 
IF (Z .GT. @.) T6 = .96410D@ - DZ*(.29325D-1 + .12057D-2*DA) 

it + .34758D-2*DA 

GAM1 = EXPDIF*DZ/DA * DBLE(AZ)** (DA-1.D@) 


IF (Z .NE. ZERO) GO TO 1¢ 

GAM1 = @. 

SGN = SGN + TIM 

GO TO 40 

IF (Z .LT. FIVE) GO TO 2¢ 

EQUATION 10 (SEE REFERENCE) 

GAM1 = -EXPDIF * Z**A/(Z+(1.-A)/(1.41./(Z+(2.-A) /(1.42. 

/(Z+(3.-A) /(1.4+3./(Z4+1-.7))))))) 

GO TO 4@ 

AZ = ABS(Z) 

IF (Z .LT. -TWELVE) GO TO 3@ 

SGN = SGN + TIM 

IF (AZ .GT. ONE) GO TO 25 

EQUATION 17 

GAM1 = EXPDIF*Z/A* DBLE(AZ) **(A-1.) 
* (1.000000 +2/(At1.) *(.9999999+Z/(At2.) 
*(.9999999 +Z/(A+3.) *(1.600@08+Z/(A+4.) 
*(1.000005 +Z/(A+5.) *(.9994316+Z/ (A+6.) 
*(.9995587 +Z/(A+7.) *(1.031684+4Z/ (A+8.) 
* 1.028125)))))))) 

GO TO 40 


EQUATIONS 11 AND 12. 


oO 
> 
"it 


A 
Z 


IF (Z .LT. @.) T6 


EVALUATION IS DONE IN DOUBLE PRECISION. 


-92391D@ + DZ*(-.65094D-1 + .73933D-3*DZ) 


SCHO8OOO 
SCHO8050 
SCHG8100 
SCHO8150 
SCHO8200 
SCHO825@ 
SCHO8300 
SCHO8350 
SCHO8400 
SCHO845@ 
SCHO8500 
SCH@855@ 
SCHO8600 
SCH@865@ 
SCHO8700 
SCHO8750 
SCHO8800 
SCHO8850 
SCHO890@ 
SCHO8950 
SCHOIGOO 
SCHOI@5@ 
SCHO9100 
SCHO915@ 
SCHO9 200 
SCH@9 250 
SCHO9I 3060 
SCHQ9350 
SCHO9 400 
SCHO9450 
SCHG9850 


SCHO99GO 
SCHO9950 
SCH1O000 
SCH10050 
SCH10100 
SCH10150 
SCH10260 
SCH10625@ 
SCH10300 
SCH1035@ 
SCH1046¢ 
SCH16450@ 
SCH10500 
SCH10550 
SCH10600 
SCH1065@ 
SCH10700 
SCH1075@ 
SCH108060 
SCH10850 
SCH1096¢ 
SCH1095@ 
SCH11000 
SCH11050@ 
SCH111060 
SCH11150@ 
SCH11200 
SCH1125@ 
SCH11300 
SCH11350 
SCH1140@ 
SCH1145@ 
SCH11500 
SCH1155@ 
SCH11690@ 
SCH11650@ 
SCH11700@ 
SCH1175@ 
SCH1180¢ 
SCH1185¢@ 
SCH11900 
SCH1195@ 
SCH1206¢ 
SCH1205¢@ 
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COLLECTED ALGORITHMS (cont.) 


Cc 


Cr 0y Ca 


QNANMNANAAANA 


OVO COOL Ora 


/(1.D@- DA *DZ/( DA *(DA+ 1.D¢+1.D6*DZ/((DA+ 2.D0) 
*(1.DO-(DA+1.DO)*DZ/((DA+ 2.D6)* (DA+ 3.DG+2.D@*DZ/((DA+ 4.D0) 
*(1.D@—(DA+2.D@)*DZ/ ((DA+ 4.D6)*(DA+ 5.DO+3.DO*DZ/((DA+ 6.DO) 
* (1. DQ@—(DA+3.D@)*DZ/((DA+ 6.D6)*(DAt+ 7.D4+4.D0*DZ/((DA+ 8.D0) 
*(1.D@—-(DA+4.DO)*DZ/((DA+ 8.DQ0)* (DA+ 9.DG+5.DO*DZ/ ((DA+10.DQ) 


Ou fFWwWNhN 


* Té )))) )))) 0D) 9))) DD) 
GO TO 40 
USE EQUATION 18 AND SHANKS El PROCESS ONCE 


30 GAM1 = -EXPDIF*AZ** (A-1.)*(1.+(A-1. )*(1.4+(A-2.)* 


1 (1.4+(A=3.)*(1.+(A-4. )*(1.4+(A-5.) /(Z-At6.))/Z)/Z)/Z)/Z) 


49 IF (TIM .GT. ZERO) GO TO 55 


GAMINC = GAML 
IF (ABS(X1-X2) .GT. EXPLIM) GO TO 5@ 
IF TRUE, CONTRIBUTION AT X2 IS .LT. 1.E-7 * CONTRIBUTION AT Xl, 
PROVIDED X2 .GT. Xl. 

Z= X2 

EXPDIF = EXP(X1-X2) 

TIM = 1. 


5@ GAM1 = @. 
55 GAMINC = GAM1 -— GAMINC 
IF (SGN .NE. ZERO) GAMINC = GAMINC — SIGN (GAMMA(A) *EXP(X1) ,SGN) 


RETURN 
LAST CARD OF FUNCTION SUBPROGRAM GAMINC 
END 


DOUBLE PRECISION FUNCTION COMGAM(A,X) 


COMPUTES THE INCOMPLETE GAMMA FUNCTION BY SUBTRACTING A 
CONTINUED FRACTION EXPANSION FOR THE COMPLEMENTARY INCOMPLETE 
GAMMA FUNCTION FROM DGAMMA(X). 

REFERENCE: 
FUNCTIONS. NATIONAL BUREAU OF STANDARDS, U.S. GOV. PRINT. OFF., 
WASHINGTON D.C., 1967, P. 263, FORMULA 6.5.31 


DOUBLE PRECISION DA,DX,TK 
DA =A 
DX=X 
TK=0.DO 
LAST=2@ 
DO 1@ K=1,LAST 
FK = FLOAT (LAST+1-K) 
TK = (DBLE(FK)-DA) /(1.DQ@+DBLE (FK) / (DX+TK) ) 
1@ CONTINUE 
TK = DEXP (-DX)*DX**DA/ (DX+TK) 
COMGAM = DGAMMA(DA) - TK 
RETURN 
LAST CARD OF FUNCTION SUBPROGRAM COMGAM 
END 


DOUBLE PRECISION FUNCTION SUMSER (A,X) 


COMPUTES THE INCOMPLETE GAMMA FUNCTION FOR -EXPLIM .LE. X .LT. 10. 


THE SERIES IS TRUNCATED AS DESCRIBED BY FULLERTON. 
REFERENCE: 
FUNCTIONS. NATIONAL BUREAU OF STANDARDS, U.S. GOV. PRINT. OFF., 
WASHINGTON D.C., 1967, P. 262, FORMULA 6.5.29 


DOUBLE PRECISION SUM,TERM,X2, ISIGN 
DOUBLE PRECISION XX,AA 
IF (X .NE. @.) GO TO 5 
SUMSER = @.D@ 
RETURN 
5 XX =X 
AA=A 
= 1.D0/AA 
SUM = TERM - XX/(AA+1.D0) 
ISIGN=1.D@ 
X2 = XX*XX 
DO 1@ N=2,100 
FN = FLOAT(N) 
TERM = X2*ISIGN/(2.D@* (AA+DBLE (FN) ) ) 


SCH12160 
SCH1215@ 
SCH12200 
SCH1225@ 
SCH12300 
SCH1235@ 
SCH12400 
SCH1245@ 
SCH1250@ 
SCH1L255@ 
SCH1260@ 
SCH1265@ 
SCH12700 
SCH1275@ 
SCH128060 
SCH1285@ 
SCH12900 
SCH12950 
SCH13000 
SCH13050 
SCH131060 
SCH1315@ 
SCH13 2060 
SCH13250 
SCH13300 
SCH1335@ 
SCH1375@ 


SCH13800 
SCH1385@ 
SCH1390¢ 
SCH13950 
SCH140600 


ABROMOWITZ, M.,AND STEGUN, I.A.. HANDBOOK OF MATHEMATICALSCH1405@ 


SCH1410@ 
SCH14150@ 
S$CH14206 
SCH14250@ 
SCH14300 
SCH14350 
SCH144066 
SCH1445@ 
SCH14560 
SCH14550 
SCH1460@ 
SCH14650 
SCH14700 
SCH1475@ 
SCH14800 
SCH1485¢@ 
SCH1L490@ 


SCH1530@ 
SCH15350 
SCH15400 
$CH154506 


ABROMOWITZ, M.,AND STEGUN, I.A.. HANDBOOK OF MATHEMATICALSCH15500 


SCH1L5550 
SCH15600 
$CH1565@ 
SCH1570@ 
§$CH15750 
SCH15 800 
SCH1585@ 
SCH15900 
$CH1595@ 
SCH16000 
§CH16050 
SCH1619@ 
SCH16150 
SCH1620@ 
$CH1625@ 
$CH16360 
5$CH1635@ 
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SUM = SUM + TERM 
IF (DABS(TERM) .LT. 2.D-9 * DABS(SUM)) GO TO 26 
ISIGN = -ISIGN 
X2 = X2*XX/(DBLE(FN)+1.D@) 
1@ CONTINUE 
2@ SUMSER = DABS(XX)**AA * SUM 
IF (X .LT. @.) SUMSER = -SUMSER 
RETURN 
C. LAST CARD OF FUNCTION SUBPROGRAM SUMSER 
END 


. 8.9 11.0 14.5 
-18. -14. -12. “8. <4. -2. -.5 -5 2. 


SCH1640@ 
SCH16450@ 
SCH16500 
SCH16550@ 
SCH16600 
SCH16650@ 
SCH1670@ 
SCH1675@ 
SCH168¢0 
SCH1685¢ 


SCH17250@ 
SCH1730@ 
SCH17350@ 
SCH17400 
SCH17450 
SCH17500 
SCH17556 
SCH17600@ 
SCH1765@ 
SCH17700 
SCH1775@ 
SCH1L780¢ 
SCH1785@ 


SCH1L7900 


SCH1795@ 
SCH180600 


435-P 10- 


0 


COLLECTED ALGORITHMS FROM CACM 
436-P 1- 0 


Algorithm 436 foe 
Product Type Trapezoidal ee) = ta) 


e FC2) = FNCX) 
Integration [D1] 6c2) = GNX? 
AG = 0-D0 
W. Robert Boland [Recd. 10 Dec. 1970 and 14 May 1 ag = AG & Ane sony + cK 
1971} : VINT = H ‘ VINT / 6-DO 
° . RETURN 
Department of Mathematics, Clemson University, END 


Clemson, SC 29631 


Key Words and Phrases: numerical integration, product type 
quadrature, trapezoidal integration 

CR Categories: 5.16 

Language: Fortran 


Description 
This subroutine uses the product type trapezoidal rule com- 
pounded x times to approximate the value of the integral 


ab 
| fdg(x) dex. 


The approximating sum is 


he 2 1\/g@+ VU — 1h) 
AS Va + U = Din, fla-+s00 (7 )( heres ), 


where h = (6 — a)/n. Note that if g(x) = 1 (or f(x) = 1), the 
rule reduces to the regular trapezoidal rule. The procedure was 
proposed and discussed by Boland and Duris in [1]. 

The subroutine was written in Fortran using double precision 
arithmetic and was checked on an IBM 360 Model 50. The calling 
parameters for the routine are as follows. A is the name for the 
lower limit of integration, and B is the name for the upper limit. 
N is the number of times the formula is to be compounded. The 
basic interval [A, B] is subdivided into N subintervals each of 
length (B — A)/N and the rule is applied to each subinterval. 
FN and GN are names of double precision FUNCTION subpro- 
grams which evaluate the functions f(x) and g(x), respectively. 
These are to be supplied by the user. The result is stored in VINT. 

There are no machine dependent parameters. 


References 
1. Boland, W.R., and Duris, C.S. Product type quadrature 
formulas. BIT 11, 2 (1971), 139-158. 


Algorithm 
SUBROUTINE PTRAPCAs Bs Ns FN» GNs VINT) 


THIS SUBROUTINE USES THE PRODUCT TYPE TKAPEZ@IDAL RULE 
COMPOUNDED N TIMES TO APPROXIMATE THE INTEGRAL FROM A T@ B 
@F THE FUNCTION FNCX) * GNCX)- FN AND GN ARE FUNCTION 
SUBPROGRAMS WHICH MUST BE SUPPLIED BY THE USEK- THE 
RESULT IS STORED IN VINT- 


aAanaaaad 


DOUBLE PRECISION As AGs AMC2s2)5 Bs FC2)2 FNs GC2)s 
* GN» Hs VINTs Xs DBLE 

DATA AMC1s1)» AMC2s2) 72 *€ 2eD0/2 AMC 152) AMC2s1) 
* 72 *® 16D0/ 

H = (B - A) / DBLECFLOATCN)) 
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e 
A c 
Algorithm 37 DOUBLE PRECISION A» AG» AMC323)5 Bs FC3)s FNe GC3)e 
* 


GN» H» VINT» XC€2)s DBLE 
DATA AMC1s1)>> AMC393) 72 * 46D07» AMC122)» AMC221)> 
* AMC2s3)5 AMC322) 44 * 2eD0/s AMC 153)» AMC3s1) 
* 72 # -teDOZs AMC2s2) 416eD0/ 


Product Type Simpson’s H = (B - A) / DBLECFLOATCN)) 


XC1) = A + H / 2.D0 


2 xX(2) = A+H 
Integration [D1] oe, 
FC3) = FNCA) 
GC3) = GNCA) 
' DQ31= ls WN 
W. Robert Boland [Recd. 10 Dec. 1970 and 14 May Fol) = FC3) 
G1) = GC3) 
1971] DQ 1 J = 162 
° ‘ . FC J+th> = FNCXC 
Department of Mathematics, Clemson University, Gctid = GNCXCI)) 
Clemson, SC 29631 — wera gaciea | 
AG = 0-DO 
D@ 2K = ts 3 
2 AG = AG + AMCJsK) * GCK) 
3 VINT = VINT + FCJ) * AG 
Key Words and Phrases: numerical integration, product type RETLRN 
quadrature, Simpson’s rule hs 
CR Categories: 5.16 
Language: Fortran 
Description 


This subroutine uses the product type Simpson’s rule com- 
pounded times to approximate the value of the integral 


b 
i foneo'de: 


The approximating sum is 


h nm 
o 2d (fla+ G — Ih), fla + G — Ph), fla + jh) 


4 2 -1\ /glat+ G— 1A) 
216 2)ig¢a@+G—HA ], 
-l 2 4 gla + jh) 


where 4 == (6 — a)/n. Note that if g(x) = 1 (or f(x) = 1), the rule 
reduces to the regular Simpson’s rule. The procedure was proposed 
and discussed by Boland and Duris in [1]. 

The subroutine was written in Fortran using double precision 
arithmetic and was checked on an IBM 360 Model 50. The calling 
parameters for the routine are as follows. A is the name for the 
lower limit of integration and B is the name for the upper limit. 
N is the number of times the formula is to be compounded. The 
basic interval [4, B] is subdivided into N subintervals each of 
length (B — A)/N and the rule is applied to each subinterval. 
FN and GN are names of double precision FUNCTION subpro- 
grams which evaluate the functions f(x) and g(x), respectively. 
These are to be supplied by the user. The result is stored in VINT. 

There are no machine dependent parameters. 


References 
1. Boland, W.R., and Duris, C.S. Product type quadrature 
formulas. BIT 1], 2 (1971), 139-158. 


Algorithm 
SUBROUTINE PSIMPCAs Bs Ns FN»GNs VINT) 


THIS SUBROUTINE USES THE PRODUCT TYPE SIMPSON RULE 
COMPOUNDED N TIMES TO APPROXIMATE THE INTEGRAL FROM A TG B 
OF THE FUNCTION FNCX) * GNCX). FN AND GN ARE FUNCTION 
SUBPR@GRAMS WHICH MUST BE SUPPLIED BY THE USER. THE 

KESULT IS STORED IN VINT. 


aagqagaaaa 
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Algorithm 438 


Product Type Two-point 
Gauss-Legendre-Simpson’s 
Integration [D1] 


W. Robert Boland [Recd. 10 Dec. 1970 and 14 May 
1971] 

Department of Mathematics, Clemson University, 
Clemson, SC 29631 


Key Words and Phrases: numerical integration, product type 
quadrature, Gaussian quadrature,.Simpson’s rule 

CR Categories: 5.16 

Language: Fortran 


Description 

This subroutine uses the product type two-point Gauss- 
Legendre-Simpson’s rule compounded » times to approximate the 
value of the integral 


b 
/ f(x)g(x) dx. 


The approximating sum is 


4 1-31! 


D (fla + i — § — 3°2/6)K), fla + Gj — 4 + 3""/6)h)) 
bape) 8+ GA J, 


h 
2 
14.312 
Q —31e 
g(a + jh) 


where h = (b — a)/n. Note that if g(x) = 1, the rule reduces to 
the regular two-point Gauss-Legendre rule, while if f(x) = 1, it 
reduces to the regular Simpson’s rule. The procedure was proposed 
and discussed by Boland and Duris in [1]. 

The subroutine was written in Fortran using double precision 
arithmetic and was checked on an IBM 360 Model 50. The calling 
parameters for the routine are as follows. A is the name for the 
lower limit of integration and B is the name for the upper limit. 
N is the number of times the formula is to be compounded. The 
basic interval [A, B] is subdivided into N subintervals each of 
length (B — A)/N and the rule is applied to each subinterval. 
FN and GN are names of double precision FUNCTION subpro- 
grams which evaluate the functions f(x) and g(x), respectively. 
These are to be supplied by the user. The result is stored in VINT. 

There are four machine dependent constants. These are: 


(i) 1 + 3% & 2.732050807568877, 

(il) 1 — 3!/2 A —0.7320508075688773, 
(il) 3 — 32/6 = 0.2113248654051871, and 
(iv) $ + 3!?/6 = 0.7886751345948129. 


The first constant is assigned to AM(1, 1) and AM(2, 3), the second 
to AM(1, 3) and AM(2, 1), while the third and fourth are used in 
the calculation of X(1) and X(2), respectively. 


) ‘gla + G — 1A) 


References 
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1. Boland, W.R., and Duris, C.S. Product type quadrature 
formulas. BIT 71, 2 (1971), 139-158. 


Algorithm 


SUBROUTINE P2PGSCA»s Bs 


BY THE USER. 


aaaaqagna 


DOUBLE PRECISION As AGs 


N» FNs GN» VINT) 


THIS SUBROUTINE USES THE PRODUCT TYPE TW&-PGINT GAUSS- 
LEGENDRKE-SIMPSON RULE C@MP@UNDED N TIMES TO APPKOXIMATE 
THE INTEGRAL FROM A T@ B OF THE FUNCTION FNCX) * GNCX)» 
FN AND GN ARE FUNCTION SUBPR@GKAMS WHICH MUST BE SUPPLIED 
THE RESULT IS STORED IN VINT. 


AM(2s3)e Bs FC2)5 FN» GOI)» 


* GNs Hs VINTs XC2)s YC2)» DBLE 
DATA AMC1ls lds AMC2s3) /2 * 26732050807568877D07, 
* AMC1s2)2 AMC222) /2 * 4¢D0/e AMC 153)5 AMC251) 
* 72 * -6732050807 5688773007 
H = (B - A) / DBLECFLOATCN) >) 
KCL) = A + 62113248654051871D0 * H 
KC2) = A + 67886751345948129D00 * H 
YC!) = A + H Z 2-D0 
Y¥C2) =A+H 
VINT = 0-DO 
GC3) = GNCA) 
D9 3 I = ts N 
Gct)d = GC3) 
D@1J55 ts 2 
FCJ) = FNCACJI)) 
GCJ+1) = GNCYCJ)) 
XCJ> = XCJD + H 
1 YOJ) = YOU) + H 
DO 3 J ts 2 
AG = 0-DO 
D@2K = ls 3 
2 AG = AG + AMCJsK) * GCK) 
3 VINT = VINT + FC J) * AG 
VINT = H * VINT / 12-D0 
RETURN 
END 
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Algorithm 439 


Product Type Three-point 
Gauss-Legendre-Simpson’s 
Integration [D1] 


W. Robert Boland [Recd. 10 Dec. 1970 and 14 May 
1971] 

Department of Mathematics, 
Clemson, SC 29631 


Clemson University, 


Key Words and Phrases: numerical integration, product type 
quadrature, Gaussian quadrature, Simpson’s rule 

CR Categories: 5.16 

Language: Fortran’ 


Description 

This subroutine uses the product type three-point Gauss- 
Legendre-Simpson’s rule compounded x times to approximate the 
value of the integral 


b 
i f(x)g(x) dx. 
a 
The approximating sum is 


5 Mla + U = 4 — 46/5), fla +  — DM, 
fla+ UV — 4+ 4/99) 


2 + 65/3)"%) 1 3 - (5/3)1)\ /g(a + G — 1)A) 
0 4 0 gla + YG — 4A) |, 
2(1 — (5/3)"2) 1 31 + (5/3)"), gla + jh) 


where h = (6 — a)/n. Note that if g(x) = 1, the rule reduces to 
the regular three-point Gauss-Legendre rule, while if f(x) = 1, 
it reduces to the regular Simpson’s rule. The procedure was pro- 
posed and discussed by Boland and Duris in [1]. 

The subroutine was written in Fortran using double precision 
arithmetic and was checked on a IBM 360 Model 50. The calling 
parameters for the routine are as follows. 4 is the name for the 
lower limit of integration and B is the name for the upper limit. 
N is the number of times the formula is to be compounded. The 
basic interval [A, B] is subdivided into N subintervals each of length 
(B — A)/N and the rule is applied to each subinterval. FN and 
GN are names of double precision FUNCTION subprograms 
which evaluate the functions f(x) and g(x), respectively. These are 
to be supplied by the user. The result is stored in VINT. 

There are four machine dependent constants. These are: 

(i) $01 + (5/3)12) ee 1.718245836551854, 

(ii) (1 — (5/3)"*) e —0.2182458365518542, 
(iti) 4(1 — (3/5)") e& 0.1127016653792583, and 
(iv) $0. + (3/5)"/*) = 0.8872983346207417. 


The first constant is assigned to AM(1, 1) and AM(2, 3), the second 
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to AM(1, 3) and AM(2, 1), while the third and fourth are used in 
the calculation of X(1) and X(2), respectively. 


References 
1. Boland, W.R., and Duris, C.S. Product type quadrature 
formulas. BIT 11, 2 (1971), 139-158. 


Algorithm 
SUBROUTINE P3PGS ¢ As Bs Ne FN»s GN» VINT) 


THIS SUBKQUTINE USES THE PRODUCT TYPE THREE-P@INT GAUSS- 
LEGENDRE-SIMPSON RULE C@MPQUNDED N TIMES TO APPROXIMATE 
THE INTEGRAL FROM A TO B GF THE FUNCTION FNCX) * GNCA)D- 
FN AND GN ARE FUNCTIGN SUBPROGRAMS WHICH MUST BE SUPPLIED 
BY THE USER» THE RESULT IS STORED IN VINT. 


aanaaaa 


DOUBLE PRECISION As AG» AMC2s3)s Bs FC2)s FN» GC3)> 
* GNs Hs VINTs X€2) » YC2)» DBLE 
DATA AMC 1912s AMC2,3) /2 * 1+718245836551854D0/> 
* AMC 122) AMC2,2) £2 * 1eDO/s AMC ESI)» AMC2s1) 
* 72 * ~02182458365518 542007 

H = (B = A) / DBLECFLOATCN)) 


X€1) = A + «1127016653792583D0 * H 
X€2) = A + 28872983346207417D0 * H 
YCl) = A + H 7 2-D0 
Y¥C2) =A+H 
VINT = 0-00 
GC3) = GNCA) 
D@3 15 ts N 
AG = FNCYC1)) 


GO1) = G6C3) 

OG 1 J = ls 2 
FCJ) = FNCACJI)) 
GCJ+id = GNCYCJ)) 
XCJ) = XCJ) + H 


1 YoU) = ¥CJ) + H 
VINT = VINT + AG * 4.-D0 * G(2) 
DO 3 J = 122 
AG = 0-DO 
DO 2 K = te 3 
2 AG = AG + AMC JsK> * GCK) 
3 VINT = VINT + FCJ>) * AG 
VINT = H * VINT 7 9-DO 
RETURN 
END 
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Algorithm 440 


A Multidimensional 
Monte Carlo Quadrature with 


Adaptive Stratified 
Sampling [D1] 


L.J. Gallaher (Recd. 10 Dec. 1970, 20 July 1971) 
Rich Electronic Computer Center, Georgia Institute 
of Technology, Atlanta, GA 30319 


Key Words and Phrases: Monte Carlo quadrature, stratified 
sampling, adaptive quadrature, sequential stratification 

CR Categories: 5.16, 5.5 

Language: Algol 


Description 
This procedure evaluates the n-dimensional integral 


fe by bo bn 
i v(x) dx = | i ff v(x, X2, °°*, Xn) dtn +++ dtndx; 
V (a,b) a, Yao an 


by the Monte Carlo method. The variance reduction scheme used 
here is a form of stratified sampling. 

The advantages of stratified sampling are well known [1], and 
the concept of optimum stratification is discussed in most text books 
on Monte Carlo methods [2, 3, 4]. The advantages of adaptive 
quadrature are also well known, and many such algorithms have 
been published in Communications and elsewhere [5, 6, 7]. Com- 
bining adaptive quadrature with stratified sampling is a straight- 
forward process [8, 9]. 

The workings of this procedure are somewhat similar to 
Algorithm 303 [6]. Algorithm 303 is one-dimensional, and while 
it can be used for multidimension integrals by recursive calls, for 
more than approximately six dimensions the number of evaluations 
of the integrand becomes intolerable. The goal of the algorithm 
given here is to try to overcome this defect of Algorithm 303 and 
other algorithms like it. 

The procedure works as follows: 

1. A set of samples is taken, uniformly stratified throughout the 
entire volume being integrated. 

2. Based on the variance in these samples, a decision is made as 
to whether more samples are needed. 

3. If more samples are needed, the volume is cut in half and the 
entire procedure (but with fewer samples) is repeated on each 
half, recursively, the halvings being repeated as required. The 
choice of axis for the halving is based on samples of the gradient. 

The result of this process is that the overall stratification is not 
uniform, but approaches optimum as more and more samples are 
taken, since more halvings (thus more samples) are taken in the re- 
gions of high variance. 

A certain amount of caution must be used in the choice of the 
input parameter m (m + nis the number of samples taken initially). 
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If the function being integrated is reasonably smooth, relatively low 
values of m (say 5 to 10) are satisfactory. If v(x) is known to have 
sharp peaks, ridges, valleys, or pits, then large values of m will be 
necessary in order to avoid: missing these high and low spots. A 
rough rule is that m should be inversely proportional to the error 
tolerance and proportional to the logarithm of volume of anoma- 
lous regions. If V4 is the fractional volume of the anomalous re- 
gions and £, is the relative error tolerance, then the empirical rule 
m = (—2 In(V4))/E, has proved satisfactory. For this quadrature 
algorithm tc be useful, the results should. be insensitive to the users 
choice of m, and this has been observed provided m is not chosen 
too small. (This difficulty about the occasional need to choose m 
shrewdly is characteristic of all adaptive quadrature schemes, 
whether Monte Carlo or ‘“‘exact’’ methods such as Romberg, 
Simpson, or others.) 

As a test of this procedure, 100 evaluations were made of the 
volume of 1/32 of a hypersphere in five dimensions (in rectangular 
coordinates), i.e. 


rok pk pr {if Dy x? > R* then 0) 
fled 


1<t<4 | 

{ dx, dx, dx; dx, ; 
else (R2 — >> x,2)12 | 

with 3% accuracy requested. A histogram is given below of the 

values obtained. 


1<i< 


Number of . 

occurrences 4 0 8 14 18 16 20 14 5 1! 
+++ —_} —_4— 

Vobs/tezact 0.94 0.95 0.96 0.97 0.98 0.99 1.00 1.01 1.02 1.03 1.04 


Here /,3, is the value observed, /ezace is the correct value. The initial 
value of m was 120, and the average number of function evaluations 
per integral was 1427. The standard error for the 100 evaluations 
was approximately 2%. For corresponding accuracy, about 4.5 
times as many samples would have been needed by unstratified uni- 
form sampling. 

Finally it should be pointed out that the results given by adap- 
tive stratification are not entirely unbiased in the usual sense of the 
Monte Carlo method. There is, in fact, a biasing in favor of regions 
having low values of the magnitude of the gradient. However, this 
bias should normally be expected to be much smaller than the re- 
quested error tolerance. 

Acceptable random number generators for this algorithm may 
be found in [10]. 
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Algorithm 
real procedure guadmc (n, a, x, b, vx, esq, m, Vab, rn); 
value 1, esq, m, Vab; a 
integer n, m; real vx, esq, Vab, rn; 
array a, x, b; 
comment The procedure parameters are: 
n- number of dimensions, n > 1 
a — array of n lower bounds 
x — array of n position coordinates of which v(x; , x2, ... 
a function, x is called by name 
6 — array of n upper bounds (it is not required that 5; > a;) 


vx ~ function to be integrated, vx must be a function of the array 


x (Jensen’s. device) and be called by name . 
esq — Square of the absolute error tolerance for the quadrature 


m — the number of samples to be taken at the first level is m + x, 


man 
Vab — volume being integrated, i.e. Vab = Thi<i<n | (6: — a) | 


rn ~ procedure to give a new random number uniform on the 


open interval zero to one (0 < rn < 1) each time referenced, 
called by name. 


All of these parameters are input parameters to be supplied by 


the user. 
Some of the local variables of this procedure are: 
vbar — average value of v(x) for m + # samples, i.e. 


1 


m+ Hh li<i<mtn 


v(x;) 


v= 


vsqbar -- average value of v(x)? for m + n samples, i.e. 


_ I 
Wea we) 
mM -+ Nn 1<izmin 


ssq — the square of the standard error of the mean (of the integral) 
for m + n samples, i.e. 


‘ (v? = p2) ‘ 


Oe 


vi — value of v(x) at ith sample, i.e. v(x,) 


vip — a value of v(x) such that 2 | vip — vi| is a sample of the 


magnitude of the ith component of the average normalized 
gradient, 1 <i<an 

it — vector of shuffled integers 1 to m 

j - array of » different vectors of shuffled integers 1 to m used in 
constructing the (uniform) stratification 

cl — point on the /th axis that divides the volume of integration in 
half for the next recursive level, i.e. cl = (6{/] — af/})/2, 


/ - index of the axis having the largest in magnitude sample of 


the component of the average normalized gradient. 
end of comment; 


, Xn) IS 


$$ 2 
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for kK := 1 step | until 7 do 
begin 
h|k] := (b[k] — alk])/m; 
for i := 1 step 1 until m do 
begin 
ir := entier (rn X m) +1; 
comment 0 < rn < 1; 
itemp := iti); it{i] := it{ir); itlir) := itemp; 
end; 
for i: 
end; 
l:=1; 
vsqgbar := vbar := gm:= 0; 
for i := 1 step 1 until do 
begin 
for k := 1 step 1 until 1 do 
x|k) := alk) + Gilk, i] — rn) X Alk); 
vi t= WX; 
vbar := vbar + vi; 
vsqbar := vsqbar + vi 7 2; 
if i < then 
begin 
comment Sample the gradients; 
x[i] := xlé] + abs(b[i] — ali])/2x 
(if x{i] < (d|é] + ali})/2 then 1 else —1); 
vip := Vx; . ; 
vbar := vbar + vip; 
vsqbar := vsqbar + vip 7 2; 
if gm < abs(vip — vi) then 
begin 
1:= i; gm := abs(vip — vi); 
end ; 
end; 
end; 
vbar := vbar/(m + n); 
vsqbar := vsqbar/(m+n); 
ssq.:= Vab ¢ 2 X (vsqbar — vbar ¢ 2)/(m+n—1); 
end; 
if ssq < 2 X esq then quadmc := vbar X Vab else 
begin 
real temp, cl, al, bl; 
m:= mX 0.707; 
ifm < ssq/esq then m := ssq/esq; 
comment The author is indebted to the referee’s 
discussions pointing out the significance of maintaining 
m 2 ssq/esq; 
esq := esq X ssq/(ssq — esq); 
al := a{l|; bl := bff; 
bl/| := cl := (61 + al)/2; 
temp := quadmce(n, a, x, b, vx, esq/2, m, Vab/2, rn); 
bli] := bl; all) := el; 
temp := quadme(n, a, x, b, vx, esq/2, m, Vab/2, rn) + temp; 
a{/| := al; 
guadmc := (temp X ssq + esq X vbar X Vab) /(ssq+-esq) ; 
end; 
end of quadmc 


1 step 1 until m do /{k, i) := itl/); 


a: ill ill 


begin 

integer /; real vbar, ssq; 

ifm <nthen m := 2; 

begin 
real gm, vi, vip, vsqbar; 
integer itemp, ir, k, i; 
array A{1:7]; 
integer array j{1:n, 1:m), it[1:m); 
for i := 1 step 1 until m do it{i] := i; 
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Algorithm 441] 


Random Deviates from the 
Dipole Distribution [G5] 


Robert E. Knop [Recd. 12 Jan. 1971, 7 May 1971, 23 
Aug. 1971, and 8 Mar. 1972} 

Department of Physics, The Florida State University, 
Tallahassee, FL 32306 


Key Words and Phrases: random number, probability density, 
probability distribution, Dipole distribution, Cauchy distribution, 
simulation, Monte Carlo 

CR Categories: 3.17, 5.5 

Language: Fortran 


Description 

The function subprogram D/POLE returns a random deviate 
—« < z <o sampled from the two parameter (R? < 1, a arbi- 
trary) family of density functions: 


f(z) = 1/(r(1+z?)) 
+ R?X((1—z*) Xcos(2a) + 2%zXsin(2e))/ (eX (1+z2)?) 
The cumulative distribution function is: 


F(z) = (1/2) 4- (l/r) X tan“(z) 
+ RX (zXco(s(2a -— sin(2a))/(mX (1+z?)) 


Densities of this type commonly occur in the analysis of 
resonant scattering of elementary particles. We note that when 
R = 0 we have the Cauchy [1] or Breit-Wigner [2] density. When 
R = 1 and a = O we have the single channel dipole density.! 
The dipole density with free parameters has been proposed to 
describe multichannel resonant scattering [3]. 

The algorithm begins by sampling the random vector (x, y) 
from a density uniform over the unit disk. The center of the unit 
disk is then displaced from the origin by the transformations 
u = x + RXcos(a) and v = y + RXsSin(a). Letting u = rXcos(6) 
and v = rXsin(@) we can find the marginal density of @: 


2 2 
f(0) = 1/(2m) x (ke ds + a) 
0 0 


where the limits of integration for r are given by: 
r.(0) = RXcos(@—a) + (1— R*Xsin?(0—a))}. 
The marginal density of @ is thus: 

f(@) = (1 + R’cos(2X (6—a)))/m 


for —21/2 < 6 < 2/2. The transformation z = tan(@) = v/u then 
yields the dipole density function. Other densities which could be 


This. work was supported in part by the U.S. Atomic Energy 
Commission. 

1The density is named after the analytic property of having 
poles of order 2 in the complex plane. See [2]. 
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easily sampled by computing rational functions of uw and y are 
suggested by transformations such as z = tan*(@), sin®(6), sin(2X6), 
or 1/| sin(2<6)| — 1. 

Function DIPOLE has two arguments which must be calcu- 
lated by the calling program, 4 = RXcos(a) and B = RXsin(a). 
DIPOLE calls the function R11(D) which must return a random 
deviate from the uniform distribution over the interval (—1,1). 
D represents a dummy argument. 


The author wishes to express his gratitude to Prof. R.G. Glasser 
of the University of Maryland for comments concerning this 
algorithm. 
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Algorithm 


FUNCTION DIP@LECAsB) 
10 xX = RIICD) 
Y = RIICD) 
IFC 1 eO-X*#X-Y*¥Y) 10210520 
20 DIPOLE 2 CY+B)/(X+A) 
RETURN 
END 
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Algorithm 442 
Normal Deviate [S14] 


G.W. Hill and A.W. Davis [Recd. 20 Jan. 1971 and 2 
Aug. 1971] 

C.S.I.R.O. Division of Mathematical Statistics, Glen 
Osmond, Sth. Australia 


Key Words and Phrases: normal distribution inverse, probit 
transform, Taylor series approximation 

CR Categories: 5.12, 5.5 

Language: Algol 


Description 

This procedure evaluates the inverse of the cumulative normal 
distribution, i.e. the normal deviate u(p), corresponding to the 
probability level p, where 


o(t) = me exp(—??/2). 


p = Plu) = i o(t) dt, (Qn)! 


—o 
An initial approximation to u(p), such as x(p), may be improved by 
using an expansion of u(z), defined as the inverse of 


u 
z=p— P(x) = | o(t) dt. 

z 
u(z) may be developed in a Taylor series about z=0, where 
u(0) =x, see ref. [1], 


n zZ Tr 
Un = x + yy cr(x) Call rl, 


and 
a(x) =1, c2(e) =x, 03(x) =2x?-+1, c4(x) = 6x? +-7x, 
Cr4i(x) = (rx -+-d/dx)c;(x). 


An error ¢(x) in the initial approximation, u=x(p), entails an 
error é€n(x) in uw, of the order of e*Me¢n41(x)/(n+1)!In order to mini- 
mize the maximum relative error R,=max | ¢n/un| in the result 
obtained from n terms of the Taylor series, several sets of coeffi- 
cients in an initial rational approximation styled after Hastings [2] 


a+ bs + cs* 
d+es + fs? + 53’ 
have been obtained such that | [e(x)]"*en41(x)/x | is minimax for 
|x |<40. For odd » the minimized expression is an even function 
of « and x, so that the relative error level may be halved when nx 
is odd by adding 4xR, . The resulting precision is shown below as 
Sn, 1.€. 


x(p) = s- s=(—2n(p)}, O<p<0O.5, 


‘ error(result) 
10-5" = max | —-——— 
result 
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a b c d e f Rn Sn 


ur 1271.059 450.636 7.45551 500.756 750.365 110.4212 0.62210 —7 7.50 
uz 1484,397 494.327 7.61067 589.557 855.441 119.4733 0.64410 — 10 10.19 
uz 1251.789 444,751 7.51005 493.187 739.156 109.3967 0.7431, — 13 13.43 
us 1637.720 494.877 7.47395 659.935 908.401 117.9407 O.1f lio — 15 15.95 
us 1488.369 460.200 7.38458 598.957 831.379 110.7527 0.94010 — 19 19.37 
ue 1269.225 448.718 7.49755 499.171 749.275 110.0194 0.75910 — 21 21.12 
ur 1266.846 448.047 7.49101 498.003 748.189 109.8371 0.16610 — 23 24.07 


According to the precision required, one set of coefficients and the 
corresponding labeled statement, selected from the following list, 
should be incorporated in. the procedure body as illustrated for 
the case of uw. 
ul: normdev := z + xX1.0000000311 
(x*zX0.5 + 1.0)Kz + x 
(((s + 0.5)Xz/3.0 + xX0.5)XKz + 1.0)Kz +x 
+ 0.3713:0>—13 
u4: normdev := (((((sX0.75 + 0.875)XKz + x) Xx + 0.5)*z/3.0 
+ xX0.5)*z + 10)Xz+ x 
u5: normdeyv := ((((((sX0.6 + 1.15)Xs + 0.175) Xz 
+ (sX0.75 + 0.875) Xx) &z + s + 0.5)Xz/3.0 
+ x X0.5)Xz + 1.0)Xz+-x 
+ 0.4219 —19Xx 
u6: normdev := ((((((120Xs + 326) Xs + 127)XxXz/6 
+ (24xs + 46)Xs + 7)Xz/40 + (0.75Xs 
+ 0.875) Xx) Xz 
+s + 0.5)X2z/3.0 + xX0.5)Xz + 1.0)Xz + x 
u7: normdey := (((((((((720 &s + 2556) Xs + 1740) Xs 
+ 127)X2z/7 
+ ((120Xs + 326)*s + 127)Xx)Xz/6 
+ (24X%s + 46)Xs + 7)Xz/40 + (0.75Xs 
+ 0.875) Kx) Xz 
+s + 0.5)Xz/3.0 + xX0.5)Xz + 1.0)Xz + x 
+ 0.832;.>—24x 
Coefficients in a similar Taylor series in powers of /n(P(x)/p), used 
in AS Algorithm 24 [3], require more computation than the cr(x) 
in these approximations. 

The real procedure supplied by the user for normal(x,y) should 
return the value of the tail area to the left of x and, via the second 
parameter, y, should return the value of #(x), which is often avail- 
able in the process of computing the tail area. A procedure based on 
Algorithm 304 [4] is recommended since other algorithms such as 
Algorithm 209 [5] and CDFN [6] lose precision as p approaches 
their error levels (about 1077, 107~!° respectively), whereas Algo- 
rithm 304 maintains precision until calculations involving (x) 
exceed the capacity of floating point representation. The similar 
CJ Algorithm 39 [7] matches the precision of v2 and may be readily 
modified to return also the value of (x). 

The user-supplied real procedure extreme (p) should cater for 
the cases p=0, p= 1, by returning suitable extreme values dependent 
on the floating point representation for the processor used, e.g. 
extreme(0) = -—37 where binary exponents are ten bits, since 
¢(—37) is approximately 2-?"° and ext:eme(1) = +7 for 36-bit 
precision, since P(x>7) is approximately 1—2-**, If p lies outside 
(0,1) the procedure should provide a diagnostic warning and may 
terminate or return an extreme value such as +37 as an indication 
of error to the calling program. 

Precision may be extended by using the D decimal digit result 


u2: normdev : 
u3: normdey := 
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from one application of normal and the n-term Taylor series as an 
initial approximation for a second application, thus increasing 
precision to at least (7+1)(D-logio(x?+1)) decimal digits (as noted 
by the referee) or at most the precision of wormal, e.g. m(m) as in 
CDFNI {6] would have a relative error O(107!4(x?++1)), if not 
limited by the use of the lower precision CDFN for normal. For 
double precision calculations the more elaborate higher order 
terms of the Taylor series may be evaluated using single precision 
operations, enabling achievement of extended precision with rela- 
tively little increase in processor time. Calculations to 25 decimal 
digit precision and independently calculated check values to 18 
significant digits {8] confirmed achievement of at least 10 significant 
decimal digits for Algorithm AS 24 and S, significant digits for this 
procedure, except for limitations of representation of p near 1.0. 
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Algorithm 
real procedure xormdev(p, normal, extreme); 
value p; real p; real procedure ormal, extreme; 
comment Input parameter p is the cumulative normal probability 
definéd by 


p = | o(t) dt, = ot) = ms exp(—?/2), 
aay (21)? 
normal (x,y) is a procedure for evaluating the above integral 
for u= x and which returns y = (x), extreme(p) is a procedure 
designed to handle extreme values of p. On completion of ex- 
ecution of this procedure normdev is an approximation for u; 
begin 
real s, x, 2; 
x := if p > 0.5 then 1.0 — p else p; 
if x < 0.0 then normdev := extreme (p) 
else 
begin 
comment Initial rational approximation; 
§.:= sqrt(—2.0 X In(x)); 
x := ((—7.49101 & s — 448.047) * s — 1266.846)/ 
(((s + 109.8371) K s + 748.189) * s + 498.003) + s; 


if p < 0.5 then x := —x; 
zZ:= p — normal(x,s); z:= z/s;s:= xT 2; 
ul: 
normdev := (((((((((720 &K s + 2556) X s + 1740) XK s 
+ 127) X 2/7 


+ ((120 XK s + 326) X s + 127) X x) X 2/6 
+ (24 K s + 46) X s +7) X 2/40 + (0.75 & s + 0.875) 
xXx) Xz 
+s+0.5) X 2/3.0+ x X 0.5) Xz4+10)Xz+x 
+ 0.832)5— 24 xx 
end seven term Taylor series for 24 decimal precision 
end normal deviate 
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Algorithm 443 


Solution of the Transcen- 
dental Equation we” = x [C5] 


F.N. Fritsch, R.E. Shafer, and W.P. Crowley [Recd. 11 
Dec. 1970, and 15 Sept. 1971] 

University of California, Lawrence Livermore 
Laboratory, Livermore, CA 94550 


Key Words and Phrases: transcendental function evaluation, 
solution of transcendental equation 

CR Categories: 5.12, 5.15 

Language: Fortran 


Description 

Purpose. WEW solves the transcendental equation wee = x 
for w, given x > 0, by an iteration that converges much more 
rapidly than either Newton’s method or fixed-point iteration. 
The user provides x = X. The routine returns w = WEW and the 
last relative correction ¢, = EN. Two versions are described here. 
Version A produces CDC 6600 machine accuracy (48 bits), and the 
relative error should be approximately e,?. Version B produces at 
least six significant figures, and the relative error should be approxi- 
mately en/. 

Iteration. Assuming x > 0, we may rewrite the equation de- 
fining w as . . 


w + log(w) = log(x). (1) 
For a given approximation w, to w, let Wai == Wn -+ 5, be a much 
better approximation. Substitution into (1) yields 
bn + log (1 + 6n/Wn) = log x — log Wa — Wn 

= Zn, say. 


Using the approximation [1] log (1 + 5/w) ~ (6w + 1/6 8)/ 
(w? + 2/3 6w) and clearing fractions yields the following quadratic 
equation for 6, : os 


(2/3 Wn +: 1/6)8,2 + (We? + Wa — 2/3 ZnWn)bn — ZnWn? = 0. 
Solving for the root that tends to zero as zn, -—> 0 gives 


2ZnWn 
- (1+ Wn — 2/3 Zn) + (1 + wa + 2/3 Za)? = 2zn)t ; 
This has a continued fraction expansion [3] 


5n 


2WnrZn 


b= 


201 + Wa) — zi 


2(1 + Wn +f 2/3 Zn) 
= 2Zn 
2(1 + Wa + 2/3 Zn) 


for which the third convergent yields sufficient accuracy. If we ig- 


Work performed under the auspices of the U.S. Atomic Energy 
Commission. 
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nore the quantity 2/3 z, in the third term, we obtain the iteration 
formula 


Wn+1 = Wa + on = wall + Cn), (2) 
where 


~ Zn ie 2(1 + wa)(1 + Wa t+ 2/3 Zn) — 2n 
1 We 21 -E wa)(1 + We + 2/3 2n) — 2zn’ 


and the error term is O(e,‘). An iteration which is O(e,') is obtained 
by truncating the continued fraction at the second convergent: 


=) zn(1 + Wr -+ 2/3 Zn) 
~ + wi + wa + 2/3 zn) — 1/2 za 


Initial guesses. For small values of x, the given equation has a 
series solution due to L. Euler [2]. A Padé rational fraction ap- 
proximation to this series is 


_ x + 4/3 x? 
~~ L 47/3 x + 5/6 x?” 


As computed from (5), wo(x) < w(x), good to within 5 percent if 
x = 2.5 and much better for smaller values of x. For larger values 
of x we may use 


Wo = log(x), (6) 


which has a maximum relative error no greater than 37 percent 
for x > e. Version A actually switches from (5) to (6) at x = 6.46, 
the approximate location of the intersection of the two relative 
error curves. With these initial guesses, one iteration of (2) with e, 
computed from (3) produces a maximum relative error of about 
2.7 X 10-5 (see Figure 1), so that a second iteration using (4) 
produces CDC 6600 machine accuracy. 

A much better initial guess for x > 0.7 can be derived by sub. 
stituting wo = /og(x) + 4 into (1) to obtain 


s+ tog (1+ 6+ 106 ()) <0. 


Exponentiation yields 


et—1— a= tos(Z). 
e 


Using a Padé approximation to the series expansion of the left 
hand side, we have 


(3) 


Cn 


(4) 


en 


(5) 


Wo 


25 — 1/2 & 
1 — 1/128?’ 
so that (approximately) 


(172 + 1/12 log ()) 8? — 25 — log @ =O: 


If this equation is solved approximately by the same procedure 
that was used to derive (3) and (4), the second convergent of the 
continued fraction yields the approximation 
24(/og*(x) + 2log(x) — 3) 

Tlog*(x) + 58/og(x) +127 © 


eF-1— $e — 


(7) 


Wo = log(x) — 


Version B switches from (5) to (7) at x = 0,7385. With these initial 
guesses, a single iteration of (2) with e, computed from (3) yields 
at least six-figure accuracy (see Figure 2). 

Testing. WEW has been tested for x in the range 0.01 < x < 
1000 against an algorithm that uses Newton’s method for small to 
moderate values of x and fixed-point iteration for large values of x 
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on both the CDC 6600 and 7600 computers. Measured computing 
times were about the same for small x (< 1.), but the time required 
by WEW is better hy a factor of 1.5 to 3.4 (depending on the re- 
quired relative error) for moderate to large x. Some typical times 
(microseconds) obtained on the Livermore Time Sharing System are 
given in Table I. 


Implementation Note. The section of coding preceding state- 
ment 20, labeled ‘‘set constants,’ provides a machine-independent 
means for setting the values of the constants C1, C2, C3, C4 on the 
first execution of WEW. Since the object of these algorithms is 
speed, it is recommended that the user compute these constants 
to the accuracy required for his particular machine and set them 
initially by means of a DATA statement. 


Fig. 1. Relative error | w — w, | /w with wo computed from (5) for 
x < 6.46 and from (6) for x > 6.46. The apparent cusp is due to 
the fact that the error curve (w — w,) has a zero near x = 80.4. 
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Fig. 2. Relative error | w — w, | /w with wo computed from (5) for 
x < 0.7385 and from (7) for x > 0.7385. The strange appearance of 
the curve for small x is due to the fact that wo = w for x = e and 
several values of x between 0.7 and 1.1. 
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Table I. Execution Times for WEW (microsec) 


CDC 6600 CDC 7600 
Version Version Version Version 
A B A B 
X< X,* 118 88 25.8 17.8 
X > X.* 105 87 23.0 18.1 


a NS TS A LS 
* X¥, = 6.46 for Version A or 0.7385 for Version B 
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Algorithms 


FUNCTION WEW (Xs EN) 


Cc 
Cc ITERATIVE SOLUTION OF X=W*EXP(W) WHEKE A IS GIVEN» CNOVEMBER 1970) 
c CREVISED - SEPTEMBER 1971) 
c VERSION A -~ CDC 6600 MACHINE ACCURACY + 
Cc 
Cc INPUT PARAMETER] 
c x ARGUMENT OF WCX)- 
c 
c OUTPUT PAKAMETERS) 
c wEW THE DESIRED S@LUTION. 
Cc EN THE LAST RELATIVE CORRECTION TO WCX)- 
c 
c 
Cc SET CONSTANTS ee 
DATA NEWE/17 
7 IF (NEWE) 10220210 
10 NEWE = 0 
C134-/3. 
C227 ./3- 
C3eS-/6- 
C4az2-/3. 


c COMPUTE INITIAL GUESS««. 

20 FLOGX = ALOGCX) 
IF €X=6-46) 30730, 40 

30 WN ® X¥C164+C1¥X)/C1 0 +X¥CC24C3#X) D 
ZN = FL@GX ~ WN - ALOGCWN)D 
G@ T® 50 

40 WN = FLOGX 
ZN = -ALOGCWN) 

50 CONTINUE 


c ITERATION GNE... 
TEMP = 1. + WN 
Y = 2-*TEMP*C(TEMP+C4#ZN) - ZN 
WN 2 WN#¥(1. + ZN*Y/CTEMP*CY-ZN))) 


Cc ITERATION TWGece 
ZN = FLOGX - WN - ALBGCWN) 
TEMP = 1. + WN 
TEMP2 = TEMP + C4*#ZN 
EN = ZN*®TEMP2/¢ TEMP*TEMP2- -5*ZN) 
WN = WN*C1.+EN) 


Cc RETURNe ce 
WEW = WN 
RETURN 
END 


FUNCTIGN WEW (Xs END 
ITERATIVE SOLUTION OF X2WeEXPCW) WHEKE X IS GIVEN. CNOVEMBER 1970) 
CREVISED - SEPTEMBER 1971) 
VERSIGN B -- MAXIMUM RELATIVE ERROR 3-E-7 - 


INPUT PARAMETER) 
x ARGUMENT OF WCX)- 


QGUTPUT FARAMETERS) 
WEW THE DESIRED SOLUTION. 
EN THE LAST RELATIVE CORKECTIGN TO WCXd> 


anaaagaaanaaananng 


SET CONSTANTS «ee 
EG@UIVALENCE (Fs FLOGX) 
DATA NEWE/1/ 

IF (NEWE) 10220410 
10 NEWE = 0 

Cli=4e/3~6 

C227 e/3e 

C325 -/66 

C4220/36 


Cc COMPUTE INITIAL GUESS+«-« 
20 FLOGX = ALOGCX) 
IF €X=«7385) 30230» 40 
30 WN = XCD S+CLeXI/C Le tX¥CC24C34X)) 
G@ T@ 50 
40 WN = F = 24ekC CF +20) #F 9-30) /0C 0 THF + SS 0 DHF+I2T od) 
sO CONTINUE 


Cc 
c ITERATION GNEcee 
ZN 2 FLOGX - WN - AL@GCWN) 
TEMP = le + WN 
Y = 2e*TEMP*CTEMP+C4*ZN) - ZN 
EN 2 ZNeY/CTEMP*CY-ZN) > 
WN = WN*(C1.+EN) 


aa 


RETURNe ee 
WEW = WN 
RETURN 
END 
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Remark on Algorithm 443 [C5] 

Solution of the Transcendental Equation we" = x 
[F.N. Fritsch, R.E. Shafer, and W.P. Crowley, Comm. 
ACM 16 (Feb. 1973), 123-124] 


Bo Einarsson [Recd. 5 Mar. 1973 and 4 June 1973] 
Research Institute of National Defense, Box 98, 
S-147 00 Tumba, Sweden 


This algorithm contains a violation of the Fortran standard as 
defined in [1]. According to Section 10.2.6 of the standard, certain 
variables in'a subprogram will be undefined at the exccution of 
the RETURN statement, if they are not in a common block. This 
applies to the section in Algorithm 443 labeled ‘‘set constants” 
and commented in the Implementation Note. The IBM FORTRAN 
IV H Extended Compiler (Program Product) makes use of the 
standard in such a way that the variable NEWE does not have the 
value zero at a reentry to the subprogram, so that the variable 
NEWE does not fill its purpose. On the other hand this compiler 
performs the divisions and stores the quotients, so that no divi- 
sions are needed at the execution of the subprogram. The IBM 
FORTRAN IV G Compiler performs as the authors of Algorithm 
443 take for granted. Other optimizing compilers may have the 
value of NEWE as zero at reentry but have undefined values of 
Cl, C2, C3, and C4, In that case the subprogram would produce 
erroneous results, 

The remark above is similar to the third paragraph of Remark 
on Algorithm 352 {2], where the consequences of Section 10.2.5 
of the standard are discussed. 

The problem with the local variables can be evaded without 
loss of computing efficiency by replacing statement 30 with 


30 WN = X * (3. +4. * X) / (3. +X *(7. +2.5 *X)), 


replacing C4 * ZN wherever it appears with ZN/1.5, and finally de- 
leting the section ‘‘set constants” and the “Implementation Note”. 
in version B the statement EQUIVALENCE (F, FLOGX) must 
be kept. 

I have also certified the routine (version B) by testing it in 
single precision on an IBM 360/75 by performing some statistics 
on R(x) = (we — x)/x. The first test used « = 0.01 (0.01) 10.00 
and the second a thousand x values from a normal random dis- 
tribution with mean value zero and variance I, but if the obtained 
random value x was nonpositive, a new value of x was computed. 
The values of R were calculated in double precision. 

The following results were obtained: 


Standard 
Mean value deviation Maximal value 
Test of R of R of | R | 
Linear —1.7-10°6 1.2-1076 4.2-1076 
Random —0.4-1076 0.5-1076 3.3-10°6 


Since the relative error in a single precision value on IBM 360 may 
be as high as 0.5 - 107°, the above results appear reasonable. 


References 
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An Algorithm for Extracting 
Phrases in a Space- Optimal 
Fashion [Z] 


R.A. Wagner [Recd. 5 Mar. 1971 and 30 Aug. 1971] 
Department of Systems and Information Science, 
Vanderbilt University, Nashville, TN 37203 


Key Words and Phrases: information retrieval, coding, text 
compression 

CR Categories: 3.70, 5.6 

Language: PL/I 


Description 

Introduction. The algorithm PARSE computes and prints a 
minimum-space form of a textual message, MS. The 
minimization is performed over all possible ‘“‘ parses” of MS 
into sequences of phrase references and character strings. Each 
phrase reference represents one of a finite collection, P, of 
phrases. The collection, P, must be selected before PARSE is 
applied. 

Assumptions and requirements. PARSE assumes that the unit 
of storage is the byte, defined such that one byte can hold either 


a single character of text or an integer ij in the range 0 < i < W. 


(For IBM 360 equipment, W = 256 = 2**8). PARSE also 
assumes that the number of different phrases in the collection P 
is no larger than W** PHC, and that each message to be parsed 
contains fewer than W**CHC characters of text. The parameter 
values CHC = PHC = 1 appear appropriate on IBM 360 
equipment, when PARSE is applied to short messages, such as 
compiler error messages. 

PARSE requires two arguments. The first is the message to 
be parsed; the second is the table of common phrases which 
may be used in the parse. 

PARSE assumes that an external procedure HASH is 
present; HASH(MS,1I,K) is defined as follows: Let Hi, H2,..., 
H,,, be a sequence of indices such that among them they exhaust 
all entries P(H;) such that 
SUBSTR(MS,1,3) = SUBSTR(P(H,),1,3). 

(That is, the H/,’s include indices for every phrase P(H;) which 
agrees with characters J, / + 1, and J + 2 of the given 
message. Other indices may occur among the H,’s, as well.) 
Then HASH(MS,I,0) = H,, HASH(MS,1,H;) = Hy4:, and 
HASH(MS,LH,) = 9. 
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A “hash table” procedure can easily be modified to yield 
this performance; an equally useable, although slower version 
returns MOD(K + 1, M + 1) onevery call. A procedure 
HASH is included below. 

Methods. The method used to determine which phrases to 
extract from the given message is described in [1]. The resulting 
parsed message requires least space, assuming that messages are 
storable only as described in [1]—that is, as sequences of 


C (number) (character string) 
| P (number) 


representing a literal string of characters, and a reference to a 
common phrase, respectively. 

During the course of the computation, arrays G and H are 
filled with values of functions g and A, respectively, as defined 
in [1]. Just before label BUILD is reached, 


H(I) = length of the best parse of SUBSTR(MS,D, and 
G(D length of the best parse of SUBSTR(MS,/I) among 
those parses beginning with a character string, 


both for J = 1,..., LENGTH(MS). 

Internally, PARSE uses a single array, Z, paralleling the 
function arrays G and H, to retain the information needed for 
re-constructing the parsed form of the message. 


Z(1) K, if G() > H(), where K is the number of the “best” 
common phrase matching MS at J, or 
J, if GW) = HW). (GW) < AD is impossible.) 


J gives the index of the end (plus one) of the character string 
starting at J. In this case, the best parse at J begins with this 
character string. J satisfies: G(J) > H(J) and for all k, I < k 
< J, G(k) = H(k). 

Results: To make the printed form of the parsed message 
more intelligible, PARSE prints: 
'C (number)! as '#ddd' 
'P (number)! as '%ddd' 
where ‘‘ddd”’ is the 3-digit decimal representation of (number) 
+ 1. In practice, a number representing a character count or 
phrase index can be stored as an integer, in place of CHC or 
PHC characters respectively. Thus, the character string 
‘ABC' would be stored as 'C2ABC', where 2 is a CHC-byte 
integer whose value is 2. The same string would be printed by 
the PARSE algorithm as '#003ABC'. 

The program PARSE returns the number of bytes needed to 
store MS, given the particular set of extractable phrases in P. 

A sample driver, two sample input streams and associated 
output follow the procedures PARSE and HASH. 


References 

1. Wagner, R.A. Common phrases and minimum-space text 
storage. Comm. ACM 16, 3 (Mar. 1973), 148-152. 

2. Bell, James R. The quadratic quotient method; a hash code 
eliminating secondary clustering. Comm. ACM 13, 2 (Feb. 1970), 
107-109. 
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Fig. 1. The PARSE Algorithm. 


PARSE: PROC(MS ,P) RETURNS(FIXED BINARY); 
DCL (MS, P(*)) CHAR(*) VARYING; 
DCL N; 
DCL HASH RETURNS(FIXED BINARY) ; 
DCL (CHC, /* BYTES PER CHARACTER-COUNT */ 
PHC) /* BYTES PER PHRASE-INDEX */ 
STATIC EXTERNAL FIXED BINARY; 


N=LENGTH(MS) ; 

BEGIN; 
DCL(G,H,Z)(N+1) FIXED BINARY; 
DCL(I,J,K,L,T) FIXED BINARY; 


G(N+1 )=3; H(N+1)=1; J,Z(N+1)=N47; 
MSGP: DO I=N BY -] To 1; 
K=HASH(MS ,I ,08); 
rae G(T} = MIN( G(I+1)+1, H(1+1)+CHC+2 ); 
I)=d; 
/* J HOLDS INDEX OF END+1 OF NEXT CHAR-STRING */ 
M1: DO WHILE (K>0); 
L=LENGTH(P(K)); 
IF L a> N-I+1] THEN 
IF L < N THEN 
IF SUBSTR(MS,I,L)=P(K) THEN 00; 
T=H(I+L)+PHC+1 ; 
IF H(1)>T THEN DO; 
H(I)=T; Z(1)=K; J=I; 
END; 
END; 
K=HASH(MS ,1,K) 5 
END M1; 
END MSGP ; 


PUT SKIP EDIT(H(1),N+3,': ')(2 F(4),A) 
I=1; GOTO Bl; 
BUILD: 
IF H(1)<G(I) THEN a 
PUT EDIT('%', (1) (A,P'999'); 
I= TALENGTHEP 2 ))); 
END; 
ELSE 00; 
J=Z(1)-1; 
PUT EDIT('#',d,SUBSTR(MS ,1,J))(A,P'999',A); 
I=Z(1); 
END; 
Bl; IF Ia>N THEN GOTO BUILD; 
PUT EDIT('. as 


RETURN(H(1) 
END PARSE; 
Fig. 2. An acceptable HASH procedure. 
HASH: PROC(MS,I,K) RETURNS(FIXED BINARY) ; 


DCL MS CHAR(*), J FIXED BINARY(31,0), 
(HT (0: SOOVINTTCCROTIOY, 
KJ, HP INIT(197), 
HX HY ,HZ) FIXED BINARY STATIC; 
DCL (CHC, /* BYTES PER CHARACTER-COUNT */ 
PHC) /* BYTES PER PHRASE-INDEX */ 
STATIC EXTERNAL FIXED BINARY; 


CALL HCMN(K) ; 
RETURN(HT(HZ)); 


HCMN: PROC(K); 
IF K = Q THEN 
IF LENGTH(MS)-I < PHC+] THEN HZ=-1; 
ELSE DO; 
UNSPEC (J) ae a 1,PHC+2)); 
HZ=MOD(J ,HP) 
HY=J/HP; 
HX=0; 
END, 
ELSE DO; 
HX=MOD(HX+HY ,HP} ; 
HZ=MOD(HX+HZ ,HP) ; 
END; 
HZ=HZ+1 ; 
RETURN; 
END HCMN; 


ENTER: ENTRY(MS,1,K); 
IF LENGTH(MS) < PHC+2 THEN RETURN; 
KJ=0; 

El: CALL HCMN(KJ); 

KJ=HT(HZ) ; 
IF KJ > © THEN GOTO E1; 
HT(HZ)=K; 
RETURN; 
END HASH; 


Fig. 3. A driver for the PARSE procedure. 


DRIVER: PROC OPTIONS(MAIN ); 
DCL MS CHAR(256) VARYING; 
OCL NP,M; 
DCL (HASH RETURNS(FIXED BINARY), ENTER) 


ENTRY(CHAR(256) VARYING, FIXED BINARY, FIXED BINARY); 


DCL PARSE RETURNS(FIXED BINARY); 

DCL (CHC, /* BYTES PER CHARACTER-COUNT */ 
PHC) /* BYTES PER PHRASE-INDEX */ 
STATIC EXTERNAL FIXED BINARY; 


CHC , PHC=1 ; /* COUNT/INDEX SIZE=1 BYTE */ 
GET SKIP LIST(NP,M); 
BEGIN; 

DCL P(NP) CHAR(M) VARYING; 

DCL NB,NA,I,J; 


NB,NA=0; 

DO I=1 TO NP; 

GET SKIP LIST(P(I ; 
CALL ENTER(P(I),1 i 
END; 
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PUT PAGE LIST('PHRASES, AND THEIR PARSED FORMS'); 
DO I=1 TO NP; 
PUT SKIP(2) EDIT(I, roeer tr pery fp orersy 


(F(4),A 
NA= NA+PARSE(P(I), P); 
END; 


PUT PAGE LIST( ‘MESSAGES: '); 
L1: GET SKIP LIST(MS) ; 
PUT SKIP(2) LIST( '''' |] MS |{ '''' ); 
IF MS='' THEN GOTO L2; 
NB=NB+LENGTH(MS)+CHC+2; 
/* ALLOW FOR STRING-OVERHEAD + END MARK */ 
NA=NA+PARSE(MS ,P) ; 
GOTO LI; 


L2: PUT SKIP EDIT('FINAL STATISTICS:', 
‘WITHOUT PHRASE EXTRACTION: ',NB, 
"AFTER PHRASE EXTRACTION: ',NA, 
"SAVING: ',NB-NA, 

"  (',(NB-NA)*100/NB, '%)') 

(A, 3(SKIP ,A,F(5)),A,F(5,1),A); 
RETURN; 
END DRIVER; 


Fig. 4. Sample input files. 


0 


(a) Two phrases, four messages. Illustrates heavily overlapping 


phrases. 


(b) Five phrases, 23 messages. These messages are the first 23 
numbered error messages from the syntactic analysis section of 


the PL/C compiler. 


A 
CMSO3 LISTING OF INPUT STREAM 

00001 
00002 2,10 
00003 "AAAAA' 
00004 " AAAAAAA’ 
00005 ‘AAAAAAAADA ' 
00006 ‘ MAAAAAAAAAAA ' 
00007 " AAAAAAAAAAAAAA ' 
00008 " AMAAAAAAAAAAAAA ' 
00009 pe 


. CMSO3 LISTING OF INPUT STREAM 
00001 
00002 5,20 
00003 ‘EXTRA ' 
00004 "MISSING ' 
00005 "IMPROPER ' 
00006 ‘SEMI-COLON ' 
00007 ‘ EXPRESSION‘ 
00008 EXTRA (' 
00009 ‘MISSING (' 
00010 ‘EXTRA }' 
00011 ‘MISSING )' 
00012 "EXTRA COMMA’ 
00013 "MISSING COMMA ' 
00014 ‘EXTRA SEMI-COLON' 
00015 "MISSING SEMI-COLON' 
00016 "MISSING :' 
00017 ‘MISSING = 
00078 ‘IMPROPER *' 
00019 "MISSING *' 
00020 ‘EXTRA END' 


00021 ‘MISSING END' 

00022 "MISSING KEYWORD' 

00023 ‘INCOMPLETE EXPRESSION' 
00024 "MISSING EXPRESSION’ 

00025 "MISSING VARIABLE‘ 

00026 "MISSING ARGUMENT, 1 SUPPLIED’ 
00027 "EMPTY LIST' 

00028 ‘IMPROPER NOT' 

00029 ‘IMPROPER ELEMENT ' 

00030 "UNTRANSLATABLE STATEMENT ' 
00031 os 


Fig. 5. Result of applying DRIVER to the cards listed in Figure 
4(a). Note that phrase 2 is itself reduced in size by PARSE, while 
each of the messages are reduced to strings of phrase references 


alone. 
PHRASES, AND THEIR PARSED FORMS 


1 'AAAAA' 

8 8: #O05AAAAA. 
2 “'AAAAAAA' 

7 10: #002AA%001. 


MESSAGES : 


"AAAAAAAAAA' 
5 13: %0012%001. 


‘AAAAAAAAAAAA' 
5 15: %001%002. 


‘AAAAAAAAAAAAAA' 
5 17: %002%002. 


"AAARAAAAAAAAAAAA ' 
7 18: %001%001%001. 


FINAL STATISTICS: 
WITHOUT PHRASE EXTRACTION: 63 
AFTER PHRASE EXTRACTION: 37 
SAVING: 26 {( 41.3%) 
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Fig. 6. Result of applying DRIVER to the cards listed in Figure 4(b). 
PHRASES, AND THEIR PARSED FORMS 


i "EXTRA ' 

9 9: #Q06EXTRA . 

2 ‘MISSING ' 
TI 11: #OO08MISSING . 
3. IMPROPER ' 
12 12: #OO9IMPROPER . 


4 'SEMI-COLON' 


13. 13: =#010SEMI-COLON. 
5‘ EXPRESSION' 
13. 13: #O10EXPRESSION. 
MESSAGES : 
"EXTRA (' 
6 10: %001#001(. 
"MISSING (' 
6 12: %002#001(. 
"EXTRA )' 
6 10: %001#001). 
"MISSING )} 
6 12: %002#001). 
"EXTRA COMMA' 
10 14: %001#005COMMA. 
"MISSING COMMA‘ 
10 16: %002#005COMMA. 
"EXTRA SEMI-COLON' 
5 19: %001%004. 
"MISSING SEMI-COKON' 
5 21: %002%004. 
"MISSING :' 
6 12: %002#001:. 
"MISSING =' 
6 12: %002#001=. 
"IMPROPER *' 
6 13: %003#001*. 
"MISSING *' 
6 12: %002#001*. 
‘EXTRA END' 
8 12: %001#003END. 


"MISSING END' 


8 14: 


#002#003END. 


"MISSING KEYWORD ' 


12 (18: 


%002#007KEYWORD. 


"INCOMPLETE EXPRESSION’ 


16 24: #011 INCOMPLETE %005. 


"MISSING EXPRESSION‘ 
5 21: %002%005. 


"MISSING VARIABLE ' 


13° 19: %002#008VARIABLE. 


"MISSING ARGUMENT, 1 SUPPLIED' 
25 31: %002#020ARGUMENT, 1 SUPPLIED. 


"EMPTY LIST' 


13° 13: #O10EMPTY LIST. 


‘IMPROPER NOT' 


8 15: %003#003NOT. 


‘IMPROPER ELEMENT ' 


12 19: %003#007ELEMENT. 


‘UNTRANSLATABLE STATEMENT‘ 
27° 27: #024UNTRANSLATABLE STATEMENT. 


FINAL STATISTICS: 


WITHOUT PHRASE EXTRACTION: 
AFTER PHRASE EXTRACTION: 
SAVING: 93 ( 24.6%) 
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Algorithm 445 


Binary Pattern 
Reconstruction from 
Projections [Z] 


Shi-Kuo Chang [Recd. 4 Nov. 1970 and 12 May 1971] 
School of Electrical Engineering, Cornell University 
Ithaca, NY 14850. 


Key Words and Phrases: pattern reconstruction, image 
reconstruction, data compression, picture processing 

CR Categories: 3.63, 5.30 

Language: Algol 


Description 

This procedure reconstructs a binary pattern from its horizon- 
tal and vertical projections [1]. The parameters are described as 
follows. m, n are the dimensions of the binary pattern f. switch is 
an integer variable. fx [1:7] is the projection of f on the horizontal 
axis. fy [l:m, 1] is initially set to (1, 2, ..., m). fy [lim, 2] is the 
projection of f on the vertical axis. f{1:7, 1:m] is the pattern to be 
reconstructed, initially set to 0. 

The projections fx and fy are inconsistent if there is no pattern 
Jf having such projections. The pattern f is unambiguous if there is 
no other pattern having the same projections as f. Given the pro- 
jections fx and fy, there are three possibilities: (1) fx and fy are 
inconsistent; (2) they are consistent but the pattern fis ambiguous; 
or (3) they are consistent and the pattern fis unambiguous. 
(1) Inconsistent Projections. This procedure sets switch to —1 and 
reconstructs a pattern f having the correct horizontal projection 
Jx. Its vertical projection will be different from /y. 
(2) Ambiguous Pattern. This procedure sets switch to 0 and re- 
constructs a pattern f having projections fx and /y. 
(3) Unambiguous Pattern. This procedure sets switch to 1 and 
reconstructs a pattern f having projections fx and /y. In this case f 
is unique. 


References 

1. Chang, S.-K. The reconstruction of binary patterns from their 
projections. Comm. ACM 14, 1 (Jan. 1971), 21-25. 

2. Chang S.-K., and Shelton, G.L. Two algorithms for multiple- 
view binary pattern reconstruction. EEE Trans. Syst., Man, 
Cybern. (Jan. 1971), 90-94. 


Algorithm 

procedure Pattern Reconstruction (switch, m, n, fx, fy, f); 
integer m, n, switch; integer array f/x, fy, f; 
comment The parameters are defined as follows: switch is an 
output parameter with values —1, 0, or 1 according as the 
projections aie inconsistent (switch = —1), the pattern is 
ambiguous (switch = 0), the pattern is unambiguous (switch = 
1). m is the column dimension of the binary pattern f, and 77 is 
the row dimension of the binary pattern f. m and n are input 


Author’s present address: Institute of Mathematics, Academia 
Sinica, 910 Nankang, Taiwan, Republic of China. 
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parameters. The array fx [1:7] is the projection of the binary 
pattern fon the x axis. fx is an input array. The array fy [l:m, 
1:2] contains 1, 2,..., m in column 1 initially, and column 2 
contains the projection of the binary pattern fon the » axis. fy 
is an input array, and it is modified by this procedure. The array 
f [1:n, 1:m] contains 0 initially and contains the reconstructed 
binary pattern finally; 


begin 


integer ix, iy, j, number; 
procedure Sort; 
begin 
integer limit, ind, i; 
limit := m — 1; 
SI: 
ind := 0; 
for i := 1 step 1 until /imit do 
if fy [i, 2] < fy [i+1, 2] then 


begin 
integer /1, 72; 
ind := 1; 


H:= fy fit, 1]; 2:= fy (/+1, 2]; 
fy (i+1, 1) := fy [4 1); 
fy [i+], 2) := fy fi, 2); 
Wiis Aye Wb 2)4= 2 
end; 
limit := limit —1; 
if (limit > 0) A (ind = 1) then go to S! 
end Sort; 
procedure Merge; 
if fy [number, 2] < fy [number+1, 2] then 
begin 
integer n1, 2, t1, 72; 
ni := number; 
S§2: 
if 1 > 1 then 
begin 
if fy [nl, 2] = fy [n1—1, 2] then 


begin 71 := nl — 1; go to S2 end 
end; 
n2:= number + 1; 
S3: 
if 12 < m then 
begin 
if fy [n2+1, 2] = fy [n2, 2] then 
begin n2 := n2+ 1; goto S3 end 
end; 
S4: 
tH := fy [al, 1); 12 := Sy [al, 2]; 
Sy (nl, 1) := fy [n2, 1); fy (al, 2] := fy [n2, 2]; 
fy [n2, J := A; fy [n2, 2) := 12; 


if (nl < number) /\ (number+1 < 2) then 
begin nl := nl +1; 12:= n2— 1; goto S4 end 

end Merge; 

comment The procedure Sort orders fy, and the procedure Merge 
reorders fy. The main procedure now follows; 


switch := 1; 

Sort; 

for ix := 1 step 1 until 7 do 
begin 


number := fx [ix]; 
if number > O then 
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begin 
for j := 1 step 1 until number do 
begin 
iy := fy Ui, 1); 
fy i, 2 = fy Ui, 21 — 4; 
Sf {ix, iy] := 1 
end; 


comment One column of fis reconstructed; 
if number < m then 


begin 
if (switch=1) /\ (fy|number, 2] < fy[number-+1, 2]) 
then switch := 0; 


comment The above condition indicates that the 
pattern fis ambiguous, and the switch is set to 0; 
Merge; 
comment fy is reordered before we start to reconstruct 
the next column; 
end 
end 
end; 
for j := 1 step | until 7m do 
if fy Uj, 21 ¥ 0 then switch := —1; 
comment The above condition indicates that the projections are 
inconsistent, and the swifch is set to —1; 
end Pattern Reconstruction 


Remarks on Algorithm 445 [Z| 

Binary Pattern Reconstruction from Projections [by 
Shi-Kuo Chang, Comm. ACM 16 (Mar. 1973), 
185-186} 


John Lau [Recd. 22 July 1971] 
Department of Computer Science, University of 
British Columbia, Vancouver 8, B.C., Canada 


Key Words and Phrases: pattern reconstruction, image 
reconstruction, data compression, picture processing 

CR Categories: 3.63, 5.30 

Language: Algol 


The procedure works well for all consistent patterns, ambiguous 
or unambiguous. However, when fx and /y are inconsistent, the 
procedure can construct a pattern f[1:n, 1:m] with fx satisfied, only 
if all elements of fx have values between 0 and m. If any of these 
elements is greater than m, a program interrupt would usually be 
caused by “value of subscript outside declared bounds’’ when the 
program executes the lines 


for j := 1 step 1 until number do 
begin 

iy: = fyl/, 1; 

Sys, 2) := fli, 2) — 1; 

Slix, iy] := 1 
erd; 


and execution of program would then be terminated. Even if a 
pattern could be constructed in this case, it would not be able to 
satisfy fx entirely. 
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Algorithm 446 


Ten Subroutines for the 
Manipulation of Chebyshev 
Series [Cl] 


R. Broucke [Recd. 17 May 1971 and 7 April 1972] 
University of California, Los Angeles, CA 90024, and 
Jet Propulsion Laboratory, Pasadena, Calif. 


Key Words and Phrases: Chebyshev series, differentiation, 
integration, curve fitting, approximations, negative powers 

CR Categories: 5.12, 5.13, 5.16 

Language: Fortran 


Description 

Introduction. These subroutines deal with the manipulation of 
Chebyshev series. The operations performed are the construction 
of the Chebyshev approximation of functions, the evaluation of the 
series or their derivative, the integration or differentiation, and the 
construction of negative or fractional powers of such a series. 

The subroutines are written in ANSI Fortran. They have been 
used without modification on such computers as the IBM-7094, 
IBM-360/91 (Fortran-I1V-G compiler) and Univac 1108 (Fortran-V 
compiler), 

The ten subroutines are considered as a single set, principally 
because they all use the same storage philosophy. All information 
is transmitted through the CALL-sequence rather than through 
the use of COMMON statements. Therefore, the user must provide 
storage for all the series in his main program, taking into account 
that all operations are performed in double precision. The coeffi- 
cients of each series occupy a one-dimensional double-precision 
array according to the rules of ANSI Fortran. When several 
Chebyshev series are being manipulated, it is convenient to store 
all the series in a matrix. Each column of the matrix contains a 
single series, in order that the coefficients of each series occupy 
consecutive storage locations. 

The first six subroutines contain no calls to other subroutines; 
in this sense they may be considered as independent. Each sub- 
routine can be used separately. 

In the present type of operations, it is extremely important to 
design and perform a large number of tests to certify all of the 
subroutines. We have tested the subroutines by generating some 
Chebyshev series which were published by Clenshaw [4], but we 
have also tested them with a number of additional methods; for 
instance: 

a. The series for several elementary functions such as sin(x), 
cos(x), sin(2x), and cos(2x) have been constructed directly. These 
series have then been evaluated, and the values have been compared 
with the values of the functions. 

b. The series for cos(2x) and sin(2x) have been derived from the 
series sin(x) and cos(x) by multiplication and addition of series. 

c. The series for sin(x) and cos(x) have been derived from each 
other by integration and differentiation. 
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d. Many tests have been made by multiplying a series f(x) by the 
series 1/f(x) or for instance by squaring the series for f(x), or other 
similar operations. 

The generation, evaluation and multiplication subroutines. The 
methods for the generation of a Chebyshev series have been taken 
from C.W. Clenshaw’s papers [3, 4, 5]. The rule for the multiplica- 
tion of Chebyshev series is also described by Clenshaw (3, p. 137], 
but the flowchart of our subroutine is from L. Carpenter {2]. 

We only consider the interval (—1, +1) of the independent 
variable x, and we represent a truncated Chebyshev series of order 
nin the form: 


J(x) = (€0/2) + OT, (x) + c2T2(x) + tee + CnT n(x). (1) 


We want to draw the user’s attention to the fact that we use a 
factor 4 in the zero-order term but not in the last term of the series. 
Some authors have used different conventions in relation to this 
factor 4 for the first and last terms. 

In the applications of the subroutines some caution is also 
necessary, because the independent variable x (the Chebyshev 
independent variable) is within the limits (—1, +1). If the user’s 
variable ¢ (the physical independent variable) is within the limits 
(ti , t2), the conversions between ¢ and x should be made with the 
linear relations 


t = ((t + t)/2) + ((t — t)/2)x; (2) 
x = ((2t — (4 + h))/(t — t)). 


The coefficients c; in formula (1) are computed with the rule 
given by Clenshaw (4, p. 3]: 


ci = (2/n D0” f(cos (mj/n)) cos(mij/n); i =0,1,...,0. (3) 


The double accent means that the first and last terms of the sum are 
divided by two. It is seen that 7 + 1 special values of the function 
J (x) are needed. In some applications, 7 has been as large as 1,500. 

A large number of applications have shown that in most in- 
stances the user desires to construct the Chebyshev series for not 
just one function but for several functions simultaneously. For 
instance, in the study of the motion of a particle there will always 
be three coordinates, x; , x2, x3, rather than just one. For this 
reason we programmed the subroutine CHEBY to efficiently con- 
struct several Chebyshev series simultaneously. In particular, the 
number of cosine calculations has been minimized. There will be 
only 2 cosine calculations, no matter how many functions are 
being analyzed simultaneously. 

Besides the main program, the user will have to provide his own 
subroutine for the evaluation of the special values of the functions 
to be analyzed, as explained in the comments of the subroutine 
CHEBY. The user may choose any name for this subroutine; how- 
ever, this name has to be transmitted through the CALL CHEBY- 
statement. This function subroutine will generally evaluate the 
function values either by using the appropriate formulas or by per- 
forming table lookup and interpolations if the data is only available 
in the form of a table with discrete points. 

The subroutine ECHEB evaluates a Chebyshev series with the 
aid of Clenshaw’s recurrence rule [4, p. 9]. The c,’s being the coeffi- 
cients of the given series, we compute the values bays , Bngi, On, -- - 
bo with: 


2 


base = bngt = 0; b= 2xbis1 = bisa + ci, (4) 


where the subscript i runs from n to 0. The number of arithmetic 
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operations involved is only 3, and the value of the function is then 
f(x) = (6) — by)/2. 

The subroutine EDCHB evaluates the derivative of a Chebyshev 
series (without storing the coefficients of the differentiated series). 
It implements a combination of the evaluation formula (4) and 
the differentiation formula (6) given below. 

The differentiation and integration subroutines. Clenshaw’s 
formulas [4, p. 11] have again been used for the differentiation and 
integration operations. The coefficients a; of the integrated Cheby- 
shev series are derived from the input coefficients c; by: 


@i = (Cin — Ci41)/2i5 

i=1,2,...,a—-1. 
The coefficients d; of the differentiated series are obtained by a set 
of recurrence equations: 


d, = 0; 


a = 0; An = Cn1/2n;3 


(5) 


Ai = digi + ic; ; 
ae | ea A | ea ae 


dn-1 _ 2NCp ; 


(6) 


When using the differentiation and integration subroutines, the 
user should remember the relation between the differentials of 7 
and x: 


dt = ((te — f1)/2) dx = (At/2)dx. (7) 


This should be considered whenever differentiation or integration 
of Chebyshev series is performed. For instance we have for any 
Chebyshev series /: 


§ fdt = (At/2) J fdx. (8) 


Negative and fractional powers. Our last four subroutines, 
dealing with expansion or iteration methods for the generation of 
noninteger powers of a Chebyshev series, are somewhat more so- 
phisticated than the first six subroutines, but the theoretical basis 
of their operation has recently been described in detail [1]. For this 
reason, they will not be described in more detail here. All four sub- 
routines use the multiplication subroutine MLTPLY but are 
otherwise independent. The subroutines BINOM, XALFA?2, and 
XALFA3 all have the same purpose but operate with different 
methods and have different convergence properties. All three are 
given in order to allow the user to experiment and eventually select 
the one that is most efficient for his particular application. 

Acknowledgments. I wish to thank Nancy Hamata at the Jet 
Propulsion Laboratory for her assistance in the programming and 
debugging of the present subroutines; also the two anonymous 
reviewers for their useful suggestions. 
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Algorithm 

SUBROUTINE CHEBYCNFs NPL» NPLMAAs N2s FUNCTNs Xo 
SIMULTANEOUS CHEBYSHEV ANALYSIS GF NF FUNCTIGNS 
COMPUTES A MATRIXs Xs CONTAINING ONE CHEBYSHEV SEKIES PEK 
COLUMN FOR A GIVEN NUMBEK GF FUNCTIONSs NFe INPUT NFLe 
THE NUMBER GF TERMS IN ALL SERIES» NPLMAXs THE ROW 
DIMENSION @F X IN THE CALLING PK@GRKAM (MUST BEeGE«NPL)> 
N2,s DIMENSION @F GC (MUST BE«GEe2*(NPL-1))% AND FUNCTNs 
THE NAME OF USEK SUBROUTINE WHICH DEFINES THE NF 
FUNCTIGNS- FXJ AND GC AKE ¥:OKK SPACE. 
AN EXAMPLE @F SUCH A SUBX@UTINE IS AS FOLLOWS 
SUBR@UTINE FUNCTNCAs VAL) 
DOUBLE PRECISION As VALC2) 
VALCLIFDSINCA)D 
VALC2)2DCOSCA) 
RETURN 
END 


FXJs GC) 


aanananaanaagaaganaaan 
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DGUBLE PKECISION XCNPLMAXsNF)s FXUCNF D2 GCCN2)> 
* FK» PEN», FAC 
D@ 20 K#lsNPL 
DQ 10 J=lsNF 
X€KaJ)? = 04D0 
10 CONTINUE 
20 CONTINUE 
N = NPL - 1 
ENN = N 
PEN = 341415926535897932D0/ENN 
DO 30 K=1,N2 


30 CONTINUE 
D® BO J=t»NPL 
XJ = GCCUD 
CALL FUNCTNC XJ» 
IF (JeNEol «ANDe 
DO 40 K=tsNF 
FXRJCK) = «SDO*eFAICK) 
40 CONTINUE 
50 D@ 70 L=ts NPL 
LM = MODCCL-1)*CJ-1)d2N2) + 1 
DO 60 K=1sNF 
X(LeK) = XCLeK) + FXISCK)*GCCLM) 
60 CONTINUE 
70 CONTINUE 
80 CONTINUE 
FAC = 2eQODO/ENN 
DO 100 K=1lsNPL 
D@ 90 J=ieNF 
XCKa J) = FACKXCKs J) 
90 CONTINUE 
100 CONTINUE 
RETURN 
END 


FXJ) 
JeNEeNPL) GO Te SO 


SUBROUTINE MLTPLYCAXs K2e NPL» X3) 
C MULTIPLIES TwO GIVEN CHEBYSHEV SERIES» KX AND A2@s WITH 
C NPL TERMS T@ PRODUCE AN CUTPUT CHEBYSHEV SEKIESs A3- 

DOUBLE PRECISICN XXCNPL)» X2CNPL)» ASCNPLI» EX 

DO 10 K=1i,sNPL 

X3CK) = 0-0D0 
10 CONTINUE 

D@ 30 K=1,NPL 

EX = 0.0D0 


M+K- 1 
EX = EX # XXCMI*¥X2CL) # XACL)¥X20M) 
20 CONTINUE 
XA3CK) = 
30 CONTINUE 
X3CL) = X3BC1) = O-SDOeXXC1L)*XNCI1) 
D@ 50 K=3,NPL 
EX = 0.0D0 
MM = K = 1 
D@ 40 M=2sM™M 
Lz=K-M + I 
EX = EX + KXCM)*X2¢L) 
40 CONTINUE 
K3CK) 2 OeSDO*EX + X3CK) 
SO CONTINUE 
RETURN 
END 


0 «SDO*EX 


SUBROUTINE ECHEB(X» CQEFs NPL» FX) 
C EVALUATES THE VALUE FX(X) @F A GIVEN CHEBYSHEV SERIES» 
C COEFs WITH NPL TEKMS AT A GIVEN VALUE @F X BETWEEN 
C -t- AND 1. 
DOUBLE PRECISIGN CQEFCNPL)» Xs 
Bk = 0+0D0 
BRPP = 0.0D0 
DQ 10 K=1.NPL 
J = NPL - K +1 
BRP2 = BRPP 
BRPP = BK 
BR = 2¢ODO*#X*BRPP - BRP2 + C@EFCJ) 
10 CONTINUE 
FX 3 0+5D0%(BR-BRP2) 
RETURN 
END 


FX» Bros BRKPPs BKP2 


SUBROUTINE EDCHEBCXs CGEFs NPL» FX) 
C EVALUATES THE VALUE FXC(X) OF THE DERIVATIVE OF A 
C CHEBYSHEV SERIES» CQEFs WITH NPL TERMS AT A GIVEN 
C VALUE @F X BETWEEN <-1+ AND 1. 
DOUBLE PRECISION COEFCNPL)»s Xs 
* BJPL» BJs BFs DJ 
XJF2 2 0.0D0 
XJPL = 0-0D0 
BUJP2 = 0.0bD0 
BJPL = 0-0DO0 
N = NPL - 1 
D@ 10 K=1lsN 
J = NPL - K 
DJ 2 J 
XJ * 2eDO*COEFC U+1)*DJ * XJP2 
BJ = 2eD0*X*BJPL - BUP2 + XJ 
BF = 
BJUP2 = 
BJPL = 
XJP2 = 
XJPL = XJ 
10 CONTINUE 


FX» XJP2s XJPLs XJe 


FX = »5SD0*(BJ-BF) 
RETURN 
END 


ENN» XJs 


BJUP2s 


0 
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SUBROUTINE DFRNTCXXs NPL» X2) 


C@MPUTES THE DERIVATIVE CHEBYSHEV SERIESs X2s OF A GIVEN 


CHEBYSHEV SERIES» XXs WITH NPL TERMS. 
TQ REPLACE A SERIES X BY ITS DERIVATIVEs USE 
CALL DFRNTCXsNPL»X) 


D@UBLE PRECISION XXCNPL)» XXN» XXL» DN» DL» X2CNPL) 


DN = NPL - 1 
AXN @ XXCNPL-1) 
X2CNPL-1) = 2eDOeXXXENPL)*DN 
X2(NPL) = 0.DO 
D@ 10 K=3s5NPL 
L = NPL ~ K + 1 
DL = L 
XX. = XXCL) 
X2CL) = X2CL+2) + 2eDO*AKXN*DL 
XAXN = XXL 
10 CONTINUE 
RETUKN 
END 


SUBKOUTINE NTGRTCXXs NPL» X2) 


COMPUTES THE INTEGRAL CHEBYSHEV SEKIESs» X2s OF A GIVEN 


CHEBYSHEY SERIESs XX» WITH NPL TEKMS. 

TO REPLACE A SERIES 4 BY ITS INTEGRAL» USE 

CALL NTGRTCXsNPLoX) 
DOUBLE PRECISI@N XAXCNPL)» APKs TEXMs DK» X2CNPL) 
XPR = XXC1) 
x2c1) = 0-0D0 


TERM = CXPR-XXCK+1)9)702 «DO*DK) 
XPK = XXC€K) 
xXecK) = TEKM 
10 CONTINUE 

DK = N 

K2CNPL) = XPR/€2.D0*DK) 

RETURN 

END 


SUBROUTINE INVEKTCXs XXs NPL» NETs XNVSEs Weis WO) 
COMPUTES THE INVERSE CHEBYSHEV SEKIESs XNVSEs GIVEN A 
CHEBYSHEV SEKIES» xX» A FIRST APPKOAIMATION CHERBYSHEV 
SERIESs AX» WITH NPL TEKMSs AND THE NUMBEK OF 
ITERATIONS» NET. THE SUBRGUTINE USES THE EULEK METHOD 
AND COMPUTES ALL PQWERKS EPS*#*K UP TE K=2**C(NET+1)>» 
WHERE EPS=1-X*(XX INVERSE) « Ww AND We AKE WORK SPACE. 
SUBROUTINES USED ~ MLTPLY 


DOUBLE PRECISION XCNPL)» XXCNPL)s ANVSECNPL)»s WWONPL),» 


* W2CNPL) 
CALL MLTPLY(X» AXs NPL» WwW) 
WWCL) = 2-D0 - WhCl> 
DO 10 K=2sNPL 
WWCOK) = =WWCK> 
10 CONTINUE 
CALL MLTPLYChWs Whs NPL» we) 
VWC1) = 2-DO + WWC1) 
DO 40 K=1sNET 
CALL MLTPLYCWWs, W2s NPL» XNVSE) 
D@ 20 J=lsNPL 
WwWwCS) = WWCUJ) + ANVSECJ) 
20 CONTINUE 
CALL MUTPLYCW2s W2s NPL» XNVSE)D 
D@ 30 J=lsNPL 
W2¢J) = XNVSECJ) 
30 CONTINUE 
40 CONTINUE 
CALL MLTPLYCWWs, XX» NPL» XNVSE) 
KETUNN 
END 


SUBROUTINE BINGMCXs» XX» NPL» Ms NTs KAs WWs W2s W3) 
C@MPUTES THE BIN@MIAL EXPANSIGN SEKIESs XAs FOR (-1/7M) 


POWER @F A GIVEN CHEBYSHEV SERIESs Xs WITH NPL TERMS, 
WHERE M IS A POSITIVE INTEGER. XX IS A GIVEN INITIAL 
APPROXIMATION TO X**C-1/7%)6 NT IS A GIVEN NUMBER OF 


TERMS IN BIN@MIAL SERIES« WWs, W2s AND W3 ARE WORK SPACE 


SUBROUTINES USED - MLTPLY 


DQ@UBLE PRECISI@N XCNPL)» XXCNPL)» XACNPL)»s WWONPL)» 


* WECNPL)» W3CNPL)» ALFAs COEFs DM» DKMMs DKM2 
DM = ™ 
ALFA = -1-DO/DM 
D@ 10 Jel»NPL 
WWwCJ) 3 XC) 
10 CONTINUE 
D@ 30 K=15M 
CALL MLTPLYCWWs XX» NPL» W2) 
DQ 20 J=lsNPL 
WwCJ) = W2CJ) 
20 CONTINUE 
30 CONTINUE 
WWC1l)d) = WWCL) = 2¢DO 
XACI) = 2eDO 
D@ 40 J=2sNPL 
XACJ) = 0-0D0 
Ww3¢J) = 0.DO 
40 CONTINUE 
W3C1) 5 2eDO 
D@ 60 K=2s,NT 
DKMM = K = 1 
DKM2 = K ~ 2 
COEF = CALFA-DKM2) /DKMM 
CALL. MLTPLYCW3s WWs NPL» We) 
D@ $0 J=!,NPL 
W3CJ) = W2CJ)*COEF 
XACJ) = XACI) + W3ICI) 
50 CONTINUE 


agaaAaAaNgANgANAAAN 


QAaAANAANAAAN 


60 


70 


CONTINUE 
CALL ME TPLY(XAs AX» NPLs be) 
D@ 70 J=1sNPL 
KACS) = W205) 
CONTINUE 
RETURN 
END 


SUBROUTINE XALFA2(Xs XX» NPL» Ms MAXETs EPSLNs NET» 


* We) 


REPLACES A GIVEN INITIAL APPKOXIMATION CHEBYSHEV SERIES» 


XXs 


BY A GIVEN CHEBYSHEV SERIES» X» WITH NPL TEKMS»s 


RAISED T@ THE (€-1/M) POWEKs WHEKE M IS AN INTEGER. 
INPUT MAXETs MAXIMUM ALLOWED NUMBER OF ITEKATIONSs, AND 
EPSLN» REQUIRED PRECISION EPSILON» GUTPUT AKGUMENTs 


NET 


» TS NUMBER GF ITERATIONS PXKEFOKMEDs IF MAXET=NETs 


REQUIRED PKECISION MAY N@T HAVE BEEN KEACHED AND THERE 


MAY 
CON 


BE DIVERGENCE. WW AND W2 AKE &GKK SPACE. 
VERGENCE IS QUADRATIC 


SUBROUTINES USED - MLTPLY 


40 


50 


60 


KEP 
XXo 
RAI 
INP 
EFS 
NET 
REO 
MAY 
CON 
SUB 


30 


40 


50 


60 


70 


80 


90 


D@UBLE PKECISIQON XCNPL)» XXCNPLD» WWCNPL)s WECNPL)» 
* EPSLN»s DALFAs DMs S» TDMM 
DM = 
DALFA = 1-DO/DM 
TDMM = 2¢DO*CDM+1-D0) 
DQ 60 JX=1sMAXET 

D@ tO Lai,NPL 
WWCL)> = XCL) 

CONTINUE 

D@ 30 K=tsM 
CALL MLTPLYCWWs, XX» NPL» we? 
D@ 20 L=tsNPL 

WWCL) = WetL) 

CONTINUE 

CONTINUE 

S = -2-D0 

D@ 40 L=1,NPL 
S = S + DABSCWhWCL)) 
VWwCL) = -WWCL) 

CONTINUE 

WWC1>) = wWCL) + TDMM 

CALL MLTPLYCWhs XX» NPL» We) 

D@ 50 L=1,NPL 
XX(L) = W2CL)*DALFA 

CONTINUE 

NET = JX 

IF (DABS(S)-LT+EPSLN) RETURN 

CONTINUE 
RETURN 
END 


SUBRGUTINE XALFA3CXs XXs NPL» Ms -MAXETs EPSLN»s NET» 
* We) 
LACES A GIVEN INITIAL APPROXIMATION CHEBYSHEV SEXKIES> 
BY A GIVEN CHEBYSHEY SEKIES» Xs WITH NPL TERMS» 
SED T@ THE (-1/M) POWERs WHEKE M IS AN INTEGER. 
UT MAXETs MAXIMUM ALLOWED NUMBEK @F ITEKATIONSs AND 
LN» REGUIKED PRECISION EPSILON. OUTPUT AKGUMENTs 
» IS NUMBER @F ITERATIONS PREFQKMEDe IF MAXET=NETs 
UIRED PKECISI@GN MAY NGT HAVE BEEN KEACHED AND THEKE 
BE DIVERGENCE.» WW AND W2 AKE WORK SPACE. 
VERGENCE IS GF GRDEK THXEE 
ROUTINES USED - MLTPLY 
DOUBLE PRECISION XCNPL)»s XXCNPLI» WHONPL)» W2CNPL)> 
* EPSLN» DALFAs DMs Ss» TDMMs PSDML 
DM =™M™ 
DALFA = 1.D0/D™M 
TDMM = 2-D0*(DM+!1-DO>) 
PSDML = .5D0*(DM+1.D0) 
D@ 9O JX=1eMAXET 
D@ 10 L=tsNPL 
WutL>d = XCL) 
CONTINUE 
DQ 30 K=15™ 
CALL MLTPLYCWWs XXs NPL» We) 
D@ a0 L=1,NPL 
WWCL)-= W2CL) 
CONTINUE 
CONTINUE 
S = -2.D0 
D@ 40 L=1,NPL 
S = S + DABSCWWCL)) 
CONTINUE 
Wel) = WWCE) = 26D0 
D@ 50 L=1,NPL 
WVWCL) = WhCL)*DALFA 
CONTINUE 
CALL MLTPLYCWWs, Whs» NPL» We) 
DE 60 L=l»sNPL 
WWCh > “wWwwWCL) 
we¢L) w2 (Ll) *P SDML 
CONTINUE 
WWC1) = WHC) + 2-D0 
DQG 70 L=lsNPL 
WecL) = W2CL)d + WWCL) 
CONTINUE 
CALL MLTPLYCW2s XXs NPL» Wh) 
D@ 80 L=1sNPL 
XXC(L) = WWCL) 
CONTINUE 
NET = JX 7 
IF (€DABSCS)-LTeEPSLN) RETURN 
CONTINUE 
RETURN 
END 


P 3- 
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Remark and Certification on Algorithm 446 

Ten Subroutines for the Manipulation of Chebyshev 
Series [C1] [R. Broucke, Comm. ACM 16 (Apr. 1973), 
254-265] 


Robert Piessens and Irene Mertens |Recd 11 Jan. 
1974] Applied Mathematics and Programming 
Division, University of Leuven, B-3030, Heverlee 
(Belgium) 


1. Two corrections are needed in the subroutine CHEBY: 
(i) The statement after statement 50 must be changed into: 
LM = MOD(L—1)*«(J—1), 2*N) + 1 


(ii) formulas (1) and (3) for the computation of Chebyshev series 
coefficients c, do not agree with the exact formulas given by Fox and 
Parker [1, p. 66]. Indeed the last coefficient must be halved. This 
can be accomplished in the routine by replacing the five statement 
before RETURN by 


DO: 100 J = 1, NF 
DO 90K = I, NPL 
X(K, J) = FAC#X(K, J) 
90 CONTINUE 
X(NPL, J) = 0.5 DO*X(NPL, J) 
100 CONTINUE 


2. Moreover, the number of cosine-evaluations in CHEBY 
can be reduced by a factor 4 if the DO-loop: 


DO 30K = 1, N2 


30 CONTINUE 
is replacéd by 


NN = (NPL+1)/2 
DO 30K = 1, NN 
FK = K — 1 
GC(K) = DCOS(FK*PEN) 
NPLK = NPL+1 — K 
GC(NPLK) = —GC(K) 
30 CONTINUE 
DO 35K =1,N 
NPLK = NPL + K 
GC(NPLK) = —GC(K+1) 
35 CONTINUE 
3. In subroutine MLTPLY, the DO-loop 


DO 10K = 1, NPL 


10 CONTINUE 


may be deleted. 
We have tested INVERT and BINOM by calculating 


[To(x) + aTi(x) I, 
and BINOM, XALFA2 and XALFA3 by calculating 


a qQ 1/2 
| (3 + 5) To(x) + -2aTi(x) + 2 ro | 


The results are compared with the exact Chebyshev series expansion 


io) 


(1+ax)7 = aS and s.(x) 


k=0 


where 


2 (l—a?)t — 1\k 
= = 1. 
ak aa a ) , ja| < 
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The rate of convergence of this series depends strongly on the 
value of a. For this reason, we have given a the values 0.1, 0.2, 
..., 0.9. 

We have noted that, especially in the case of slowly converging 
series, INVERT, XALFA2 and XALFA3 are more efficient than 
BINOM. Moreover, in order to have convergence, BINOM re- 
quires more accurate initial approximations than the other rou- 
tines. 


Reference 
1. Fox, L., and Parker, I.B. Chebyshev Polynomials in Numerical 
Analysis. Oxford University Press, London, 1968. 
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Algorithm 447 


Efficient Algorithms for Graph 
Manipulation [H] 


John Hopcroft and Robert Tarjan [Recd. 24 March 
1971 and 27 Sept. 1971] 
Cornell University, Ithaca, NY 14850 


Abstract: Efficient algorithms are presented for partitioning a 
graph into connected components, biconnected components and 
simple paths. The algorithm for partitioning of a graph into simple 
paths is iterative and each iteration produces a new path between 
two vertices already on paths. (The start vertex can be specified 
dynamically.) If V is the number of vertices and EZ is the number of 
edges, each algorithm requires time and space proportional to 
max (V, E) when executed on a random access computer. 

Key Words and Phrases: graphs, analysis of algorithms, graph 
manipulation 

CR Categories: 5.32 

Language: Algol 


Description 

Graphs arise in many different contexts where it is necessary 
to represent interrelations between data elements. Consequently 
algorithms are being developed to manipulate graphs and test them 
for various properties. Certain basic tasks are common to many 
of these algorithms. For example, in order to test a graph for 
planarity, one first decomposes the graph into biconnected com- 
ponents and tests each component separately. If one is using an 
algorithm |4] with asymptotic growth of V log(V) to test for 
planarity, it is imperative that one use an algorithm for partition- 
ing the graph whose asymptotic growth is linear with the number 
of edges rather than quadratic in the number of vertices. In fact, 
representing a graph by a connection matrix in the above case 
would result in spending more time in constructing the matrix 
than in testing the graph for planarity if it were represented by a 
list of edges. It is with this in mind that we present a structure for 
representing graphs in a computer and several algorithms for simple 
operations on the graph. These include dividing a graph into con- 
nected components, dividing a graph into biconnected compo- 
nents, and partitioning a graph into simple paths. The algorithm 
for division into connected components is well known [7]. The 
description of an algorithm similar to the biconnected components 
algorithm has just appeared [6]. For a graph with V vertices and E 
edges, each algorithm requires time and space proportional to 
max(V, £). 

Standard graph terminology will be used throughout this dis- 
cussion. See for instance [2]. We assume that the graph is initially 


This research was carried out while the authors were at Stan- 
ford University and was supported by the Hertz Foundation and 
by the Office of Naval Research under grant number N-00014-67- 
A-0112-0057 NR-44-402. 
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Fig. 1. Flowchart for connected components algorithm. 


STARTPOINT 1 


Empty Stack 


Number startpoint, put on stack. 


N 
7 Edge out of top point on stack? 


Yes 


Delete edge from graph, 


add to current connected component. 


N 
= Head of edge new point? 


Number new point, put on stack. 


One point in stack? No 


Yes 


Is there an unnumbered point? No 


Remove top point 
from stack. 


8 


(«) et it be new startpoint 


given as a list of pairs of vertices, each pair representing an edge 
of the graph. The order of the vertices is unimportant; that is, the 
graph is unordered. Labels may be attached to some or all of the 
vertices and edges. 

Our model is that of a random-access computer with standard 
operations; accessing a number in storage requires unit time. We 
allow storage of numbers no larger than k max(V, E) where k 1s 
some constant. (If the labels are large data items, we assume that 
they are numbered with small integer codes and referred to by their 
codes; there are no more than k max(V, E) labels.) It is easy to 
see and may be proved rigorously that most interesting graph pro- 
cedures require time at least proportional to E when implemented 
on any reasonable model of a computer, if the input is a list of 
edges. This follows the fact that each edge must be examined once. 

It is very important to have an appropriate computer repre- 
sentation for graphs. Many researchers have described algorithms 
which use the matrix representation of a graph [|]. The time and 
space bounds for such algorithms generally are at least V? [3] 
which is not as small as possible if E ts small. (In planar graphs for 
instance, E < 3V — 3.) We use a list structure representation of a 
graph. For each vertex, a list of vertices to which it is adjacent is 
made. Note that two entries occur for each edge, one for each of 
its end points. A cross-link between these two entries is often useful. 
Note also that a directed graph may be represented in this fashion; 
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if vertex v. is on the list of vertices adjacent to v, , then (¥ , v2) is 
a directed edge of the graph. Vertex »; is called the sail, and vertex 
v» 1s Called the head of the edge. 

A directed representation of an undirected graph is a repre- 
sentation of this form in which each edge appears only once; the 
edges are directed according to some criterion such as the direction 
in which they are transversed during a search. Some version of this 
structure representation is used in all the algorithms. 

One technique has proved to be of great value. That is the 
notion of search, moving from vertex to adjacent vertex in the 
graph in such a way that all the edges are covered. In particular 
depth-first search is the basis of all the algorithms presented here. 
In this pattern of search, each time an edge to a new vertex is dis- 
covered, the search is continued from the new vertex and is not 
renewed at the old vertex until all edges from the new vertex are 
exhausted. The search process provides an orientation for each 
edge, in addition to generating information used in the particular 
algorithms. 


Detailed Description of the Algorithms 

Algorithm for finding the connected components of a graph. This 
algorithm finds the connected components of a graph by perform- 
ing depth-first search on each connected component. Each new 
vertex reached is marked. When no more vertices can be reached 
along edges from marked vertices, a connected component has been 
found. An unmarked vertex is then selected, and the process is 
repeated until the entire graph is explored. 

The details of the algorithm appear in the flowchart (Figure 1). 
Since the algorithm is well known, and since it forms a part of the 
algorithm for finding biconnected components, we omit proofs of 
its correctness and time bound. These proofs may be found as 
part of the proofs for the biconnected components algorithm. The 
algorithm requires space proportional to max(V, E) and time pro- 
portional to max(V, E), where V is the number of vertices and E 
1s the number of edges of the graph. 

Algorithm for finding the biconnected components of a graph. 
This algorithm breaks a graph into its biconnected components by 
performing a depth-first search along the edges of the graph. Each 
new point reached is placed on a stack, and for each point a record 
is kept of the lowest point on the stack to which it is connected 
by a path of unstacked points. When a new point cannot be reached 
from the top of the stack, the top point is deleted, and the search 
is continued from the next point on the stack. If the top point 
does not connect to a point lower than the second point on the 
stack, then this second point is an articulation point of the graph. 
All edges examined during the search are placed on another stack, 
so that when an articulation point is found the edges of the cor- 
responding biconnected component may be retrieved and placed 
in an output array. 

When the stack is exhausted, a complete search of a connected 
component has been performed. If the graph is connected, the 
process is complete. Otherwise, an unreached node is selected as a 
new starting point and the process repeated until all of the graph 
has been exhausted. Isolated points are not listed as biconnected 
components, since they have no adjacent edges. They are merely 
skipped. The details of the algorithm are given in the flowchart 
(Figure 2). Note that this flowchart gives a nondeterministic 
algorithm, since any new edge may be selected in block A. The 
actual program is deterministic: the choice of an edge depends on 
the particular representation of the graph. 

We will prove that the nondeterministic algorithm terminates 
on all simple graphs without loops, and we also derive a bound on 
the execution time. We will then prove the correctness of the algo- 
cithm, by induction on the number of edges in the graph. Note 
that the algorithm requires storage space proportional to max(V, E), 
where V is the number of vertices and E is the number of edges 
of the graph. 

Let us consider applying the algorithm to a graph. Referring 
to the flowchart, every passage through the YES branch of block A 
causes an edge to be deleted from the graph. Each passage through 
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the NO branch of block B causes a point to be deleted from the 
stack. Once a point is deleted from the stack it is never added to the 
stack again, since all adjacent edges have been examined. Each 
edge is deleted from the stack of edges once in block C. Thus the 
blocks directly below the YES branch of block A are executed at 
most E times, those below the NO branch of block B at most V 
times, and the total time spent in block C is proportional to E. 
Therefore there is some & such that for all graphs the algorithm 
takes no more than k max(V, E) steps. A more explicit time bound 
may be calculated by referring to the program. 

Suppose the graph G contains no edges. By examining the 
flowchart we see that the algorithm, when applied to G, will termi- 
nate after examining each point once and listing no components. 
Thus the algorithm operates correctly in this case. Suppose the 
algorithm works correcly on all graphs with E-1 or fewer edges. 
Consider applying the algorithm to a graph G with E edges. Since 
the stack of points becomes empty at least once during the opera- 
tion of the algorithm, and since the YES branch at block D must 
be taken when only two points are on. the stack, every edge must 
not only be placed on the stack of edges but must be removed in 
block C. Consider the first time block C is reached when the algo- 
rithm is applied to graph G. Suppose not all the edges in the graph 
are removed from the stack of edges in this execution of block C. 
Then p, the second point on the stack, is an articulation point 
and separates the removed edges from the other edges in the graph. 

Let FE, be the set of removed edges, let E, be the set of edges 
still on the stack, and let E; be the set of remaining edges of G. 
Let G; be the subgraph of G made up of the edges from E, , and 
let G. = G — G,. Since G; and G, each have at most E-1 edges, 
the induction hypothesis implies that the algorithm operates cor- 
rectly on both G; and G;. 

Assume that the edges for each vertex in G, and Gz are listed 
in the same order as for G. Consider the sequence of steps taken 
when the algorithm is applied to G. The sequence of steps taken on 
G, can be divided into an initial sequence of steps which results 
in placing the edges E; on the stack, followed by the remaining 
sequence S;:. The sequence of steps taken on G consists of the 
sequence S, , followed by the steps taken on G, with p as the start 
point, followed by S2. 

The behavior of the algorithm on G is simply the composite 
of its behavior on G; and G;; thus the algorithm must operate 
correctly on G. 

Now suppose that the first time block C is reached, all the 
edges of G are removed from the stack of edges. We want to show 
that in this case G is biconnected. Suppose that G is not bicon- 
nected. Then choose a biconnected cornponent of G which may be 
separated by removing some one point p and which does not con- 
tain the start point of G. Let the edges making up this component 
be subgraph G; of G; let the remainder of G be G, . The algorithm 
operates correctly on G; and on G;, by assumption. The behavior of 
the algorithm on G is a composite of its behavior on G; and on G;, 
Assume that the edges for each vertex in G, and G, are listed in the 
same order as for G. The sequence of steps on G is identical to the 
sequence of steps on G; until an edge of G2 out of vertex p is se- 
lected. Then the sequence of steps of G is identical to the sequence 
on G, with start point p. The remaining steps on G are the same 
as the remaining steps on G;. But the algorithm reaches block C 
once while processing G; and at least once while processing G,. 
This contradicts the fact that the algorithm only reaches block C 
once while processing G. Thus G must be biconnected. and the 
algorithm operates correctly on G. By induction, the algorithm is 
correct for all simple graphs without loops. 

Algorithm for finding simple paths in a graph. This algorithm 
may be used to partition a graph into simple paths, such that all 
the paths exhaust the edges of the graph. Each iteration of the algo- 
rithm produces a new path which contains no vertex twice, and 
which connects the chosen startpoint with some other vertex which 
already occurs in a path. Total running time is proportional to the 
number of edges in the graph. The starting point for each successive 
path may be selected arbitrarily. In fact, the initial edge of each 


COLLECTED ALGORITHMS (cont.) 


Fig. 2. Flowchart for biconnected components algorithm. 
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successive path may be selected arbitrarily from the set of unused 
edges. 

The algorithm is highly dependent on the graph being bicon- 
nected. (The biconnected components of a graph are found using 
the previously described algorithm.) In order to find a new path, 
the initial edge is selected and the head of the edge is checked. 
If this point has never been reached before, a depth-first search is 
begun which must end in a path since the graph is biconnected. 
The search generates a tree-like structure: specifically, it is a tree 
with edges connecting some vertices with their (not necessarily im- 
mediate) ancestors. (We will visualize the tree drawn so that the 
root, which is an ancestor of all points, is at the bottom of the tree.) 
Enough information is saved from this tree so that if a point in it 
is reached when building another path, the path may be completed 
without any further search. 

The flowchart (Figures 3 and 4) gives the details of the algo- 
rithm, It is divided into two parts; one for the depth-first search 
process and one for path construction using previously gathered 
information. We shall prove the correctness of the algorithm and 
give a time bound for its operation. To derive the time bound, we 
assume that one point is marked old initially, and a different point 
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Fig. 3. Flowchart for pathfinding algorithm (1). 


Unused edre from startpoint? Step 
Yes (No path exists) 


Put edge in pathstack. 


Let point be head of edge. 


Yes ; 
(:) ss Point reached previously? 


No v) 
Number point 


Unsearched edge from point? 


No Yes 


Set backward edge Mark edge searched. 

of point to edge 

on pathstack. Set 

pastpoint to tail Head of edge unreached? ) 
of edge. If LOW- 

POINT of point less' 

than LOWPOINT of No Yes 
pastpoint, modify 
LOWPOINT and for- 
ward edge of past- 
point to indicate 
edge to point. 


Put edge 
on path- 
stack. Set 
point to 
head of 
edge. 


Head edge 
old and not 
startpoint? 


Yes 
If number of head of 
Set point to past- edge less than LOW~ 
point. Delete POINT of point, mod- 
edge from -path- {fv LOWPOINT and for- 
stack. ward edge of point 


to indicate edge. 


Mark points in path 0 


Mark edges in path used. 


Fig. 4. Flowchart for pathfinding algorithm (II). 
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is selected as the initial startpoint. The algorithm is then run re- 
peatedly with arbitrary startpoints until all edges are used to form 
paths. 

Let us consider path generation using depth-first search; that is, 
suppose the algorithm is applied and that the head of the first edge 
selected is previously unreached. Referring to the flowchart, we see 
that the search process is very similar to that used in the bicon- 
nectivity algorithm. A search tree is generated, and each edge 
examined is either part of the tree or connects a point to one of its 
predecessors in the tree. LOW POINT is exactly the same as in the 
biconnectivity algorithm; it gives the number of the lowest point 
in the tree reachable from a given point by continuing out along the 
tree and taking one edge back toward the root. The forward edges 
point along this path, while the backward edges point back along 
the tree branches. We have shown in the correctness proof of the 
biconnectivity algorithm that, if the graph is biconnected, LOW- 
POINT of a given point must point to a node which is an ancestor 
of the immediate predecessor of the given point. In particular, 
LOW POINT of the second point in the search tree must indicate 
an old point which is not the startpoint. Therefore the algorithm 
will find a path containing the initial edge. Note that all points 
encountered during the search process must either be old or un- 
reached, since every point reached in a previous search either has 
had all its edges examined or has been included in a path. 

Let us now suppose that the head of the first edge has been 
reached previously but is not marked old. Then the forward and 
backward pointers, along with the LOW POINT values, allow the 
algorithm to construct a path without further search. First, if the 
number of the head of the edge is less than the number of the 
startpoint, then following backward pointers will certainly produce 
a simple path, since the root of a search tree must be old and each 
successive point along a backward path has a lower number and 
thus is distinct from the other points in the path. If the initial edge 
is part of a search tree and the startpoint is the predecessor of the 
second point, then LOW POINT of the second point must be less 
than the number of the startpoint. Following forward edges until 
reaching a point numbered lower than the startpoint and then fol- 
lowing backward edges will produce a simple path. This is true 
since the forward edges point through descendants of the tree, 
with the single exception of the edge whose head is a point below 
startpoint in the tree. The last case to consider occurs when the 
initial edge is not part of a search tree but points from a node to 
one of its descendants in a tree. In this case some node in the tree 
between the startpoint and the second point of the path must have 
a LOW POINT value less than the number of the startpoint. If we 
follow backward edges until the first such point is reached, then 
follow forward edges until a point numbered less than the startpoint 
is reached, and finally follow backward edges until an old point is 
reached, we will generate a simple path. Note that the first forward 
edge taken cannot lead to the previous point because, if it did, the 
LOW POINT value at the previous point would b2 less than the 
number of startpoint, and the forward edge from this point would 
have been chosen instead of the backward edge. 

We thus see that each execution of the pathfinding algorithm 
produces a simple path, assuming that the algorithm is applied to 
a biconnected graph with at least one point which is not the first 
startpoint. marked old initially. Since each edge is examined at 
most once in the search section of the algorithm, and since each 
edge is put into a path once, there is a constant k such that the time 
required to execute the algorithm until no edges are unused is less 
than AE steps, where E£ is the number of edges in the graph. (Note 
that the number of vertices, V, is less than E if the graph is bicon- 
nected.) Detailed examination of the program will produce a more 
exact time bound. 

Another algorithm for finding simple paths exists. Lempel, 
Even, and Cederbaum [5] have described an algorithm for number- 
ing the vertices of a biconnected graph such that: (i) each number 
is an integer in the range 1 to V, where V is the number of vertices 
on the graph; (ii) vertices 1 and V are jointed by an edge; (iii) for 
all 1 < i < V, vertex i is joined to at least two vertices, one with a 
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higher number and one with a lower number. We may use this algo- 
rithm to partition a graph into simple paths. 

Given a start point and an adjacent end point, number the 
vertices so that the startpoint is 1, the endpoint is V, and the 
numbering satisfies the conditions above. Take edge (1, V) as the 
first path. Given an arbitrary startpoint, find an edge to a higher 
numbered vertex. Continue to find edges to successively higher 
numbered vertices until an old vertex is reached. 

This algorithm is clearly correct and looks conceptually simple. 
However, Lempel, Even, and Cederbaum present no efficient im- 
plementation of their numbering algorithm, and the only efficient 
way we have found to implement it requires using the previously 
described pathfinding algorithm in a more complicated form. Thus 
the new algorithm requires time and space proportional to 
max(V, £), but the constants of proportionality are larger than 
those for the implemented algorithm. 

Implementation. The algorithms for finding connected com- 
ponents, biconnected components, and simple paths were originally 
implemented and tested in Algol W. The programs were then 
translated to Algol for publication and tested using the OS/360 
Algol compiler. Auxiliary subroutines were also implemented. Brief 
descriptions of the procedures are provided below. 

ADD2(A, B, STACK, PTR): This procedure adds value A 
followed by value B to the top of stack STACK and increments 
the pointer to the top of the stack (PTR). Stacks are represented as 
arrays; the top of the stack is the highest filled location. 

NEXTLINK(POINT,VALUE): This procedure is used to 
build the structural representation of a graph. It adds VALUE 
to the list of vertices adjacent to POINT. (POINT, VALUE) is an 
edge (possibly directed) of the graph. 

CONNECT(V, E, EPTR, EDGELIST, COMPONENTS): This 
procedure, given a graph with V vertices and E edges, whose edges 
are listed in EDGELIST, computes the connected components of 
the graph and places the edges of the components in COMPO- 
NENTS. Each component is preceded by an entry containing the 
number of edges E’ of the component. The edges are oriented for 
output according to the direction in which they were searched 
(head first, tail second). 

BICONNECT(V,E,E PTR,EDGELIST,COM PONENTS): This 
procedure, given a graph with V vertices and E edges, whose 
edges are listed in EDGELIST, computes the biconnected com- 
ponents of the graph and places them in BICOMPONENTS. Each 
component is preceded by an entry containing the number of 
edges E of the component. The edges are oriented for output 
according to the direction in which they were searched (head first, 
tail second). 

PATHFINDER(STARTPT, PATH PT,CODEVALUE, PATH): 
This procedure, given a list structure representation of a bicon- 
nected graph with certain vertices marked as old, constructs a 
simple path from STARTPOINT to some old vertex, saving 
information to be used in constructing succeeding paths. The new 
path is stored in array PATH. Calling PATHFINDER repeatedly 
may be used to partition the graph into simple paths. 

The procedure PATHFINDER requires that the structural 
representation of the graph be stored as follows. Each edge is 
treated as a pair of directed edges each of which is represented 
by an integer between vy + landyv+ 2 Xe. Ifi,,i2,..., % are the 
integers corresponding to the edges out of vertex /, then initialize 
NEXT(i) to ii, NEXT(i;) to ij4n, 1 <j < k, and NEXT(k) to 0. If 
the edge i; terminates at vertex /, initialize HEAD(i;) to 1. LINK(i;) 
is the integer corresponding to the edge in the other direction. For 
1<i< v, BACK(i), FORWARD(i), PATHOCDE(}) are initialized 
to 0, LOW POINT(i) is initialized to v +- 1, NODE(i) is initialized to 
NEXT(i) and OLD(i) is initialized to FALSE. Foryv +1 <i< 
y+ 2 X e MARK(i) is initialized to FALSE. Before the first call 
of PATHFINDER some nonnull set of vertices must be marked 
as OLD and assigned successive PATHCODE values. CODE- 
VALUE is set equal to the number of vertices marked as OLD. 
If this is not done the first path cannot end at an OLD vertex. 

Further comments may be found in.the program listings below. 
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Algorithm ‘ 

procedure add2 (a, b, stack, ptr); 
value a, b; integer a, 6, ptr; integer array stack; 

comment Procedure adds values a and b to stack stack and in- 
creases stack pointer prr by 2; 

begin 
ptr := ptr + 2; stack[ptr — 1] := a; stack[ptr] := b 

end of add2; 

procedure. nextlink (point, val); 
value point, val; integer point, val; 

comment Procedure adds directed edge (point, val) to structural 
representation of a graph. Global variables are described as fol- 
lows. head|v+1:v+-2 Xe] and next[1:v-+-2 Xe] contain the struc- 

_ tural representation of the graph. freenext is the current last 
entry in next array; 


begin 
freenext := freenext + 1; next[freenext] := next( point]; 
next{point| := freenext; head|freenext] := val 


end of nextlink; 

integer procedure min(a, b); 
value a, b; infeger a, b; 

comment Procedure computes the minimum of two integers; 

if a < b then min := aelse min := b; 

procedure connect (v, e, cptr, edgelist, components), 
value v, e; integer v, e, cptr; 
integer array edgelist, components; 

comment Procedure finds the connected components of a graph. 
The parameters are described as follows. v and e are the number 
of vertices and edges of the graph. edgelist(1:2e] is the initial 
list of edges of the graph. components|1:3 Xe} is the list of edges 
for each component. The list of edges for each component is pre- 
ceded by an entry giving the number of edges of the compo- 
nent. cptr is a pointer to the last entry in components. The global 
variables are described as follows. head[v+1:v+2xXe] and 
next{1:v-+-2Xe] contain the structural representation of the 
graph. freenext is the last entry in the array next. The local 
variables are described as follows. number[l:v+1] is used for 
numbering the vertices during the depth first search. code con- 
tains the current highest vertex number. point is the current 
vertex being examined during the search. v2 is the next vertex 
to be examined during the search. o/dptr contains the position 
in components to place the value of the next component. The 
global procedures are add2 and nextlink. A. recursive depth- 
first search procedure is used to examine connected components 
of the graph; 

begin 
integer array number [1:v-+1]; 
integer code, point, v2, oldptr, i; 
procedure connector (point, oldpt); 

value point, oldpt; integer point, oldpt; 
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comment This recursive procedure finds a connected component 
using a depth-first search. The parameters are described as fol- 
lows. point is the startpoint of search. oldpt is the previous 
startpoint. Global variables are the same as for connect. The 
global procedures are add2; 
comment Examine each edge out of point; 
for i = i while nex1| point] > Odo 
begin 
comment v2 is head of edge. Delete edge from structural repre- 
sentation; 
v2 := head{next[poinr]); 
next(point| := next(next{point]]; 
comment Has this edge been searched in the other direction? 
If so, look for another edge; 
if (number[v2] < number[point|) /\ (v2~oldpt) then 
begin 
comment Add edge to components; 
add2( point, v2, components, cptr); 
comment Determine if a new point has been found; 
if number[v2] = 0 then 


. begin 
comment New point found. Number it; 
number\v2| := code := code + 1; 


comment Initiate a depth-first search from the new point; 
connector(v2, point) 
end 
end 
end; 
comment Construct the structural representation of the graph; 
freenext := v; 


for i := 1 step 1 until v do next] := 0; 
for i := 1 step 1 until e do 
begin 


comment Each edge occurs twice, once for each endpoint; 
nextlink (edgelist(2 Xi—1], edgelist|2 Xi)); 
nextlink (edgelist(2 Xi], edgelist(2xXi—1]) 
end; 
comment Initialize variables for search; 
cptr := 0; point := 1; 
for i := 1 step 1 until v + 1 do numberi| := 0; 
for ¢ := i while point < vdo 
begin 
comment Each execution of connector searches a connected 
component. After each search, find an unnumbered vertex 
and search again. Repeat until all vertices are investigated; 
number|point| := code := 1; 
oldptr := cptr:= cptr +1; 
connector(point,O); 
comment Compute number of edges of components; 
components\oldptr| := (cptr-oldptr) ~ 2; 
for i := i while number|[point| ~ 0 do point := point + | 
end 
end; 
procedure biconnect(y, e, bpir, edgelist, bicomponents) ; 
value v, e; integer v, e, bptr; 
integer array edgelist, bicomponents; 
begin 
comment Procedure finds biconnected components of a graph. 
The parameters are described as follows. v and e are the num- 
ber of vertices and edges of the graph. edgelist({1:2 Xe] is the 
initial list of edges of the graph. bicomponents|1:3 Xe] is the list 
of edges for each component found. Each component is pre- 
ceded by an entry giving the number of edges of the com- 
ponent. bprr is a pointer to the last entry of dicomponents. The 
global variables are described as follows. head|v-+1:v+2Xel 
and next|1:v-+2 Xe] contain the structural representation of the 
graph. freenext is the last entry in the array next. The local 
variables are described as follows. number{1:v+1] is an array 
used for numbering the vertices during the depth-first search. 
code is the current highest vertex number. edgestack|1:2 Xe| 
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is used for storage of edges examined during search. eprr is 
a pointer to last entry in edgestack. point is the current point 
being examined during search. v2 is the next point to be ex- 
amined during search. newlowpt is the lowpoint for the bi- 
connected part of graph above and including v2. o/dprr is 
pointer to position in bicomponents to place a value of next 
component. The global procedures are min, add2, and next- 
link. A recursive depth-first search procedure is used to divide 
the graph. The lowest point reachable from the current point 
without going through previously searched points is calculated. 
This information allows determination of the articulation 
points and division of the graph; 

integer array number|1:v+1], edgestack[1:2 Xe]; 

integer code, eptr, point, v2, newlowpt, oldptr, i; 

procedure biconnector (point, oldpt, lowpoint); 
integer point, oldpt, lowpoint; 

comment Recursive procedure to search a connected component 
and find its biconnected components using depth-first search. 
The parameters are described as follows. point is the startpoint 
of the search. o/dpt is the previous startpoint. /owpoint is the 
lowest point reachable on a path found during search. The 
global variables are the same as for biconnect. The global 
procedures are min and add2; 

comment Examine each edge out of point; 

for i := i while next|point] > 0 do 

begin 
comment v2 is the head of the edge. Delete edge from structural 

representation; 


integer v2; 
v2 := head|next{point]]; 
next(point] := next(next(point)]; 


comment If the edge has been searched in the other direction, 
then look for another edge; 
if (number(v2] <number[point|) /A\ (v2oldpt) then 
begin 
comment Add edge to edgestack; 
add2 (point, v2, edgestack, eptr); 
if number[v2| = O then 
begin 
comment New point found. Number it; 
number|v2| := code := code + 1; 
comment Initiate a depth-first search from the new point; 
newlowpt := v + 1; 
biconnector (v2, point, newlowpt); 
comment Note that although the global variable v2 is 
changed, its value is restored upon exit from this pro- 
cedure. Recalculate /owpoint; 
lowpoint := min(lowpoint, newlowpt); 
if newlowpt > number|point] then 
begin 
comment point is an articulation point. Output edges 
of component from edgestack; 
oldptr := bptr := bptr + 1; 
for i := i while number[edgestack{eptr—1]] > 
number| point] do 
begin 
add2 (edgestack|eptr— 1), 
ponents, bptr); 
eptr := eptr — 2 
end; 
comment Add last edge; 
add2( point, v2, bicomponents, bptr); 
eptr := eptr — 2; 
comment Compute number of edges of component; 
bicomponents{oldptr| := (bptr-oldptr) + 2 
end 
end 
else 
begin 


edgestack(eptr), bicom- 
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comment New point not found. Recalculate /owpoint; 
lowpoint := min(lowpoint, number|v2}) 


end 
end 
end; 
comment Construct: the structural representation of the graph; 
Sfreenext := Vv; 


for i := 1 step 1 until v do next [i] :== 0; 
for i := 1 step 1 until e do 
begin 
comment Each edge occurs twice, once for each endpoint; 
nextlink (edgelist{2 xi—1], edgelist(2xi)); 
nextlink(edgelist(2 Xi], edgelist(2xi—1]) 
end; 
comment Initialize variables for search; 
eptr := 0; bptr := O; point := 1; v2 := 0; 
for i := 1 step 1 until vy + 1 do number|i| := 0; 
for i := iwhile point < vdo 
begin 
comment Each execution of biconnector searches a connected 
component of the graph. After each search, find an unnum- 
bered vertex and search again. Repeat until all vertices are 
examined; 
aumber([point| := code := 1; newlowpt := v + 1; 
biconnector(point, v2, newlowpt); 
for i := i while number|point] = ~ 0 do point := point + 1 
end 
end; 
procedure pathfinder (start point, pathpt, codevalue, path); 
integer startpoint, pathpt, codevalue; 
integer array path; 
begin 
comment Procedure finds disjoint paths with arbitrary starting 
points in a biconnected graph. The points of each path are 
listed in the array path. The following variables are assumed 
global. next{l:v+2Xe],  head|v+1:v+2xXe| and - link 
[v+1:.+2xXe| define the graph using singly linked edge 
lists and a set of cross reference pointers. o/d|1:v] and mark 
[v+1:v+2 xe] indicate used points and edges. pathcode[1:v] 
is the consecutive numbering of the points. /owpoint(1:v], 
forward|1:v] and back|1:v] give information saved from depth- 
first search, node[1:v] gives the next unsearched edge from each 
point; 
integer point, pastedge, edge, pastpoint, v2, i; 
path{\] := startpoint; 
comment Choose initial edge; 
edge := next(start point]; 
for i := i while (if edge=O then false else markledge}) 
do edge := nextledge]|: 
begin 
comment No unused edge and thus no path exists: 
next(start point] := 0; pathpt := 0: 
go to done 
end; 
next(startpoint| := nextledge}|; path{2] := edge; 
point := head\edge); pathpt := 2; 
if old{point] then go to pathfound; 
if forward|point| 0 then 
begin 
comment Use previously found information to build a path. 
forward, back, lowpoint describe trees investigated using 
depth-first search; 
if pathcode|startpoint] > pathcode|point| then 
go to nextback; 
nextmark: 
if pathcode[start point] > lowpoint| point] then 
begin 
nextforward: 
edge := forward(point|; point :== head[edge]; 
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pathpt := pathpt +1; path|pathpt]: = edge; 
if old|point] then go to pathfound; 
if pathcode(startpoint| > pathcode[point] 
then go to nextback; 
go to nextforward 
end; 
edge :== back[point]; point := headledge}; 
pathpt := pathpt + 1; path[pathpt] := edge; 
if old{ point] then go to pathfound else 
go to nextmark; 
nextback: 
edge := back|point); point := head\edge]; 
pathpt := pathpt + 1; path|pathpt] := edge; 
if o/d[point] then go to pathfound else 
go to nextback 
end; 
comment Use depth-first search to find a path. Save information 
describing search tree; 
nextpoint: 
codevalue := codevalue + 1; pathcode[point] := codevalue; 
nextedge: 
edge := node|point]; 
for i := iwhile edge = Odo 
begin 
back{[point] := link[path|pathpt]}, 
pastpoint := head|back(point]]; 
if (forward|pastpoint] = 0) VV 
(lowpoint| point} < lowpoint{pastpoint]) then 
begin 
forward|pastpoint| := path|pathpt); 
lowpoint|pastpoint| := lowpoint|point] 
end; 
point := pastpoint; pathpt := pathpt — 1; edge := node[point]| 
end; 
node| point] := nextledge]; v2 := headledge}; 
if pathcode|v2| = O then 
begin 
point := v2; pathpt := pathpt + 1; 
path| pathpt| := edge; go to nextpoint 
end; 
if old{v2] A (v2#startpoint) then 
begin 
pathpt := pathpt + 1; path{pathpt] := edge; 
go to pathfound 


end; 
if (forward|point|=0) \V (pathcode[v2] < lowpoint|point|) then 
begin 
forward point] := edge; lowpoint{point| := pathcode\v2] 
end; 


go to nextedge; 
comment Path found. Convert stack of edges to list of points in 
path. Mark all edges and points in path; 


pathfound: 
for i := 2 step 1 until pathpt do 
begin 


edge :== path {i]; point := head|edge); 
forward|point] := back{point]: = 0; old{point] := true; 
mark {link[edge|| := mark [edge] := true; 
path {i\ := point 
end; 
done: 
end 
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Number of Multiply- 
Restricted Partitions [A1] 


Terry Beyer* and D.F. Swinehart{ (Recd. | Jan. 1971 
and 28 June 1971) 

* Computer Science Department and Computing Cen- 
ter, University of Oregon, Eugene, OR 97403. 

| Department of Chemistry, University of Oregon, 
Eugene, OR 97403. 


Key Words and Phrases: partitions, enumeration, change 
making, energy-level degeneracies, molecular vibrational 
energy-levels 

CR Categories: 3.13, 5.30 

Language: Fortran 


Description 
Given a positive integer m and an ordered k-tuple c = (a1, 
-, Cz) of not necessarily distinct positive integers, then any 
ordered k-tuple s = (s,, --- , s;) of nonnegative integers such that 
m= a 5;C; 1S said to be a partition of m restricted to c. Let P.(m) 
denote the number of distinct partitions of m restricted to c. The 
subroutine COUNT, when given a k-tuple c and an integer 1, com- 
putes an array of the values of P.(m) for m = 1 to n. Many com- 
binatorial enumeration problems may be expressed in terms of the 
numbers P.(m). We mention two below. 

Applications: Change making. Letting c = (1,5,10) and = 100, 
the subroutine computes the number of ways of making each 
amount of change from one cent to one dollar using pennies, 
nickels, and dimes. Letting c = (1,5,5,10) corresponds to using two 
distinct types of nickels. 

Applications: Chemistry. This algorithm is of some importance 
to problems in chemistry. In the theory of unimolecular reactions 
[2,6] a quantity appears, eek P(e,), in which P(e,) is the number 
of ways a given amount of vibrational energy, e,., may be distributed 
among the quantized vibrational modes of a polyatomic molecule, 
assuming all of these modes to be harmonic. Setting mm = e, and 
c = (¢1, --*, Ce), where c; is the energy corresponding to the 
fundamental frequency of the ith vibrational mode, then s; becomes 
the corresponding vibrational quantum number and we have 
P(e.) = P.(m). The desired quantity a P(e.) may thus be 
readily obtained from the output of the subroutine COUNT. 
No algorithm previously available has been sufficiently efficient 
for calculating these sums directly. Various functions have been 
proposed as approximations for this calculation [5]. The present 
algorithm allows calculation of Da P(e.) directly and efficiently. 

Method. Input to COUNT is a positive integer N and an integer 
array C containing K entries. Output is the array P containing NV 
Integers where P(M) is the number of partitions of M restricted to 
C for M = 1toN. The following assumptions are made concerning 
the input; (1) K is positive: (2) C contains positive integers only; 
and (3) N is greater than the maximum value in C. Restriction 3 
is not inherent in the problem but is a restriction required by 
COUNT. The algorithm operates by initializing P to contain the 
number of partitions of an integer restricted to an empty sequence. 
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Each pass through the outer loop which follows, updates P to 
reflect an additional element of C by using the recursive relations 


Preyr--se¢4)M) ifm <c¢;, 
Pr, ponds c,)(m) = Prey cokes e¢-1)(M) + i if m = C; ; 
Pr eys-esogq)M) Hb Preps c,(m —c) ifm>c;. 


These equations are derived by counting additional partitions of m 
obtained by using c; . Thus if m < c;, no additional partitions are 
obtained. If m = c; , the single additional partition consisting of c; is 
obtained. If m > c;, then any partition of m involving c; comes 
from a partition of (m — c:) which involves one less occurrence of 
c;. Readers may wish to refer to [3 and 4] which contain recurrence 
algorithms for more classical forms of the partition enumeration 
problem of which the problem presented here is a generalization. 

Scaling. The time required by the algorithm is roughly propor- 
tional to A X n. If the integers c; , «++ , c, have a common divisor d, 
the results may be obtained approximately d times as quickly by 
making use of the relations 


_ fo if dm 
= \ Pera(m/d) if d | m 
where c/d = (c:/d, --+ , cx/d). The computation of P.a(m/d) for 
m/d = 1 to n/d will require time proportional to k < (n/d) and 
an array of dimension n/d rather than n. COUNT does not auto- 
matically perform this scaling. 

Accuracy. The algorithm itself is precise. However in typical 
applications to chemistry the numbers P(M) generated may exceed 
the magnitude limitation for Fortran integers. In this case one may 
simulate multiple precision integer arithmetic and continue to 
obtain precise results, or one may switch to floating point. In the 
latter case, roundoff errors will be introduced into the calculation. 
The authors have not investigated the accumulation of roundoff 
errors under these conditions. 

Test cases. The subroutine COUNT has been tested on the 
following compiler/computer combinations. 


IBM FORTRAN IV(G) IBM S/360 (Mod. 50) 
University of Waterloo WATFOR IBM S/360 (Mod. 50) 


Results for several change counting problems were compared with 
results from hand calculations. Results. for the special case of un- 
restricted partitions were compared to published table values [1]. 
Acknowledgment. The authors wish to thank the University of 
Oregon Computing Center for supporting the testing of this work. 
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Algorithm 


SUBROUTINE COUNT(Cs Ks Ps N) 
INTEGER Cs P 
DIMENSION CCK)» PCN) 


C COUNT COMPUTES THE NUMBER @F PAKTITIGNS OF AN INTEGER 
C RESTRICTED TO C FOR INTEGERS IN THE RANGE 1 TO Ne 
C INPUT: K -= A POSITIVE INTEGER 
Cc C -- AN ARRAY OF K POSITIVE INTEGERS 
Cc N -- AN INTEGER LARGER THAN THE MAXIMUM VALUE IN C 
C OUTPUT: P -= AN AKRAY @F N INTEGERS, WHEKE P(M) IS THE 
om NUMBER @F PARTITIONS OF M RESTRICTED TO C 
C INITIALIZE P 
D@ 10 I=1.N 
PCT) = 0 
10 CONTINUE 


EACH PASS THRK@UGH THE @UTER L@@P BELQ@W TRANSFORMS P FROM 
PARTITIONS RESTRICTED TO CCl)» eee » CCI-1) TO 
PARTITI@NS RESTRICTED TO CCl)» see » CCI)~ 
D@ 30 I=14K 
J 2 CCI) 
JPL = 5 + t 
PCJ) = PCJ) + 1 
DO 20 M=JPtsN 
MMJ = M- J 
PCM) = PCM) # PCMMJ) 
20 CONTINUE 
30 CONTINUE 
RETURN 
END 


aaa 
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Algorithm 449 
Solution of Linear 


Programming Problems in 0-1 
Variables [H] 


Frantisek Fiala [Recd. 5 Feb. 1971] 
Department of Computing Science, University of 
Alberta, Edmonton, Alberta, Canada* 


Key Words and Phrases: linear programming, zero-one variable 
CR Categories: 5.41 
Language: Fortran 


Description 

This subroutine solves the linear zero-one programming prob- 
lem of the following form. 

Find the maximum and all maximizing points of the objective 
function 


ft = aixXy +++ + ainXn + Gio (1) 
subject to 
Anx, + +++ + Ainkn > bi,i = 2,...,m, (2.i) 


where x; = 0 or 1; a:; , 6; are integer coefficients. 
The algorithm follows the procedure described in [1, 2]. 
First of all we add a supplementary constraint 


aux, +e) + aindtn > D1, (2.1) 


where 5; is equal either to the value of f — ayo for a solution to the 
system of constraints, or to a lower bound of f — aio. AS soon as 
we find a feasible solution to the system of constraints, we replace 
b; by the corresponding value of f — aio . Consequently, if a feasible 
solution is-found, then the following procedure can lead only to 
solutions with the same or better value of the objective function. 
Using the formula x = 1 — x, we bring (2.1) into the form 


‘ 4 ~ 
Q1j,3, + ++ + aij, X;,, 2 5’, 


> ai;, > 0, (3) 


(2'.1) 
with aij, Bee 


where x is either x or x. If there are coefficients with the same 
absolute value in (2.1), then their order in (2’.1) corresponds with 
that in (2.1). The order of coefficients in (2’.1) indicates the order of 
branching points. Coefficients in (2’.1) are used in the accelerating 
test. 

At every stage of the procedure we have a partial solution and 
the corresponding (current) problem derived from the original one. 
In the partial solution, some variables are assigned fixed values (0 
or 1) and the others remain free. The partial solution corresponding 
to the original problem has all variables free. A partial solution is 
completed if all variables are fixed. 

Given a partial solution we try to complete it. If there is a com- 
pletion, we change the supplementary constraint and backtrack. If 
there is no completion, we backtrack. In both cases we go back to 


* This research was done while the author was a post-doctorate 
fellow with the Department. Present address: Department of Mathe- 
matics, Carleton University, Colonel By Drive, Ottawa, Ontario, 
Canada. 
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the last branching point and examine the new partial solution with 
the complementary value for the branching variable. We use the 
accelerating test if applicable. As a result we find either all maxi- 
mizing points and maximum of f or the problem has no solutions. 

Accelerating test. Suppose that at a certain step we have a 
partial solution with the fixed variables x;,, ACHC{1,..., n}, 
and we have to branch. We take the first variable <,, still occurring 
in (2’.1)—branching variable—and put first X;, = 1 and then X,;, = 
0. We examine the new partial solution with X;, = 1. If there is a 
feasible completion of the partial solution and if 
arjp > Dari,» (4) 

keK 
where K is the set of all indices ke{1, ..., } — HA such that 
X;, = 0 in the completion, then the branch with x;, = 0 can be 
dropped out. 

The subroutine MAXL01 is self-contained, and communication 
to it is through the argument list. The calling statement is of the 
following form 
CALL MAXLO1 (MO, NO, NEST, M,N, AO, BO, A, B, B1, S1, C, 
X, S, BC, T, IND, INC, NESTEX, V, NOPT, OPTS, NI, NAT) 


The meaning of the parameters is described in the comments at the 
beginning of the subroutine. Here the meaning of only two output 
parameters is explained. IVC = 0 or 1 means that the problem has 
feasible solutions or not, respectively. As we have to estimate the 
number NEST of all alternative optimal solutions in advance (as to 
define the array OPTS), NESTEX = 1 or 0 indicates whether the 
estimated number is exceeded or not, respectively. Consequently, 
after return from the subroutine we have to examine first the values 
of INC and NESTEX in order to give the proper answer. 

Test results. The subroutine has been tested on an IBM 360/67. 
No breakdown of the method has occurred. Further details about 
the computational experience are given in [1]. 

Two examples. 
(i) The objective function: 
f= 2x1 + 5x2 + 4x03 + x4 — 3x5 — X6 + 3. 


The constraints; 


2x1 — Xo + 3x3 + 5x5 — 2x6 > 3 
4x2 — 7X3 + 3x1 + Xs -—X— > —-9 
Xi + Bxe + 4x4 + 2x5 + 3x6 > 7 
5x1 — 2x2 + 4x3 + 3x4 — 5xg > —S 
X1 — Xo + x4 +x. >0 


Maximum: 15. Maximizing point: (1, 1, 1, 1, 0, 0). 
Iterations: 5. Accelerating test: 3. 
(ii) The objective function: 
f= 2x1 — X2 + 4x3 + 7x4 - 5X5 + 12x65 + 9x1 _ 4xs — X9 + 
2X10 + 5. 
The constraints: 
3x1 — x2 + 2x3 + 4x6 — 3x7 + 8x9 + x9 > 5 
4x2 + 7X3 + xX4 + 2X5 = 5x6 + 3X9 + 9x10 = 1 
Xx. — X2 + 3x4, + 7x5 + 8x— + 5x7 — x8 — Txo + 4x19 > 12 
2X1 + 4x3 — X4 + 4x8 + 5x9 + 3X10 = 2 
Maximum: 41. Maximizing point: (1, 0, 1, 1, 0, 1, 1, 0, 0, 1). 
Iterations: 9. Accelerating test: 7. 
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SUBROUTINE MAXLOICM@s N@» NEST» Ms Ns A@s BOs As Bs Bio 
* Sl» Cs Xs Ss SOs BCs Ts IND» INCs NESTEXs Vs» NOPT» OPTS» 


* NI. NAT) 
INTEGER AOCM@sN@)» ACMOsNO)>s BOCMG)s BCMO)s BICMO)s»s | 

% SI1CM@)» CONG)» XCNQ)s SCN)» SOCNGs BCCKNG)» TINO)» 

* IND(M@)s Vs VNEGs @PTSCNESTs NO) 
THIS SUBR@UTINE FINDS THE MAXIMUM AND ALL MAXIMIZING 
POINTS T@ THE LINEAR OBJECTIVE FUNCTION (1) SUBJECT T@ M-1 
LINEAR CONSTRAINTS (2eI) WITH NCGREATER THAN 1) ZERO-@GNE 
VARIABLES AND INTEGER COEFFICIENTS. 
THE MEANING OF THE INPUT PARAMETERS. 
M@s N@s NEST ARE THE ADJUSTABLE DIMENSIGNS SPECIFYING THE 
UPPER B@UNDS FOR THE NUMBER OF ALL CONSTRAINTS,» VARIABLES 
AND ALTERNATIVE OPTIMAL SOLUTIGNSs»s RESPECTIVELY. 
M IS THE NUMBER @F CONSTRAINTS INCLUDING THE SUPPLEMENTARY 
ONE. N IS THE NUMBER @F THE VARIABLES. A@ IS THE TwWG- 
DIMENSI@NAL ARRAY CONTAINING IN THE FIRST M ROWS AND N 
COLUMNS THE C@EFFICIENTS OF ALL CONSTRAINTS. THE FIRST ROW 
CONTAINS THE COEFFICIENTS OF THE SUPPLEMENTARY CONSTRAINT. 
THE @NE~DIMENSIONAL ARRAY B@ CONTAINS THE RIGHT-HAND SIDE 
TERMS OF THE CONSTRAINTS» BO¢1) IS THE ABSOLUTE TERM OF 
THE OBJECTIVE FUNCTI@Ne A@s» BO REMAIN UNCHANGED DURING 
THE WHOLE PROCEDURE. 
THE MEANING OF THE AUXILIARY PARAMETERS~ 
THE TWO-DIMENSI@NAL ARRAY A OR THE ONE-DIMENSI@NAL ARRAY B 
CONTAINS THE COEFFICIENTS @R THE RIGHT-HAND SIDE TERMS 
OF THE CURRENT SYSTEM @F CONSTRAINTS» RESPECTIVELY. 
VNEG IS THE SUM OF ALL NEGATIVE COEFFICIENTS IN THE 
OBJECTIVE FUNCTION MINUS 1. 
ITEST=1 @R O INDICATES IF THE WHOLE SYSTEM OF C@NSTRAINTS 
IS REDUNDANT @R NOT» RESPECTIVELY. SIMILARLY» THE I-TH 
CO@MPQNENT OF THE @NE-DIMENSIONAL ARRAY- IND INDIGATES 
WHETHER THE I-TH CONSTRAINT IS REDUNDANT OR NOT. 
THE ONE-DIMENSI@NAL ARRAY X CONTAINS THE CURRENT PARTIAL 
SOLUTI@Ne A FREE VARIABLE IS REPRESENTED BY A COMPONENT 
EQUAL TO 2- 
THE @NE-DIMENSI@NAL ARRAY S @R BC BOR T INDICATES THE @RDER 
AND MANNER IN WHICH THE FIXED VARIABLES WERE ASSIGNED 
THEIR VALUES @R THE BRANCHING P@INTS @R THE BRANCHING 
POINTS IN WHICH THE ACCELERATING TEST CAN BE APPLIED» 
RESPECTIVELY. NS IS THE NUMBER OF COMPONENTS IN S AND BC. 
THE ONE-DIMENSIONAL ARRAYS Bl»S12S@ AND C HAVE AN 
AUXILIARY CHARACTER. 
THE MEANING OF THE OUTPUT PARAMETERS. 
INC=O0 @R 1 MEANS THAT THE GIVEN PROBLEM IS CONSISTENT 
OR INCONSISTENT» RESPECTIVELY. 
NESTEX=0 @R 1 INDICATES THAT THE ESTIMATED NUMBER OF 
FEASIBLE SOLUTIONS WAS N@T OR WAS EXCEEDEDs RESPECTIVELY. 
V IS THE MAXIMAL VALUE OF THE OBJECTIVE FUNCTION. 
N@PT IS THE NUMBER @F ALL MAXIMIZING FOINTS. 
THE TWO-DIMENSI@NAL ARRAY OPTS CONTAINS IN THE FIRST NOPT 
R@WS ALL MAXIMIZING POINTS.» A COMPONENT MAY BE EQUAL T@ 2 
WHICH INDICATES THAT THE VALUE @F THE CORRESPONDING 
VARIABLE CAN BE ARBITRARY+ NI @R NAT INDICATE THE NUMBER 
OF ITERATIONS OR THE NUMBER @F SUCCESSFUL APPLICATIONS OF 
THE ACCELERATING TESTs RESPECTIVELY. 
THE CALLING PROGRAM SHOULD CONTAIN THE FOLLOWING TYPE- 
STATEMENT 
INTEGER A@CMGsNO)sACMOsNO)s BOCMO)sBCMO)sB1 CMG)» SI CMO)» 
XCCNG)s XCNO)2SCND)s SOONG)» BCC NO)s TCNG)s INDCMO) 2 Ve 


XOPTSCNESTs NOD 
INC = 0 
NESTEX = 0 
N@PT = 0 
NS = 0 
NI = 0 
NAT = 0 
DO 10 J=1lsN 

TOU) = O 


10 CONTINUE 
C@PY THE ARRAYS AGs BO- 
DO 30 Il=leM 
BCI) = BeCI) 
D@ 20 J=1sN 
ACIsJ) = AO(Is J) 
20 CONTINUE 
30 CONTINUE 
ADD THE SUPPLEMENTARY CONSTRAINTs DETERMINE THE INITIAL 
PARTIAL SOLUTION. 


VNEG = -! 
D® 40 J=1.N 
x€J2 = 2 


IF CAC1s J) eLT+O) VNEG = VNEG + AC1s J) 
40 CONTINUE 
BC1) & VNEG 
V = VNEG 
50 D@ 60 I=1sM 
IND(I) = 0 
60 CONTINUE 


C EXAMINE THE CURRENT SYSTEM QF CONSTRAINTS « 


70 D@ 80 I=1sM 
Bicr) = BCID 
80 CONTINUE 
NI a NI + 1 


ITEST = 1 
D@ 110 TsleM 
SicI> = 0 


IF CINDCI)-EQe1) GOB TO 110 
D® 90 J=1.N 
IF CACIs J) eL TO) BICID = BICID = ACIoJ) 
Sicl) = S1CI) + IABSCACIs JD) 
90 CONTINUE 
IF (BICI)-LE-O) G@ TA@ 100 
ITEST = 0 
GQ TO 110 
100 INDCI) = 1 
110 CONTINUE 
IF CITEST«EQ-1) GO TO 420 


C THE SYSTEM C@ONTAINS AT LEAST ONE IRREDUNDANT INEQUALITY. 


D@ 120 Is1sM 
IF CINDC1)+EQe1) GO TO 120 
IF €S1C1)-B1CI)-LT.0O) G@ TO 560 
120 CONTINUE 


THE 
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SYSTEM DOES NOT CONTAIN ANY INCONSISTENT INEQUALITY. 


CONSIDER EACH INEQUALITY SEPARATELY. 


130 


Isi1 
IF CINDCI>-EQ-1) GO TO 360 
IF €S1¢€1I)-B1C1)-GT.0)> GO TA 200 


SOME OF THE FREE VARIABLES ARE FORCED TO CERTAIN FIXED 
VALUES. 


140 


150 


160 


170 


180 
190 


200 
210 


220 
230 


240 


250 


260 


270 


ONE 


280 
290 


300 
310 


320 
330 
340 
350 


360 


D@ 190 J=1.N 
IF CACIsJ»-EQ-0) G@ T@ 190 
NS = NS + 1 
BCCNS) = 1 
IF CACIsJ+-LT-O) GO TA 160 
SONS) 2 J 
X(J) = 
D@ 150 IJz1isM 
BCIJ) = BCIJ) = ACIJs J) 
CONTINUE 
G@ TO 170 
SONS) = -J 
X(J> = 0 
Dg 180 IJetoM 
ACIJsJ) = 0 
CONTINUE 
CONTINUE 
G®@ Te 70 
D2 210 Jz1sN 
CCJ>) = IABSCACIs J)? 
CONTINUE 
Liz 
Jabki+i 
IF (CCL1)+GE-CCJ2) GO T@ 240 
IP = CCLI) 


IF CJeGT»N) GB TH 250 
6@ TS 230 
Li 2bi + 1 
IF CLI-eLT.N) GQ T@ 220 
IF €CC(L1)>-GT.0) GO TO 270 
Li = bi- 1 
G@ TS 260 
IF (S1ICI)-CCLI)eLT-BICI)) GB TO 140 
IF €S1CI)-CC1)-B1iCI)-GE-0) G@ TS 360 
FREE VARIABLE IS FORCED T@ A CERTAIN FIXED VALUE. 
NS = NS + 1 
BCCNS) = 1 
D@ 290 Jz=isNn 

IF CIABSCACIsJ))-EQsCC1)> GO TO 300 
CONTINUE 
IF CACIsJ)+LT-O) GO TO 330 
SONS) = J 
x€J) = 1 
D@ 320 IJ=1sM 

BCIJ) = BCIJ) = ACIJs J) 
CONTINUE 
G@ T® 340 
SONS) 3 -J 
XCJ> = 0 


DZ 350 IJ=1sM 
ACIJs J)» = 0 

CONTINUE 

Gg TS 70 

Ist+i 


IF CI«LEsM) G@ T@ 130 
IF (NS+EQ.N) GO T@ 480 


FIND A NEW BRANCHING POINT. 


370 


380 


390 


400 


410 
THE 


D@ 370 J=1sN 
CCJ>) = IABSCACIs JD) 
CONTINUE 
D@ 380 J=2sN 
IF (€CC1>-GE-CCJ>) GO TA 380 
Ccl)d = CCD 
CONTINUE 
IF (€CC1>-EQ-0) GO TB 390 
NS = NS + 1 
BC(NS) = 0 
I=. 
G@ T@ 280 
DO 410 JsisNn 
D@ 400 J1=1sNS 
IF CJeEQeIABSCSCJ122) GO TA 410 
CONTINUE 
NS = NS + 1 
BCCNS) = 0 
G@ T@ 310 
CONTINUE 
SYSTEM @F CONSTRAINTS IS REDUNDANT. S@LVE AN 


UNCONSTRAINED PROBLEM. 


420 


430 


440 


450 


460 
470 


D@ 470 J=isN 
IF (NS-EQ-N) G@ TQ 480 
IF (CCXCJeNE2) oORe CACLs J) -EQ-0)) GB TA 470 
NS = NS + 1 
BCCNS) = 1 
IF CAC1s J) -LT-O) GO TO 440 
SCNS) = J 
XCJ2 = 1 
D@ 430 I=1sM 
BCI) = BCID - ACIsJ) 
CONTINUE 
G@ TO 450 
SONS) = oJ 
xX€J) = 0 
D@ 460 I=ts™ 
ACIsJ) = 0 
CONTINUE 
CONTINUE 


FIND THE NEW VALUE @F THE @BJECTIVE FUNCTION. 
ADJUST THE ACCELERATING TEST SEQUENCE T+ 


480 


490 


NEWV = 0 
D@ 490 J=1sN 
NEWV = NEWV + XCJ)#AQGC12I) 
CONTINUE 
D@ 500 J=14NS 
K=NS+1- 4 
IF (BCCK)+EQeO) TCK) = 1 


0 


COLLECTED ALGORITHMS (cont.) 


500 CONTINUE 
IF CNEWV.GTeV) G@ TQ 510 
NOPT = NOPT + 1 
IF (N@PTe«LE«NEST) GO TO S40 
THE ESTIMATED FIRST DIMENSION OF THE ARRAY OPTS IS 
EXCEEDED» 
NESTEX = 1 
RETURN 
THE NEW SOLUTION FOUND GIVES A BETTEK VALUE T@ THE 
OBJECTIVE FUNCTION. CHANGE THE SUPPLEMENTARY CONSTRAINT. 
510 NOPT = 1 
Vo = NEWV 
Bel) = V 
D@ $20 J=1sN 
IF (XCJ)eNE.1) GO TO S20 
BCH) = BCL) = AWCI1s J) 
$20 CONTINUE 
D@ 530 J=1sN 
S@C5) = SCID 
530 CONTINUE 
MODIFY THE SET OPTS. 
$40 D@ 550 Jetson 
OPTSCNOPTs J) = XC) 
550 CONTINUE 
560 IF ¢(NS-EQ-0) GO TO 580 
QUESTION IF A BACKTRACKING IS P@SSIBLE. 
1S = 0 
D@ S70 J=1eNS 
IS = IS + BCCJ) 
570 CONTINUE 
IF CIS-LT-NS) GO TO 600 
IF CVeGTeVNEG) GO TO 590 
THE SYSTEM OF CONSTRAINTS IS INCONSISTENT+ N@ SOLUTIGNS- 
580 INC = 1 
RETURN 
THE GIVEN PROBLEM HAS A SOLUTION. ALL S@LUTIG@NS HAVE BEEN 
FOUND. 
590 V = V + BOC1) 
RETURN 
THE BACKTRACKING IS POSSIBLE. 
600 D@ 610 JIZIsNS 
K = NS + 1 = Jl 
IF CBCCK).EQG@-0) GO TO 620 
610 CONTINUE 
620 IF (TCK)-«EQ+1) G@ TO 750 
BACKTRACK. 
630 D@ 740 Ji2KsNS 
D@ 640 J=IsN 
IF (CJ-eEG@-IABSCS(J1)))> G@ TO 650 
640 CONTINUE 
650 IF (KeEQ+J1) G2 T@ 700 
IF (XCJ)-EQei? GO TO 670 
D@ 660 I=15M 
ACI,J) = ABCIDJ) 
660 CONTINUE 
GQ TO 690 
670 D@ 680 I=1.™M 
ACI,J) = AGCIs JS) 
BBCi) = BCI) * ACIs Jd) 
680 CONTINUE 
690 ACJ) = 2 


G@ Ta 740 
700 SK) = =S¢K) 
BCCK) = 1 


XCJ) = 1 - XC) 
IF ¢X€J)-E@-0) GO TB 720 
D@ 710 I=tsM 
BCI) = BCI) - AGCIsJ) 
710 CONTINUE 
G@ T@ 740 
720 D@ 730 I=tsM 
BCI) = BCI) + AGCIsJ) 
730 CONTINUE 
740 CONTINUE 
NS = K 
GQ Te 50 
THE ACCELERATING TEST. 
750 TCK) = O 
ITi = 0 
1T2 = 0 
D@ 790 JIsKsN 
D@ 760 JatsNn 
IF (JeEQ-IABS(SO(0J1))2) G2 TO 770 
760 CONTINUE 
170 IF (KeEQ.J1)> GO TO 7RN 
Te CCOUXC SY «EQ6O) eANDe CAGC Io J) eGT-0)) .ORe 
* CCXCI)©EG@21) «ANDe CAQCISJ)-LT.O0)) IT2 = ITA + 
* IABSCAQC14J)) 
G@ TS 790 
7180 IT1 = IABSCAGC1,J)) 
790 CONTINUE 
IF CITL-LE-IT2) GO TO 630 
THE APPLICATION @F THE ACCELERATING TEST WAS SUCCESSFUL. 
BCCK) = 1 
NAT = NAT + 1 
GO TB 560 
END 
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Editor’s note: Algorithm 449 described here is available on magnetic 
tape from the Department of Computer Science, University of 
Colorado, Boulder, CO 80302. The cost for the tape is $16.00 (U.S. 
and Canada) or $18.00 (elsewhere). If the user sends a small tape 
(wt. less than 1 1b.) the algorithm will be copied on it and returned to 
him at a charge of $10.00 (U.S. only). All orders are to be prepaid 
with checks payable to ACM Algorithms. The algorithm is recorded 
as one file of BCD 80 character card images at 556 B.P.I., even 
parity, on seven track tape. We will supply algorithm at a density of 
800 B.P.I. if requested. Cards for algorithms are sequenced starting 
at 10 and incremented by 10. The sequence number is right justified in 
column 80. Although we will make every attempt to insure that the 
algorithm conforms to the description printed here, we cannot guaran- 
tee it, nor can we guarantee that the algorithm is correct.—L.D.F. 
and A.K.C. 
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Algorithm 450 


Rosenbrock Function 


Minimization [E4] 


Marek Machura* and Andrzej Mulawaf 
[Recd. 22 March 1971] 


* Institute of Automation and Measurements, Warsaw, Poland. 
t Institute of Computing Machinery, Warsaw, Poland. 


Key words and phrases: function minimization, Rosenbrock’s 
method 

CR Categories : 5.19 

Language : Fortran 


Description 

Purpose. This subroutine finds the local minimum of a function 
of n variables for an unconstrained problem. It uses the method for 
direct search minimization as described by Rosenbrock {1]. 

Method. The local minimum of a function is sought by con- 
ducting cyclic searches parallel to each of the 1 orthogonal unit 
vectors, the coordinate directions, in turn. # such searches con- 
stitute one stage of the iteration process. For the next stage a new 
set of # orthogonal unit vectors is generated, such that the first 
vector of this set lies along the direction of greatest advance for the 
previous stage. The Gram-Schmidt orthogonalization procedure is 
used to calculate the new unit vectors. 

Program. The communication to the subroutine ROMIN is 
solely through the argument list. The user must supply two ad- 
ditional subroutines FUNCT and MONITOR. The entrance to the 
subroutine is achieved by 


CALL ROMIN (N, X, FUNCT, STEP, MONITOR) 


The meaning of the parameters is as follows. N is the number 
of independent variables of the function to be minimized. X(N) is 
an estimate of the solution. On entry it is an initial estimate to be 
provided by the user; on exit it is the best estimate of the solution 
found. FUNCT (N, X, F) is a subroutine calculating the value F of 
the minimized function at any point X. STEP is an initial step 
length for all searches of the first stage. The subroutine MONITOR 
(N, X, F, R, B, CON, NR) supplies printouts of any parameter from 
the argument list and contains convergence criteria chosen by the 
user. (Different kinds of convergence criteria and their use arc 
discussed in [1} and [4].) R is the actual number of function evalua- 
tions. B is the value of the Euclidean norm of the vector repre- 
senting the total progress made since the axes were last rotated, i.e. 
the total progress in one stage. CON is a logical variable. At the 
start of the subroutine ROMIN CON is set .FALSE.. If the con- 
vergence criteria are satisfied CON must be set . TRUE. in the sub- 
routine MONITOR, which transfers control back to the main pro- 
gram. NR is the MONITOR index used as described in [3]. The 
CALL statement of the subroutine MONITOR with NR equal to 1 
occurs once per function evaluation and with NR equal to 2 once 
per stage of the iteration process. 

Test results. As a test example, the parabolic valley function 


Si, %2) = 100 (vz — v2)? + UL -— )? 
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was chosen. This function attains its minimum equal to 0 at the 
point (1, 1). Starting from the point (— 1.2, 1.0) the best estimate of 
the solution after 200 function evaluations as found by the sub- 
routine ROMIN was 0.29774-10~4 at the point (0.99513, 0.99053). 
The initial step length STEP was set equal to 0.1 [2]. 
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Algorithm 


SUBROUTINE ROMINCNs Xs FUNCTs STEPs MONITR) 
INTEGER Ns IP 
REAL STEP 
DIMENSION XCN) 
L@GICAL CON 
INTEGER Is Js Ks Ls Ps R 
REAL FO, Fl» Bs BETY 
DIMENSIGN AC30)s DC30)25 VC30s30)2. ALPHAC30230)s BETAC30)5 
* EC30)5 AVC30) 
THIS SUBROUTINE MINIMIZES A FUNCTION OF N VARIABLES 
USING THE METHOD @F R@SENBROCK. THE PARAMETERS ARE 
DESCRIBED AS FOLL@WS?3 
N IS THE NUMBER OF INDEPENDENT VARIABLES 
X(N) IS AN ESTIMATE OF THE S@LUTI@N © ON ENTRY = 
AN INITIAL ESTIMATE, @N EXIT - THE BEST ESTIMATE 
OF THE S@LUTION FQUND > 
FUNCTCNsXsF) IS A ROUTINE PROVIDED BY THE USER TO 
CALCULATE THE VALUE F OF THE MINIMIZED FUNCTION 
AT ANY POINT X 
STEP IS AN INITIAL STEP LENGTH FOR ALL COORDINATE 
DIRECTIONS AT THE START OF THE PROCESS 
MONITR (NsXsFsReoBsCONs NR? IS A ROUTINE PROVIDED BY 
THE USER F@R DIAGNOSTIC AND CONVERGENCE PURPOSES 
R IS THE ACTUAL NUMBER @F FUNCTI@N EVALUATIONS (€ FOR 
THE INITIAL ESTIMATE R=0 ) 
B IS THE VALUE @F THE EUCLIDEAN N@RM OF THE VECTOR 
REPRESENTING THE T@TAL PROGRESS MADE SINCE THE 
AXES WERE LAST ROTATED 
CON IS A L@GICAL VARIABLE. AT THE START OF THE 
SUBROUTINE R@MIN CON=-FALSE- IF THE CONVERGENCE 
CRITERIA @F THE ROUTINE MONITOR ARE SATISFIED 
CON MUST BE SET »TRUE- T@ STOP THE PROCESS 
NR IS THE MONITOR INDEX 
INITIALIZE CON» ECI) AND R 
ECI> IS A SET OF STEPS TO BE TAKEN IN THE CORRESPONDING 
COBRDINATE DIRECTIONS 
CON = «FALSE. 
D@ 10 I=14N 
ECCI) = STEP 
10 CONTINUE 
R = 0 
V¢CIsJ) 1S AN NXN MATRIX DEFINING A SET @F N MUTUALLY 
ORTHOGONAL GOORDINATE DIRECTIONS- VCI,J) IS THE UNIT 
MATRIX AT THE START OF THE PROCESS 
D@ 30 I[31sN 
D@ 20 J=1eN 
VCI»sJ) = 0-0 
IF CL-EQeJ) VCIsI) = 140 
20 CONTINUE 
30 CONTINUE 
CALL FUNCTCN, Xs FO) 
C START OF THE ITERATION LOOP 
40 D® 50 I=1.N 
ACI) = 2.0 
DCI> = 0-0 
50 CONTINUE 
C EVALUATE F AT THE NEW P@GINT X 
60 DO 130 I=1.N 
D@ 70 J=lsN 
XCJ) = XC) + FCI *VCI, JD 
70 CONTINUE 
R=R+i1 
CALL FUNCTCNs Xs FI) 
CALL M@NITRCNs Xs Fle Re O» CON» 1) 
IF (CON) G@ T@ 290 
IF (F1-FO) 80s 905 90 


MEaAQAANAQAAARAANAANANANAANAAANAAAAaANA 


eQaa 


COLLECTED ALGORITHMS (cont.) 


C THE NEW VALUE OF THE FUNCTION IS LESS THAN THE @LD @NE 


80 DCI) = DCI) + ECL) 
ECL) = 3.0*FC(ID 
FO = Fl 
IF CACI)+GT+1-5) ACID = 120 
G@ TO 110 


@ THE NEW VALUE OF THE FUNCTI@N IS GREATER THAN Ok EQUAL 


C TO THE BLD ONE 
90 D@ 100 J=1sN 
XCS) 3 XC) - FOL eVCIS UD 
100 CONTINUE 
ECI) = -0.5*FCI) 
IF CACTI) +L Tet 05) ACI) = 060 
110 DO 120 J=1sN 
IF CACJ)¢GE-0-5) GO TS 130 
120 CONTINUE 
'  G@ T@ 140 
130 CONTINUE 
GQ T@ 60 
GRAM-SCHMIDT ORTHOGONALIZATI@ON PROCESS 
140 D@ 160 K=1,.N 
D® 150 L=tsN 
ALPHACKsL) = 0-90 
150 CONTINUE 
160 CONTINUE 
D@ 190 I=1s5N 
D@ 180 J=lsN 
D@ 170 L=IlsN 


a 


ALPHACIsJ) = ALPHACIsJ) + DCLI*VCL, J) 


170 CONTINUE 
180 CONTINUE 
190 CONTINUE 

B = 0.0 

D@ 200 J=lsN 

B = B + ALPHACIsJ)**2 

200 CONTINUE 

B = SQRTCB) 


an 


DIRECTIONS ¢€ THE NEW MATRIX VCIsJ) >) 
D@ 210 J=lsNn 
VClsJ) = ALPHAC],J)/B 
210 CONTINUE 
D@ 280 P=2,N 
BETY = 0-0 
IP = P- 1 
D@ 220 M=1sN 
BETACM) = 0-0 
220 CONTINUE 
D@ 250 J=tsN 
D@ 240 K=1i,IP 
AVCK) = 0-0 
D@ 230 L=1t.N 
AVCK) = AVCK) + ALPHACPsL)*VCKsL) 


230 CONTINUE 
BETACJ) = BETACJ) - AVCK)*VCKs J) 
240 CONTINUE 


250 CONTINUE 
D@ 260 J=isN 
- BETACJ) = BETACJ) + ALPHACPs J) 
BETY = BETY + RETACJ) **2 
260 CONTINUE 
BETY = SQRTCBETY) 
0@ 270 J=tsNn 
_-VCPsJ) = BETACJ)/BETY 
270 CONTINUE 
280 CONTINUE 
C END @F. GRAM-SCHMIDT PROCESS 
CALL MONITRCN» Xs FOs Re Be CON» 2) 
IF (C@N? GO T@ 290 
C GO T® THE NEXT ITERATION 
G2 T@ 40 
290 RETURN 
END 


CALCULATE THE NEW SET @F @RTHON@RMAL COORDINATE 
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Remark on Algorithm 450 [E4] 

Rosenbrock Function Minimization [Marek Machura 
and Andrzej Mulawa, Comm. ACM 16 (Aug. 1973), 
482-483] 


Adhemar Bultheel [Recd. 10 Oct. 1973] 

Katholieke Universiteit Leuven, Faculty of Applied 
Sciences, Applied Math Division, Celestijnenlaan 200 B, 
B-3030 Heverlee, Belgium 


1. Some misprints were found in the listing of the algorithm. 
(a) An E has to replace the F printed in the following statements: 

The one preceding the statement labeled 70. 

The one following the statement labeled 80. 

The one preceding the statement labeled 100. 

The one following the statement labeled 100. 

(b) The digit 1 should replace the character J as the first index of 
ALPHA in the statement preceding the statement labeled 200. 

(c) RETA should be read BETA in the statement preceding the 
statement labeled 260. 

2. Some compilers detect an error in the calling sequence of 
MONITR in the third line following the statement labeled 70 be- 
cause the fifth argument of MONITR is an INTEGER-type con- 
Stant, and in the subroutine MONITR the fifth argument stands 
for the norm B of a vector which is obviously a REAL-type variable 
as is also assumed in the other calls of MONITR. One way to over- 
come this difficulty is to replace 0 by any REAL constant, say 0. 

3. Since it is often useful to have the initial guess and the cor- 

responding function value printed, an additional call to MONITR 
could be inserted just preceding the COMMENT 
C START OF THE ITERATION LOOP 
This statement could be 
CALL MONITR (N, X, FO, R, 1.E 10, CON, 0). 
The last argument is the monitoring index NR. The user of Romin 
should program MONITR to handle the initial guess when NR=0 
(printing or not, checking for convergence or not, ...). The fifth 
argument is chosen to be a large constant because it stands for the 
norm B of a vector. The routine MONITR will contain a test to 
see if B < « with e ‘small’ and chosen by the user. If one wants to 
check the initial guess for convergence, then the routine would stop 
when B equals 0. . 

4. With these corrections and changes the algorithm was suc- 
cessfully used under a WATFIV compiler on the IBM 370-155 
computer of the Computing Centre of the University of Leuven. For 
the example of the parabolic valley function given by the authors 
of the algorithm and with the same starting point the following 
results were obtained: in a single-precision version 202 function 
evaluations were needed to reach F = 0.299986.10-4, and in a 
double-precision version 194 function evaluations to reach 
F = 0.297742.10-4 and 290 function evaluations gave F = 
0.489134.10713. 
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Remark on Algorithm 450 [E4] 

Rosenbrock Function Minimization [Marek Machura 
and Andrzej Mulawa, Comm. ACM 16 (Aug. 1973), 
482-483] 


Jiri Klemes and Jaroslav Klemsa (Recd. 14 Nov. 1973) 
Applied Mathematics Department, Research Institute of 
Chemical Equipment, CHEPOS, Brno, Czechoslovakia 


After correcting misprints [1] this algorithm runs successfully 
using an ODRA 1204 computer made by ELWRO, Poland. The 
results were the same as reported by authors. Some successful tests 
have been also made in optimization problems concerning the Wil- 
liams-Otto chemical plant [2]. It can be seen from the solution of 
some application problems [3] that it is very useful to select different 
Step lengths in different coordinate directions. 

Therefore, we recommend replacement of the third and fourth 
line in the subroutine ROMIN: 


REAL STEP 
DIMENSION X(N) 


by 

DIMENSION X(N), STEP(N) 

and the line before label 10 

E(I) = STEP 

by 

E(I) = STEP(I) 

In addition we recommend that the lines between labels 220 and 260 
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be replaced by the lines: 


220 CONTINUE 
DO 240 K =1,1P 
AV(K) =0.0 
DO 230 L=1,N 
AV(K) = AV(K) + ALPHA(P,L) * V(K,L) 
230 CONTINUE 
240 CONTINUE 
DO 260 J=1,N 
DO 250 K =1,IP 
BETA(J) = BETA(J) — AV(K)* V(K,J) 
250 CONTINUE 
BETA(J) = BETA(J) + ALPHA(P,J) 
BETY = BETY + BETA(J)**2 
260 CONTINUE 


Although, this change does not reduce the number of function 
evaluations, in each Gram-Schmidt step some computer time may 
be saved. This is caused by the difference between the number of 
executions of the statement AV(K) = AV(K) + ALPHA(P,L)* 
V(K, L) in the original program and the suggested modification 


which may be estimated as N(V — 1) ype (P — 1), whereas for 


the statement AV(K) = 0.0, this difference is (NV — 1) >-3_,(P — 1). 
(Note that N is the number of independent variables.) 
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REMARK ON ALGORITHM 450 


Rosenbrock Function Minimization [E4] 


[M. Machur and A. Mulawa, Comm. ACM. 16, 8 (Aug. 1973), 482-483 ] 


Alan M. Davies [Recd 20 June 1975 and 3 Dec. 1975 ] 
Institute of Oceanographic Sciences, Bidston Observatory, Birkenhead, Cheshire, 


L43 7RA, England. 


The algorithm, incorporating the corrections given in [1], was compiled using the 
Fortran H compiler OPT =2, and run on an IBM 370/165 computer in single pre- 
cision. The test problem given by the authors gave a function value of 0.29923 -10~4 
at (0.99512, 0.99051), after 200 funetion evaluations, and a minimum of 0.38379: 
10-8 was obtained at (0.99994, 0.99988), after 240 evaluations. 

In problems with a large number of variables, the Schmidt orthogonalization 
process can be affected by numerical errors, producing a set of vectors which are 
only approximately orthogonal, and this can increase the number of function 


evaluations required to reach a minimum. 


The orthogonalization of the basis can be enhanced by using the improved Gram- 
Schmidt procedure, together with a few re-orthogonalizations. These changes are 
readily incorporated into ROMIN by replacing the coding following statement 


210 through statement 280 with: 
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DO 300 JCYC=1,NCYC 


DO 250 P=2,N 
IP=P-1 

DO 230 M=P,N 
BETY =0.0 


DO 220 K=1,N 

220 BETY =BETY —ALPHA(M,K)«V(IP,K) 
DO 230 J=1,N 

230 ALPHA(M,J) =ALPHA(M,J)+BETY«V(IP,J) 
BETY =0.0 
DO 240 K=1,N 

240 BETY =BETY+ALPHA(P,K) ##2 
BETY =SQRT(BETY) 
DO 250 K=1,N 

250 V(P,K)=ALPHA(P,K)/BETY 
IF(JCYC.EQ.NCYC) GO TO 300 
DO 302 1=2,N 
DO 302 J=1,N 

302 ALPHA(I,J) =V(I,J) 

300 CONTINUE 


Since the arrays AV and BETA are no longer required (a slight saving of core), 
the DIMENSION statement becomes 


DIMENSION A(80), D(30), V(30,30), ALPHA(30,30), E(30) 


with the variable NCYC, which determines the number of re-orthogonalizations, 
being incorporated into the argument list of ROMIN. 


In problems with four or less variables, this coding did not improve the result. 
However, in an extension of Rosenbrock’s problem [2 ], 


avi 


f(a, t2,.-.,22) = D> 100(ta1 — 22)? + (1 — 2)? 
i=1,2 

(where 7=1,2 indicates that 7 increases in increments of 2), for N =6 starting at 
(0.5, 1.5, 0.6, 1.4, 1.7, 0.3) using this improved Gram-Schmidt procedure, with two 
re-orthogonalizations, a function value of 0.11241-10-? at (1.00150, 1.00308, 
1.00933, 1.01877, 0.99559, 0.99136) was obtained after 1000 function evaluations 
compared with 0.11296: 10-2 produced by the original program plus corrections [1 ]. 

For N = 12, after 1300 evaluations the re-orthogonalized (VC YC =3) calculation 
gave f =0.10871-10-? compared with 0.17160-10-? (NCYC =1, no re-orthogonaliza- 
tion), and after 2600 evaluations the results were 0.57029-10-* (VCYC=1) and 
0.75086-10-5 (VCYC =3). The original program gave 0.15628-10-* compared with 
the above value of 0.57029-10-4 produced by using the improved Gram-Schmidt 
procedure alone. However, using the original program but incorporating just the 
code for re-orthogonalizing a value of 0.73922-10-° (NCYC=83) was obtained, 
illustrating the improvement to be gained by just re-orthogonalization. 

The method was also tested on an extension of Box’s problem [3 ] using 18 vari- 
ables. The original program calculated a minimum of 0.40310-10-%; however, by 
incorporating the changes given above a minimum of 0.48176-10~4 was obtained. 
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Algorithm 451 
Chi-Square Quantiles | G1] 


Richard B. Goldstein [Recd. 30 June 1971 and 20 
March 1972] 


Department of Mathematics, Providence College, 
Providence, R.I. 


ae eae 


Key Words and Phrases: Chi-square statistic, asymptotic 
approximation, normal deviate, chi-square deviate, degrees of 
freedom 

CR Categories: 5.12, 5.5 

Language: Fortran 


a 


Description 

The algorithm evaluates the quantile at the probability level 
P for the Chi-square distribution with N degrees of freedom.’ The 
quantile function is an inverse of the function 


Vin) a 


0 


P(X |N) = (2%? P(N/2))7 i ZiN-Ne-22 dZ_ (x >0,N > 1). 


X(P) 


The function GAUSSD(P) is assumed to return the normal 
deviate for the level P, e.g. —1.95996 for P = .025. The proce- 
dure found in Hastings [5] may be used, or for increased accuracy, 
the procedure found in Cunningham [3] may be used. 


DEGREES OF FREEDOM 


The Wilson-Hilferty cubic formula [7] which is 
x2 ~~ N{l — 2/9N + X (2/9N)3}3 


where X is the normal deviate can be extended to the 19-term 
asymptotic approximation: 


x? ~ NUL — 2/9N + (4X44 16X2— 28) /1215N? 
+ (8X8+720X4+ 3216X2+ 2904) /229635Ni + +>: 
+ (2/N)4{X/3 + (—X843X)/162N 
— (3X°+40X2 + 45X)/5832N? 
+ (301).X7—1519X5— 32769.X* — 79349.X)/7873200N% + +++ J 8 


where X is the normal deviate by taking the cube root of the poly- 
nomial expansion in Campbell [2]. For N = 1 


x? = {GAUSSD (4 P)}? 
and for N = 2 
x? = —21n (P). 


For2 <N <2+ 4|X |, x? was fit with polynomials of the same 
form as the asymptotic approximation: 


x? = N{(1.0000886 — .2237368/N — .01513904/N?) 
“Fr N-1X(.4713941 +-.02607083/N — .008986007 /N?2) 
+ N-1X?(.0001348028 + .01128186/N + .02277679/N?) 
+ N~2X3(— 008553069 — .01153761/N — .01323293/N?) 
+ N~?X4(.00312558 + .005169654/N — .006950356/N?*) 
+ N7~*2X5(— 00084268 12+ .00253001 /N +- .001060438/N?) 
+ N~*X8(.00009780499 — .001450117/N + .001565326/N?) }é 


from Abramowitz and Stegun [1] for P = .0001, .0005,..., .995 
and Hald and Sinkbaek |4| for P = .999, .9995. The deviates 
for N = 3, 4,..., 9 were made accurate within 10° by using 
Algorithm 299 of Hill and Pike [6]. DEGREES OF FREEDOM 


$Ps,9995 = 
SS 


MAXIMUM RELATIVE ERROR ON ,OOO1 


COLLECTED ALGORITHMS (cont.) 


For N = 1 and N = 2 the.x? deviate is as accurate as the 
GAUSSD and ALOG procedure of the system. For .0001 < P< 
.9995 and N > 3 the absolute error in x? is less than .005 and the 
relative error is less than .0003. This is some 100 to 1000 times as 
accurate as the Wilson-Hilferty formula even for large N. Error 
curves for three approximations are shown in Figures 1 and 2. 

The program was tested on an IBM/360 at Rhode Island Col- 
lege and resulted in the output of Table I. 


Table I. 
Table of Computed Values 
Deg. 
Fr. 0.9995 0.9950 0.5000 0.0010 0.0001 
1 0.000000 0.000039 0.454933 10.827576 15.135827 
2 0.001000 0.010025 1.386293 13.815512  18.420670 
3 0.015312 0.071641 2.365390 16.268982 21.106873 
4 (0.063955 0.206904 3.356400 18.467987 23.510040 
5 0.158168 0.411690 4.351295 20.515503 25.744583 
10 «1.264941 =. 2.155869 9.341794 = 29.589081 35.565170 
1S 3.107881 4.601008 14.338853  37.697662  44.267853 
20 5.398208 7.433892 19.337418  45.314896  52.387360 
50 23.460876 27.990784 49.334930 86.660767 95.969482 
100 59.895508 67.327621 99.334122 149.449051 161.319733 
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Algorithm 


FUNCTI@N CHISQD(P»s N) 

DIMENSION CC21), AC19) 

DATA CCO1)/1-565326E~3/2 Cl2)/1-060438E-3/,5 
CO3)/-669S5S0356E-3/9 CC4)/-1.323293E-2/5 
COS) /2¢2TTET9E-2/e CC6)/-8 «9B 600TE=-3/5 
COT 421-51 390 4E-2/5 C(8)/2-530010E-3/5 
CO9/-1¢ ASOLITE-3/%5 CC10)/52169654E-3/s5 
COLDS“ beLS3IT6LE-2/,5 CC129/12128186E-2/7,5 
CO139/26607083E-2/5 C6(L4)/-0 2223736873 
COLSI/9«TEOAIDE-S/eo C616)/-8- 426812E- 4/5 
COITI/S3¢125580E-3/5 CC1B)/-8-S553069E-3/7,5 
COl9IIS1 -348028E-~4/5 C6(20)/0647139 41/75 CC21)/1 00008867 

DATA ACI)/1 626461 6E“2/9 AC2)/-1-425296E-2/5 


RHR HHH ER 


* AC3)/1+400483E-2/5 AC4)/-Se886090E-3/s5 
* ACSI/-~LeO9LAL4E-2/5 AC6)/-24304527E-2/,5 
* ACTI/3Z¢1ISALLE-3/s ACB)/-2 0728 48 4E- 4/5 
* ACID/-9 eBIFEBLE-3/2 AC1O)/1 6316872E-2/5 
* ACI1)D/2 «61891 4E-2/75 AC12)/-0.2222222/, 
* ACI3)/5+ 40667 4E-S/e AC1L4)/3-483789E-S/e 
* ACIS) /-Te2TAT6IE=4/3 AC169/3-292161E-3/5 
* ACIT)/-B+T2ITIBE“3/9 ACIBI/064T1L 4045/5 ACIDI/1L oS 
IF €N-2) 10s 20s 30 
10 CHIS@D = GAUSSD(C.5S*P) 
CHISG@D = CHISQD*CHISQD 
RETURN 
20 CHISQD = -2-*ALOGC(P) 
RETURN 
30 F=aN 
Fl = Le/F 
T = GAUSSD(1--P) 
F2 = SQRTCFL)*T 


IF (NeGE.C2+INTC 4¢*ABS(T)))) GOB TO 40 
CHISOD=( CC CC OCOCE I RF240€2) ) ¥F24CC3) )*F24+CC 4) )¥F2 
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* 4005) >*F24CC 6) FF24C CT) EF IFC CC OC OCCCBD4+CC9) FO) FS 
* 40010) )*F24C0011))eF24+CC12))*F24+CC13)9*F24+CC14)2)#F1 + 
® COCCCOCLES) &F24+CC 16) #F24+C017) #F24+CC 15) *F2 
* +0019) )#F2+C0020) )*F2+C(21) 
G@ Te 50 
40 CHISQD=(CCACL) +AC2)¥F2)*F 140 CCAC3)+AC 4) ¥F2) FO 
* +ACS)RF2+AC6)) EFL 4+CCCCCACT I +FAC8) RFQ) *F2+AC9) )*F2 
* 4AC10))*F24AC11)*F24AC12))>*FL + CCC CCACIS) *F2 
* 4AC14))*F24AC1S))*F24AC16) *F2t4AC17) *F2KFO 
* +AC18))*F2+AC019) 
50 CHISQ@D = CHISQD*CHISQD*CHISQD*F 
RETURN 
END 


Certification of Algorithm 451 [G1] 

Chi-Square Quantiles [Richard B. Goldstein, Comm. 
ACM (Aug. 1973), 483-484] 

William Knight [Recd 26 Nov. 1973] 

Department of Computer Science 

University of New Brunswick* 


The algorithm was tested for degrees of freedom, N = 3 (1) 
5 (5) 25 (25) 100, and tail probabilities, P, of 


.00010 .0010 .010 .10 .80 .980 .9980 
-00015 -0015 -O15 15 .85 .985 .9985 
-00020 .0020 .020 .20 .90 -990 .9990 
-00030 .0030 .030 .30 .93 .993 .9993 
-00050 -0050 .050 50 95 995 .9995 
-00070 -0070 .070 .70 97 .997 


The descriptive text of the algorithm claimed absolute error 
no more than 0.005 and relative error no more than 0.0003 for 
0.0001 < P < 0.9995; the values of P listed above were chosen to 
cover this domain. 

The largest absolute error found on the above grid was 0.0059 
at N = 3, P = 0.0003; a finer scale search nearby uncovered an 
erro: of 0.0062 at N = 3, P = 0.00031. The largest relative error 
found on the grid was 0.0035 at N = 3, P = 0.9985; this being 
an order of magnitude more than the figure claimed, I conjecture 
a typographical error, especially as the table of computed values 
accompanying the algorithm lists 0.071641 for N = 3, P = 0.9950 
which has a relative error exceeding 0.001. 

The remainder of this note describes computational details. 

Testing was done using the Watfiv compiler on an IBM 370/ 
165 at the University of Toronto. 

The following changes were made in the data statements. (1) 
Since the Watfiv compiler will not accept a representation of a 
number consisting of more than seven digits (including, it seems, 
leading zeros) as a short real constant, C(14), C(20), C(21), A(12) 
and A(18) were rejected by the compiler. This was easily circum- 
vented by changing these representations te “‘ E”’ form. (2) The two 
long data statements were broken into several shorter data state- 
ments to simplify detection and correction of punching errors. 
(Moreover, some compilers will not accept nine continuation 
cards!) 

For the inverse normal distribution function subroutine, 


* On sabbatical leave to Mathematics Department, University 
of Toronto, 1973-74. 


COLLECTED ALGORITHMS (cont.) 


GAUSSD, formula 26.2.23 of Abramowitz and Stegun [1] was 
used, followed by two Newton-Raphson iterations in double 
precision, the normal distribution function needed being con- 
structed from the DERFC (complimentary error function) which is 
included in Watfiv. This should give accuracy to single precision; 
spot checks against tables in Abramowitz and Stegun [1] bore this 
out. 

The actual testing procedure was this: From a given N and P, 
CHISQD computed a chi-square value. To establish the correct 
value with which to compare this, it was refined by a single Newton 
Raphson iteration. A rather free Fortran translation of Algorithm 
299 [2] was used to compute the chi-square integral. (Algorithm 299 
should be accurate to the limit set by word length and the square 
root, exponential, logarithm and error function routines.) Where 
possible corrected chi-square values were checked against table 
26.8 of Abramowitz and Stegun [1], agreement to at least three 
places after the decimal point or four significant-figures, whichever 
was more stringent, being found in all cases. 
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1. Abramowitz, M., and Stegun, I. (Eds) Handbook of Mathe- 
matical Functions. Appl. Math. Ser. Vol. 55. Nat. Bur. Stand., U.S. 
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Algorithm 452 


Enumerating Combinations 
of m Out of n Objects [G 6] 


C.N. Liu and D.T. Tang [Recd. 7 July 1971 and 

| May 1972] 

IBM Thomas J. Watson Research Center, Yorktown 
Heights, NY 10598 


Key Words and Phrases: permutations, combinations 
CR categories: 5.30 
Language: Fortran 


Description 

NXCBN can be used to generate all combinations of #7 out of 
n objects. Let the binary n-vector of m1’s and (1 — m) 0’s repre- 
senting a combination of m out of # objects be stored in an in- 
teger array, say 1C(). If NXCBN (n, m, IC) is called, a binary 
vector representing a new combination is made available in the 
array IC(#). If NXCBN (n, m, IC) is called (%) times successively, 
then all combinations will be generated. 

The algorithm has the following features; (a) each output 
binary n-vector differs from the input at exactly two positions— 
consequently each generated combination differs from the pre- 
vious one: by a single object: (b) the -vectors generated by this 
subroutine form a closed loop of (;,) elements ~—-therefore the ini- 
tial combination may be specified arbitrarily, and the enumeration 
of any subset of (;.) combinations can be readily achieved. The 
second feature is not found in Chase's algorithm [1]. 

The algorithm underlying this procedure is based upon our 
study of properties of Gray codes. It can be shown that constant 
weight code vectors from a Gray code sequence are separated by a 
Hamming distance of 2. The mathematical analysis is contained 
in {2] and {3}. 


References 

t. Chase, P.J. Algorithm 382, Combinations of 7 out of 1 
objects. Comm. ACM [3 (June 1970), 368. 

2. Tang, D.T., and Liu, C.N. On enumerating 7 out of 1 
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Algorithm 
SUBROUTINE NXCBNCNs Ms IC) 
C EXPLANATION QF THE PARAMETERS IN THE CALLING SEQUENCE 
Cc ON THE TOTAL NUMBER @F @BJECTS 
c oOM THE NUMBER OF @BJECTS T@ BE TAKEN FROM N 
C IF M=Q, OR M>=N, EXIT WITH ARGUMENTS UNCHANGED 
Cc IC AN INTEGER ARRAY. IC CONTAINS AN N-DIMEN- 
c SIONAL BINARY VECT@R WITH M ELEMENTS SET T@ 1 
c REPRESENTING THE M OBJECTS IN A COMBINATION 
C THIS ALGORITHM IS PROGRAMMED IN ANSI STANDARD FORTRAN 
INTEGER ICCN) 
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C CHECK ENDING PATTERN @F VECTOR 
IF (M-GEeN eORe M-EQe0) GO TO 140 
Nito=wN- 1 
D@ 10 J=1,NI 
NJ =N - J 
IF CICCN) -EQ@-IC(NJ)? GO TO 19 
Ji = J 
G2 Te 20 
10 CONTINUE 
20 IF (MODC(Ms2).EQ-1)> G@ TO 90 
C FOR M EVEN 
IF CICCN).EQ-1) GO TS 30 
Ki =N- Jl 
K2 = Ki + 2 
GQ TS 130 
30 IF (M@DC(J1s2)eEQ.1)> GO TO AC 
GQ T@ 120 
C SCAN FROM RIGHT TO LEFT 
40 JP = (N-J1) - 1 
D@ SO I=i»sJP 
Il = JP +2- I 
IF CICC11)-EQ.0) G@ TO 50 
IF CICCLI-1)>-EQ-1) GO TS VO 
G@ TS 80 
50 CONTINUE 
60 Kl = 1! 
K2 = CN+12 = M 
G@ T@ 130 
70 Kl = 11 ~- 1 
K2 = N- Jl 
G@ T@ 130 
80 Ki = Il - 1 
K2 = (N+1) = J! 
G@ T@ 130 
C FOR M BDD 
90 IF CICCN) -EQ-1) GO T@ 110 
K2 = €N-Jt) - 1 
IF (K2-.EQ-0) GO TO 60 
IF CICCK2+1)¢EQe1 eANDe IC(K2)-EQ-1) GB TO 100 
Ki = K2 + 1 
G@ T@ 130 
400 KI = N 
G@ T@ 130 
110 IF (MODC(J152)-EQ-1) G@ TO 120 
G@ TO 40 
i420 Ki = N- Jl 
K2 = MINOCCKE+2)5N) 
C COMPLEMENTING TW@ BITS T@ OBTAIN THE NEXT COMBINATION 


130 ICCKI)> = 1 - ICCK1I) 
IccK2) = t - IC¢K2) 
140 RETURN 
END 


COLLECTED ALGORITHMS FROM CACM 


Algorithm 453 


Gaussian Quadrature 
Formulas for Bromwich’s 
Integral [D1] 


Robert Piessens |Recd. 2 Aug. 1970 and 8 Feb. 1972] 
Applied Mathematics Division, University of Leuven, 
Heverlee, Belgium 


Key Words and Phrases: Gaussian quadrature, Bromwich’s 
integral, complex integration, numerical inversion of the Laplace 
transform 

CR Categories: 5.16, 5.13 

Language: Fortran 


Description 

BROMIN calculates the abscissas x;,"’ and weights w.” of the 
Gaussian quadrature formula 
EE x (s) (s) 
(1 /2z/) / ex F(x) dx = we Eve (i) 


oie k=l 


where c is an arbitrary real positive number, s is a real nonnegative 
parameter, and F(x) must be analytic in the right-half plane of the 
complex plane. 
Abscissas.x¢” and weights we” are to be determined so that (1) 
is exact whenever F(x) is a polynomial in 1~!, of degree <2N — I. 
The abscissas xe” are the zeros of Py,.(x7~!) where 


Pys(u) = (—1)%2Fo(-N,N +s — 1; —-; 4). (2) 


Properties of Py,.(u) are studied in |1]J. 

The quadrature formulas of even order have no real abscissas; 
those of odd order have exactly one real abscissa. All the abscissas 
have positive real parts and occur in complex conjugate pairs. 

The zeros of (2) are calculated using Newton-Raphson’'s 
method. Finding an approximate zero as starting value for the 
iteration process is based on a certain regularity in the distribution 
of the zeros (see [1] and [2]). The starting values, used by BROMIN 
were tested for s = 0.1(0.1)4.0 and N = 4(1)12. Each abscissa 
was found to at least eight significant figures in at most six iteration 


steps. 
The weights are given by 
bg N — 1)! INS 2 | 
Ai ety" Eee 2a rap . aS (3) 
I(N + 5 — I)Nxe? | Py_,,,(xe') 


The polynomial (2) is evaluated by a threesterm recurrence rela- 
tion (see [1]). Due to roundoff errors, the accuracy of abscissas 
and weights decreases significantly for increasing N. In Table I 
we give for some values of s and N the moduli of the relative errors 
in the abscissas and weights, calculated by BROMIN (with TOL = 
0.1£ — 10) onan IBM 370 computer in double precision (approxi- 
mately 16 significant figures). For comparison we used the 16 — S$ 
values given in [3]. 

Note that the relative errors in the weights are larger than in 
the abscissas. 

The use of complex arithmetic is avoided in BROMIN in 
order to facilitate the conversion to a double precision subroutine. 
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Table I. Maximum Relative Errors in Abscissas and Weights 


Maximum error in Maximum error in 


abscissas weights 
Ss N = 6 N = 12 N= 6 N = 12 
0.1 1.8 X 1078 1.7 X 10°79 1.2 K 10°73 2.3 & 1078 
10 1.9 X 107% 5.3 X10 1.5 K 10°14 6.4 * 107% 
4.0 1.3 K 10° 2.3 * 107% 1.0 X 1074 4.3 & 107! 
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Algorithm 


SUBROUTINE BROMINCNs Ss TOL» XRs XI» WRse WIs EPS, LER) 
D@UBLE PRECISION AKs ANs ARG» CIs CRs Ds Dh» D@s Es EPS» 
* FAC» FACTIs FACTR» PIs» PRs QI» QR» Ris RR» S» Tls T2s 
* TOL» Us Vs Ws WR» XIs XR»o YI» YRe Z 
INTEGER IER» Js Ks Lo» Ne Nis NUMs NUPs IGNAL 
DIMENSION XRON)» XICN)s WRONDs WIEN) 
THIS SUBROUTINE CALCULATES ABSCISSAS AND WEIGHTS OF THE 
GAUSSIAN QUADRATURE F@RMULA @F ORDER N FOR THE BROMWICH 
INTEGRAL. @NLY THE ABSCISSAS OF THE FIRST QUADRANT OF 
THE COMPLEX PLANEs THE REAL ABSCISSA CIF N IS @DD) AND 
THE C@RRESPONDING WEIGHTS ARE CALCULATED. THE @THER 
ABSCISSAS AND WEIGHTS ARE C@MPLEX CONJUGATES « 
INPUT PARAMETERS 
N @RDER OF THE QUADRATURE FORMULA. 
N MUST BE GREATER THAN 2. 
T@L REQUESTED RELATIVE ACCURACY @F THE ABSCISSAS+» 
$s PARAMETER @F THE WEIGHT FUNCTION. 
OUTPUT PARAMETERS 
XR AND XI CONTAIN THE REAL AND IMAGINARY PARTS OF 
THE ABSCISSAS- IF N IS @DDs THE REAL ABSCISSA 
IS XRCb). 
WR AND WI CONTAIN THE REAL AND IMAGINARY PARTS OF 
THE CORRESPONDING WEIGHTS. 
EPS IS A CRUDE ESTIMATION OF THE @BTAINED RELATIVE 
ACCURACY OF THE ABSCISSAS. 
IER IS AN ERROR C@DE- 
IF LER=0 THE COMPUTATION IS CARRIED QUT TO 
THE REQUESTED ACCURACY. 
IF IER-GT-O THE IER-TH ABSCISSA IS N@T FOUND. 


IF ITERS-1 THE COMPUTATI@NS ARE CARRIED UT, 
BUT THE REQUESTED ACCURACY IS N@T 
ACHIEVED. 

IF IER=<2 N IS LESS THAN 3- 


FUNCTION PROGRAMS REQUIRED 
FUNCTI@N GAMMACX) WHICH EVALUATES THE GAMMA 
FUNCTION FOR POSITIVE X- 
IER = -2 
IF (NeLT-3) RETURN 
Ni = (N#1)72 
L=e=Nel 
AN = N 
IER = 0 
EPS = T@L 
ARG = 0+034D0*(30.DO+AN+AN) /CAN~1 D0) 
FACTR = DCOSCARG) 
FACTI = DSINCARG) 
FAC = 1.D0 
AK = 0-DO 
D@ 10 K=1e6L 
AK = AK + 1.D0 
FAC = -FAC*AK 
10 CONTINUE 
FAC = FAC#CANtAN+S~-2.D0)**2/¢( AN*DGAMMACAN+S-1.-D0) > 
C CALCULATION @F AN APPROXIMATION OF THE FIRST ABSCISSA 
YR = 1¢333D0*AN + S - 1.5D0 
YI = 0-0D0 
IF (N-NI-Nt) 305 205 20 
20 YI = YE + 1-6D0 + 0-07D0«S 
C START MAIN L@OP 
30 D@ 140 K=1sNI 


MQMARQEQAAANAAANAAMAANANAAAANANAAANAAAANAAN 


COLLECTED ALGORITHMS (cont.) 


C NEWTON-RAPHSON METHOD 


D= 
YR = 
YI = 


YR*YR + YL*YI 
YR/D 
-YI/D 


G@ Te 50 


40 IGNA 
50 QR = 
Ql = 


Lo = 1 
‘S*YR - 14DO 
‘S*YI 


PR = €S41.D0)"((€S4#2-D0)#CYR¥YR-YI*Y1)-2-DO#YR) + 1-+DO 


PI = 2¢D0%(S+1 -DO)*YI*"((S+22.D0)*YR-1.0D0) 


Ze 


2.D0 


DO 60 J=3.N 


RR 
RI 
QR 
al 


60 CONT 
IF ¢ 
D = 
D1 
ba 


a 
Ps) 
“iounu eon 


C TEST OF C@NVERGENCE OF ITERATION PROCESS 
IF (CGR*CR+tCI*CI-EXERCYR#YR+YI*YI)) 


= QR 

= QI 

= PR 

= PI 

Z + 1-D0 

Z+ $ - 2-b0 

U+Zz 

CVEYR+ 62 ¢DO-S)/(V-2-D0)9/U 
CZ-1 6DOI*V/CU*(V-2.D0)) 
VeYL/U 


wow ow it 


LNUE 
IGNAL-EQ-1) G@ TO 100 
CYR¥YR+Y DYDD *V 


CCPREOR)¥YR+CPIL+QL)*YIO/D + PR 
COPI+Q1) *YR-CPR+ORI*YII/D + PI 


CDI*DL+D2*D2) *AN 
‘PR¥EYR - PI*YI 
PI*YR + PR#*YI 
€T1*D1+T2*D2)/D 
“€T2*D1-TI*D2)/D 
“YR = CR 

YI - CI 
= NUM + 1 


C TEST OF NUMBER OF ITERATION STEPS 
70 IF (NUM-10) SOs 50s 80 


80 Es 
IER 
NUP 


E*10.D0 
= -1 


= NUP + 1 


LF (NUP-5) 50s 50+ 90 
90 IER = K 
RETURN 
C CALCULATIGN @F WELGHTS 


100 IF CEPS-GE+«E) G@ T@ 110 
EPS 2 E 
110 -D = CQR¥OR+OL OI) &x2 


DI =: YR*OR + YI*QI 

D2 = .YI*OR - YR*GQI 

WROK) = FAC*(D1*Dt-D2*D2>/D 
WICK) = 2.D0*FAC*D2*D1/D 

D = YR*YR + YI¥*¥YI 

XKCK) = YR/D 

XICK) = -YI/D 


IF CK+1-NE) 1305 120s £50 
120 FACTR = DC@S¢1-S5D0*ARG) 
FACTI = DSINC1+SDO*ARG) 


130 YR = 
YI = 


140 CONTINUE 


150 RETURN 
END 


€V~1.D0)*COR*D-GQI1*D2) + DI*RR 
CV-1-D0)*CQI*D+QR*D2) + DI*RI 


70 


C CALCULATION @F AN APPROXIMATION OF THE (K+1)-TH ABSCISSA 
CXRCK)40«67D0#AN)*FACTR - XICK)*FACTI 
CXRCK)+0¢67DOXAN)*¥FACTI * XICKI*FACTR 


0. 67D0*#AN 
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COLLECTED ALGORITHMS FROM CACM 


Algorithm 454 


The Complex Method for 
Constrained Optimization [E4] 


Joel A. Richardson and J.L. Kuester* |[Rec’d. Dec. 22, 
1970 and May 5, 1971] 
Arizona State University, Tempe, AZ 85281 


Key Words and Phrases: optimization, constrained 
optimization, Box’s algorithm 

CR Categories: 5.41 

Language: Fortran 


Description 
Purpose. This program finds the maximum of a multivariable, 
nonlinear function subject to constraints: 


Maximize F(X, X2,..., Xy) 
Subject to G.,< Xi < Ay, A = 1,2,...,™M. 


The implicit variables Xy41,..., Xay are dependent functions of 
the explicit independent variables X1, X2,..., Xx. The upper 
and lower constraints H; and G, are either constants or functions 
of the independent variables. 

Method. The program is based on the “complex”? method of 
M.J. Box |2]. This method is a sequential search technique, which 
has proven effective in solving problems with nonlinear objective 
functions subject to nonlinear inequality constraints. No deriva- 
tives are required. The procedure should tend to find the global 
maximum because the initial set of points is randomly scattered 
throughout the feasible region. If linear constraints are present or 
equality constraints are involved, other methods should. prove to 
be more efficient [1]. The algorithm proceeds as follows: 

(1). An original “complex” of K > N+ 1 points is generated 
consisting of a feasible starting point and K — 1 additional points 
generated from random numbers and constraints for each of the 
independent variables: X;,; = G:; + rij(Hi — G;), i = 1, 2,..., 
N, and j = I, 2,..., K — 1, where r;,; are random numbers be- 
tween O and 1. 

(2) The selected points must satisfy both the explicit and implicit 
constraints. If at any time the explicit constraints are violated, 
the point is moved a small distance 6 inside the violated limit. If 
an implicit constraint is violated, the point is moved one half of the 
distance to the centroid of the remaining points: X;,;(mew) = 
(X;,;(old) + Xi,-)/2, i = 1, 2,..., NM, where the coordinates 
of the centroid of the remaining points, X;,-, are defined by 


- 1 K 
Niemen =| 4 Xig— Xi jold |, 2 = 1,2,...,N. 
K-11 


* The authors acknowledge financial support from a National 
Science Foundation summer fellowship and Arizona State Uni- 
versity Grants Committee fellowship. Computer facilities were 
provided by the Arizona State University Computer Center and 
AiResearch Manufacturing Company. 
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This process is repeated as necessary until all the implicit con- 
straints are satisfied. 

(3) The objective function is evaluated at each point. The point 
having the lowest function value is replaced by a point which is 
located at a distance a times as far from the centroid of the re- 
maining points as the distance of the rejected point on the line 
joining the rejected point and the centroid: 

X;,;(new) = a(Xi, = X;,;(old)) + Ais ’ 
Box [2] recommends a value of a = 1.3. 
(4) If a point repeats in giving the lowest function value on con- 
secutive trials, it is moved one half the distance to the centroid of 
the remaining points. 

(5) The new point is checked against the constraints and is ad- 
justed as before if the constraints are violated. 

(6) Convergence is assumed when the objective function values 
at each point are within 8 units for y consecutive iterations. 


Program. The program consists of three general subroutines 
(JCONSX, JCEK1, JCENT) and two user supplied subroutines 
(JFUNC, JCNST1). The use of the program and the meaning of 
the parameters are described in the comments at the beginning of 
subroutine JCONSX. All communication between the main 
program and subroutines is achteved in the subroutine argument 
lists. An iteration is defined as the calculations required to select a 
new point which satisfies the constraints and does not repeat in 
yielding the lowest function value. 


Test results. Several functions were chosen to test the program. 
The calculations were performed on a CDC 6400 computer. Some 
examples: 


1. Box Problem [2] 
Function: F = 9 ~ (X%, — 3)2)X3/27V3 
Constraints: 0 < % < 100 
0< X% < X/V3 
0 < (X%; = X%1 + V3X2) < 6 
Starting point: X%, = 1.0, X. = 0.5 
Parameters: K = 4, a = 1.3, B = OI, y = 5, 6 = .0001 
Computed results Correct results: 


F = 1.0000 F = 1.0000 
X = 3.0000 X, = 3.0000 
X, = 1.7320 X» = 1.7321 


Number of iterations: 68 
Central processor time: 6 sec. 


2. Post Office Problem [3] 
Function: F = X,X2X; 
Constraints: 0 < X; < 42, 7 = 1, 2,3 
O< (X% = XM + 2X. + 2X3) < 72 
Starting point: X%; = 1.0, X. = 1.0, X; = 1.0 
Parameters: K = 6, a = 1.3, 6 = .O1, y = 5, 6 = .000] 
Computed results: Correct results: 


F = 3456 F = 3456 
X, = 24.01 X; = 24.00 
X, = 12.00 X» = 12.00 
X; = 12.00 X; = 12.00 


Number of iterations: 72 
Central processor time: 6 sec. 


COLLECTED ALGORITHMS (cont.) 


3. Beveridge and Schechter Problem [1 | 
Function: F = —(X, — 0.5)? — (X, — 1.0)? 
Constraints: —2 < X%, < 2 
~V2<2X%2 5 V2 
—4A< (Xs = M+ 2X27 -— 4) < 0 
Starting. point: X; = 0., X%» = 0. 


Parameters: K = 4, a = 1.3, B = .OOOO!, y = 5, 6 = .0001 


Computed results: Correct results: 


F = .Q000 F = .0000 
X, = .5035 X, = .5000 
X, = .9990 X, = 1.0000 


Number of iterations: 40 
Central processor time = 5 sec. 


a 


References | 

1. Beveridge, G.S., and Schechter, R.S. Optimization: Theory 
and Practice. McGraw-Hill, New York, 1970. 

2. Box, M.J. A new method of constrained optimization anda 
comparison with other methods. Comp. J. 8 (1965), 42-52. 

3. Rosenbrock, H.H. An automatic method for finding the 


greatest or least value of a function. Comp. J. 3 (1960), 175-184. 


Algorithm 
SUBROUTINE JCONSX(Ns Ms Ks ITMAXs ALPHAs BETAs GAMMA, 
* DELTA» Xs Rs Fs ITs ITEV2s KOs Go Hs XCe Ld 

PURPOSE : 
TO FIND: THE CONSTRAINED MAXIMUM @F A FUNCTION OF 
SEVERAL VARIABLES BY THE COMPLEX METH@D GF Me Js BOX. 
THIS IS: THE PRIMARY SUBROUTINE AND CO@RDINATES THE 
SPECIAL: PURP@SE SUBROUTINES (JCEK1, JCENT» JFUNC,» 
JCNST1)> INITIAL GUESSES OF THE INDEPENDENT VARIABLES» 
RAND@M NUMBERS» SOLUTI@N PARAMETERS» DIMENSI@N LIMITS 
AND PRINTER CODE DESIGNATION ARE @BTAINED FROM THE MAIN 
PROGRAM, FINAL FUNCTI@N AND INDEPENDENT VARIABLE 
VALUES ARE TRANSFERRED T@ THE MAIN PR@GRAM FOR 
PRINTOUT. INTERMEDIATE PRINT@UTS ARE PR@VIDED IN THIS 
SUBROUTINE. THE USER MUST PROVIDE THE MAIN PR@GRAM AND 
THE -SUBR@UTINES THAT SPECIFY THE FUNCTION CJFUNC) AND 
CONSTRAINTS CJCNST1). F@RMAT CHANGES MAY BE REQUIRED 
WITHIN THIS SUBROUTINE DEPENDING @N THE PARTICULAR 
PROBLEM UNDER CONSIDERATION. 
USAGE : 
CALL JCONSX(Ns Ms Ks LTMAXs ALPHAs BETA» GAMMAs DEL TAs Xs Ro Fs 
IT, [LEV2sK@» Ge» H» XCsL) 
SUBROUTINES REQUIRED 
JCEKLCNs Ma KsXeGsHs L»KODEs XC» DEL TAsLs KI) 

CHECKS ALL POINTS AGAINST EXPLICIT AND IMPLICIT 

CONSTRAINTS AND APPLYS CORRECTION IF VI@LATIGNS ARE 

FOUND 
JCENTCNs Me Ks TEVis Ts XCoXsLoK1) 

CALCULATES THE CENTR@ID OF POINTS 
JFUNC ON» Me Ks Xe Fe Isl) 

SPECIFIES OBJECTIVE FUNCTION CUSER SUPPLIED) 
JCNSTI CN» Ms Ko XsGeHe IL? 

SPECIFIES EXPLICIT AND IMPLICIT CONSTRAINT LIMITS 

CUSER SUPPLIED). Q@RDER EXPLICIT CONSTRAINTS FIRST 
DESCRIPTION OF PARAMETERS 

N NUMBER @F EXPLICIT INDEPENDENT VARIABLES - DEFINE 
IN MAIN PROGRAM 


M NUMBER OF SETS @F CONSTRAINTS ~ DEFINE IN MAIN 
PR2GRAM 

K NUMBER @F POINTS IN THE COMPLEX - DEFINE IN MAIN 
PROGRAM 

ITMAX MAXIMUM NUMBER OF ITERATIGNS - DEFINE IN MAIN 
PROGRAM 

ALPHA REFLECTION FACTOR - DEFINE IN MAIN PR@GRAM 

BETA CONVERGENCE PARAMETER - DEFINE IN MAIN PROGRAM 


GAMMA CONVERGENCE PARAMETER - DEFINE IN MAIN PROGRAM 
DELTA EXPLICIT CONSTRAINT VIGLATI@N CORRECTION - DEFINE 
IN MAIN PROGRAM 


x INDEPENDENT VARIABLES = DEFINE INITIAL VALUES IN 
MAIN PROGRAM 

R RAND@M NUMBERS BETWEEN O AND 1 - DEFINE IN MAIN 
PROGRAM 

F OBJECTIVE FUNCTION - DEFINE IN SUBR@UTINE JFUNC 

IT ITERATION INDEX ~- DEFINED IN SUBROUTINE JCONSX 


YEV2 INDEX OF POINT WITH MAXIMUM FUNCTION VALUE ~ 
DEFINED IN SUBROUTINE JCONSX 

IEVI INDEX @F POINT WITH MINIMUM FUNCTION VALUE - 
DEFINED IN SUBROUTINE JCONSX AND JCEK1 


KO PRINTER UNIT NUMBER - DEFINE IN MAIN PROGRAM 

G LOWER CONSTRAINT ~- DEFINE IN SUBROUTINE JCNST1 

H UPPER CONSTRAINT - DEFINE IN SUBROUTINE JCNST1 

XC CENTR@ID - DEFINED IN SUBR@UTINE JCENT 

L TOTAL NUMBER @F INDEPENDENT VARIABLES CEXPLICIT + 
IMPLICIT) - DEFINE IN MAIN PROGRAM 

I POINT INDEX - DEFINED IN SUBROUTINE JCONSA 


KODE KEY USED TO DETERMINE IF IMPLICIT CONSTRAINTS ARE 
PROVIDED - DEFINED IN SUBR@UTINE JC@NSX AND JCEK1 
Kl De LO@P LIMIT - DEFINED IN SUBR@UTINE JCONSXK 
DIMENSI@N X(KsL)s RCKsN)s FCK)s GOM)s HOM)» XCCN) 
INTEGER GAMMA 


AaAAaNNMQaAIANAaNNaANANAANAANANAANNANAMANAaANNaANANaAaAaAaNAaNNaAANNANaANNANANNANaNaGaAaAaANIaAAAAN 


| i a | 
WRITE (K6,99995) IT 
K@DE = 0 


IF CM-N) 205 205 10 


Cc 


99999 FORMATCLH 
99998 FORMATCI1H 
99997 FORMATCIH 
99996 FORMATCI1H 
99995 FORMATCIH 


aggaggaaaaanaa 
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10 K@DE = 1 
20 CONTINUE 
D@ 40 IL1=2sK 
D@ 30 J=lsN 
XCII,J) = Of 
30 CONTINUE 
40 CONTINUE 
CALCULATE COMPLEX POINTS AND CHECK AGAINST CONSTRAINTS 
DO 60 I1=25K 
D@ SO J=1isN 
I=! 
CALL JCNSTICNs Ms Ks Xs Gs He Io L? 
XCILI,J)2 = GOS) + ROLTIs J *# CHC J-GCU)) 
50 CONTINUE 
K1 = IL 
CALL JCEKICNs Ms Ks Xs» Gs He Is KODE» XC» DELTAs Lo KI) 
WRITE (KO299999) IIs CXCIL, J), J=19N) 
60 CONTINUE 
Ki = K 
D@ 70 1=14K 
CALL JFUNCCNs Ms Ks Xe Fs Is bd 
70 CONTINUE 
K@UNT = 1 
IA = 0 
FIND POINT WITH LOWEST FUNCTION VALUE 
WRITE €K@s99998) CFCI)sT=15K) 
80 IEVi = 1 
D@ 100 ICM=2,K 
IF CFCIEVI2-FCICM)) 1005 100, 90 
90 IEVi = ICM 
100 CONTINUE 
FIND POINT WITH HIGHEST FUNCTI@N VALUE 
IEv2 = 1 
D@ 120 ICM=2,K 
IF CFCIEV2)-FCICM)) £10. 110, 120 
110 IEV2 = ICM 
120 CONTINUE 
CHECK CONVERGENCE CRITERIA 
IF CFCIEV2)-CFCIEVI)+*BETA)) 1405 1302 130 
130 K@UNT = 1 
G@ T@ 150 
140 KOUNT = K@UNT + 1 
IF CKOUNT-GAMMA) 150s 240, 240 
REPLACE POINT WITH LOWEST FUNCTION VALUE 
150 CALL JCENTCNs Ms Ks IEVIs [sy XCs Xs Le Ki) 
DO 160 J=1sN 
XCIEVIs,I>2 = CLe+ALPHA)*CXCCJ)) - ALPHA*CXCIEVI]s J) 
160 C@NTINUE 
I = IEv1 
CALL JCEKICNs Ms Ks Xs Gs Hs Is KODE» XCs DELTA» Le K1) 
CALL JFUNC(N» Ms Ks Xs Fo Is L) 
REPLACE NEW PQINT IF IT REPEATS AS L@WEST FUNCTION VALUE 
170 IEv2 = 1 
De 190 ICM=2sK 
IF CFCLEV2)-FCICM)) 190, 190s 180 
180 IEV2 = ICM 
190 CONTINUE 
IF CIEV2-IEV1) 220. 200, 220 
200 D@ 210 JJ=lsN 
XCIEV1Is,JIJ) = CXCTEVI2 ISI +tXCC III Z2.4 
210 CONTINUE 
1 = ITevi 
CALL JCEKICN» Ms Ks Xs Gs Hs Is K@DEs XC» DELTAs Ls K1) 
CALL JFUNCC(Ns Ms Ke Xs Fs Is Ld 
GB TO 170 
220 CONTINUE 
WRITE (€KQ,99997) (XCIEVI, JB),JB=1,N) 
WRITE €K0s99998) CFCI)s1=15K) 
WRITE €KOs99996) CXCCJI 2s J=19N) 
IT = IT + 1 
IF CIT-ITMAX) 230, 2305 240 
230 CONTINUE 
WRITE €K@s99995) IT 
G@ T® 80 
240 RETURN 
15Xs 21H COBKDINATES AT POINT» I4/8CF8.4s 2X)) 
20X%s 16H FUNCTION VALUESs /8CF10-4» 2X)) 
20X» 16H CORRECTED POINTs /8CF8.4s 2X)) 
21H CENTR@ID CO@RDINATESs 2X» 8CFB-4s 2X2) 
7/1\0H ITERATIONs 4X5 15) 


wey vy 


END 


SUBROUTINE JCEKICNs Ms Ks Xe» Gs Hs Iso KODE» XC» DELTAs Lo 
* KI) 
PURPOSE 
T@ CHECK ALL POINTS AGAINST THE EXPLICIT AND IMPLICIT 
CONSTRAINTS AND T®@ APPLY CORRECTIONS IF VI@LATIONS ARE 
FOUND 
USAGE 
CALL JCEKICNsMsKsXeGsHe lL» KODE» XC» DEL TAs Lo K1) 
SUBROUTINES REQUIRED 
JCENTOCNs Ms Keo LEVI se Te XCoXebo Kl) 
JCNSTLECNs Ms Ke XsGsHo Tob) 
DESCRIPTION @F PARAMETERS 
PREVI@USLY DEFINED IN SUBROUTINE JC@NSX 
DIMENSIGN X(KeL>» GOM)s HOM)» XCON) 
10 KT = 0 
CALL JCNSTICNs Ms Ke Xs Go Hs Is L? 
CHECK AGAINST EXPLICIT CONSTRAINTS 
D@ SO J#lsN 
IF (XC1sJ)-GCU) 202 202 30 
20 XCI,J) = GCJ) + DELTA 
G@ T@ SO 
30 IF CHOJ-XCIsd)) 40s 402 50 
40 XC€I,J) = HCJ) = DELTA 
50 CONTINUE 
IF CKODE) 1105 1105 60 
CHECK AGAINST THE IMPLICIT CONSTRAINTS 
60 CONTINUE 
NN = N+ ft 
D@ 100 J=NNsM 
CALL JCNSTICNs Ms Ks Xs Gs» Hs Iv Ld 
IF CXCIsJ5)-GCJ)) 805 70s 70 


COLLECTED ALGORITHMS (cont.) 


70 IF CHCJ)-XCI+J)) 80» 100% 100 
80 IEVvi = I 
KT = 1 
CALL JCENTCNs Ms Ks ITEVi» Is XC» Xs Ls KI) 
DO 90 JJ=15N 
XCIsJJ) = OXCTs JS 4+XCC III 726 
90 CONTINUE 
100 CONTINUE 
IF (KT) 110, 1105 10 
110 RETURN 
END 


SUBROUTINE JCENTCNs Ms Ks IEVIs Is XCs Xs Ls KI) 
C PURPOSE 
c T@ CALCULATE THE CENTR@ID OF POINTS 
C USAGE 
Cc CALL JCENTCNsMsKs LEVI oe Ts XCsXsboK1) 
C SUBR@UTINES REQUIRED 
Cc NONE 
C DESCRIPTI@N @F PARAMETERS 
c PREVIQUSLY DEFINED IN SUBROUTINE JC@NSX 
DIMENSION X¢€KsL)» XCCN) 
D@ 20 J=l.N 
XCCJ> = Oe 
D@ 10 IL=15K1 
XCCJ) = XCCU) + XCILO JD 
10 CONTINUE 
RK = Kl 
XCCI) = CXCCIIA“XCITEVES JI CRK-1 6) 
20 CONTINUE 
RETURN 
END 


Remark on Algorithm 454 [E4] 

The Complex Method for Constrained Optimization 
[Joel A. Richardson and J.L. Kuester, Comm. ACM 16 
(Aug. 1973), 487-489] 


Kenneth D. Shere [Recd. 8 Oct. 1973] 
Mathematical Analysis Division, Naval Ordnance Lab- 
oratory, Silver Spring, MD 20910 


This algorithm can result in an infinite loop. This happens 
whenever the “corrected point,” the centroid of the remaining 
“complex” points, and every point on the line segment joining these 
two points all have functional values lower than the functiortal 
values at each of the remaining complex points. Two examples for 
which this algorithm fails are [1] and [2]: 

1. maximize f(x) = —100(x.—«2)? — (1—x)? 

— 10 < x, x2 <10, initial value (x1, x2) = (—2.5, 5.0) 
and 
2. maximize 

S(8, ¢) = 0.2 (sin (80) cos (po) sin (0) cos (¢) + sin (80) sin (do) 

sin (0) sin (6) + cos (60) cos (@)) — 1.0 (sin® (6) cos? (6) 
+ cos® (p) sin® (¢) sin’ (6)) 
0 < 0,6 < 1/2, (00, do) = (.8726, .0873), 
initial (0, 6) = (9/4, 7/4) 
Also, there is no difference in usage between M and L. 

A similar method is the “simplex method” [3]. A modification 
to the “complex method” which uses the ideas of [3] has been pro- 
grammed. The modified JCONSX solves each of the above prob- 
lems in under 5 CP sec on a CDC 6400. The modified routine is 
available to interested parties upon request. 

It is also worth noting that the variable 1A, which appears in 
the second statement after 70 CONTINUE is not used elsewhere. 


References 

1. Rosenbrock, H.H. An automatic method for finding the 

greatest or least value of a function. Compur. J. 3 (1960), 175-184. 
2. Ferguson, R.E. An electromagnetism problem. (Private 
communication.) 

3. Parkinson, J.M., and Hutchinson, D. An investigation into the 
efficiency of variants on the simplex method. In Numerical Methods 
for Nonlinear Optimization, F.A. Lootsma, Ed., Academic Press, 
New York, 1972. 
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Algorithm 455 


Analysis of Skew 
Representations of the 
Symmetric Group [Z] 


D.B. Hunter* and Julia M. Williamst [Recd. 5 Feb. 
1971] 

* Department of Mathematics, University of Bradford, 
Yorkshire, England 

+ 12 Peel Close, Heslington, York, England 


Key Words and Phrases: symmetric group, skew representation, 
partition, Young diagram, lattice permutation, binary model, outer 
product 

CR Categories: 5.30 

Language: Algol 


Description 
This algorithm analyzes the skew representation [\]-[u] of the 
symmetric group o, corresponding to a pair of partitions 


(A) = (Ar, dv,..., Ar) and (u) = (ur, wey... , ws) Where 

r>s5 | 

AL > Den > 2 

Mi 2 we Dts 2 ps (1) 
MS Me (<i<s) 


(see Robinson [4, sec. 2.5]). The analysis takes the form 


ble) = Doe coy [rl (2) 


where the summation is over all partitions (v) of ”, the coefficients 
C.») being nonnegative integers. 

The method used may be described as follows: construct all 
possible diagrams which can be built up in accordance with the 
following two rules. 

(a) Replace y, of the nodes in the Young diagram corresponding 
to (A) by identical symbols a, in such a way that: (i) the unchanged 
nodes form a regular Young diagram; and (ii) no two identical 
symbols a, lie in the same column. Then replace y»,_; further nodes 
by identical symbols @,_1 in accordance with the same rules, and 
so on, finally replacing 4: nodes by identical symbols a . 

(b) In the final diagram the altered nodes should form a lattice 
permutation of ala? ---as* (Robinson [4, sec. 2.4]) when 
read from right-to-left through successive rows. 

Then the pattern of unchanged nodes in each diagram so con- 
structed defines a term [»] in the analysis. 

This method appears not to have been explicitly stated in the 
above form before, but is an immediate consequence of Little- 
wood’s method for analyzing the outer product [\].[u] (see Little- 
wood [3, sec. 6.3, th. V], Robinson [4, sec. 3.3]), noting that c,,) 
is also the coefficient of [A] in the analysis of [u].[v] (Litthewood 
[3, sec. 6.4, th. VIII]). 

In the procedure, binary models of those partitions (v) in (2) 
for which c(v) ~ O are stored, in lexicographic order, in nu[1], 
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nu(2],..., nu[p], the corresponding values c(,) being stored in 
c[1], c[2], ..., e[p]. The binary model used is due to Comét [1], 
a partition (vy) = (v1, %,..., ) being represented by the number 


ony ots 22 —¥1— 2 foes e 4 2" + 1. (3) 


The techniques used are similar to those employed in [2]. In 
particular, two two-dimensional arrays /am and sigma are required. 
Corresponding to any particular diagram, /am [/, j] specifies the 
number of nodes in row / which are still unchanged when all the 
symbols a, , as_1,..., a; have been inserted (j = i,i+1,...,7), 
and sigma {i, j] specifies the total number of symbols a; inserted 
in rows i, i + 1, ...,/. Thus the quantities /am[i,/] are generated 
by the equation 


laml(i,j) = lam{i+1,j] — sigmali,j| + sigmali,j—1]}. (4) 
The rules for constructing the diagrams impose the restrictions 
sigmali—1,j—1] => sigmali—1,j] — lam|i,j] + lam[i,j+-1| (5) 
and 

sigmali—1,j—1] > sigma{i,/}. (6) 


Each time array /am is completed, a term 
(vy) = (lam{I,1], /am[1,2], ... , /amf1,r]) (7) 


is added to the analysis. 
Note I. In view of the identity 


[A].(u] = [Arttur, Aittus,..., Artus, Ar, Azy- e+, Ar] — [Ar], 


procedure skew may also be used to analyse the outer product 
fA]. [uJ]. It is, however, less convenient for this purpose than pro- 
cedure outer product of Hunter [2]. 

Note 2. Value of p. It is difficult to predict the value of p in 
any example. Clearly, p < p(n), where p() denotes the number of 
partitions of ». On the other hand, for any value of n, there are 
partitions (A) and (u) for which p = p(m), namely, (A) = 
(nn—1,...,1),() = (n—l,..., 1). 


References 

1. Comét, S. Notations for partitions. MTAC 9 (1955), 143-146. 
2. Hunter, D.B. Outer product of symmetric group representa- 
tions. BIT 10 (1970), 106-114. 

3. Littlewood, D.E. Theory of Group Characters, 2nd ed. Oxford 
U. Press, England, 1950. 
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Algorithm 
procedure skew (r, s, lambda, mu, p, c, nu); 
value r, s; integer r,s, p; integer array /Jambda, mu, c, nu; 


begin 
comment Input parameters. 
r: the number of parts in partition (A). 
Ss: the number of parts in partition (x). 
lambda: the part d; is stored in fambdali], i = 1,2,..., 7r. 
mu: the part py; is stored in mu[i],i = 1,2,..., 5. 
Output parameters. 
DP: the number of terms on the right in (2) for which 
Co) ~ 0. 
nu: Binary models (3) of the partitions (v) in (2) for which 
Cv) ~ O are placed in lexicographic order in nu({\l, 
nu[2],..., nu[p]. 
c: cli] contains the coefficient c,,) of the partition whose 


binary model is in nu{i); 
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integer i, j, k, x, y; else sigmali+1,J]; 
integer array /am[1:s+1,l:r], sigma[l:s+1,0:r]; if sigma[i,j—1] = O then 
p := 0; for i := 1 step 1 until s do /am[i+1,i] := lambda |i); begin 
for 7 := s+1 step 1 until r do for x := j — 1 step —1 until i do sigma{i,x—1] := 0; 
begin g0 to build 
lam|s+1,j| := lambda{j); sigma|s+1,j—1] := 0 end 
end; end 
for i := 1 step 1 until s do sigma{i, r] := mu{i); end 
k := mu[s) — lambda{r]; sigma{s, s—1] := 0; end skew 
for j:= r— 1 step —1 until s do 
begin 


sigma|s, /} := if k > O then k else 0; 
k := sigma|s, j] — lambda{j| + lambda|j-+1| 
end; 
i:= 5s; 
build: 
for i := istep — 1 until | do 
begin 
for / := istep 1 until r do 
lam|i, J] = lam{i+1, J] = sigmali, J) + sigmali, j-1); 
if i * | then , 


begin 
k := muli—1| — lam|i, r|; sigmali—1, i—2] := 0; 
for 7 := rstep —1 until ido 
begin 


sigmali—1, 7—1] := if k > sigma|i, /| then k 
else sigmali, j]; 
k := sigmali—1, j—1| — lam|i, j—1] + lam{i, j| 
end 
end 

end; 

xi=jfi= 1; 

for j := j + 1 while (if />r then false else /am|i,j]>0) 


do x := x X 2 fT /am{1,/] + 1; 
if (if p = O then true else x>/1u[p]) then 
begin 
p:= p+ 1; nulp| := x; clp) := 1 
end 
else 
if x = nu|p] then c[p] := c[p] + 1 
else 
begin 
Ji=1lj,ki=p; 
search: 
yi= (tk) + 2; if x = nuly) then c{y] := cly] + 1 
else 
if nufy] <x A x < nuly-+1] then 
begin 
for k := pstep —1 until y + 1 do 
begin, 
c[k +1] := c[k]; nu[k+1] := nulk] 
end; 
e[y+1] := 1; mu[y+1] := x) p:= pt! 
end 
else 
begin 
if x < nuly] then k := y else j := y; go to search 
end 
end; 


for i := 1 step | until s do 
for y :== istep 1 until r—1do 
if sigmali,y] < sigmali,y+1] then 
begin 
sigma|i,y] := sigma[i,y] + 1; 
for 7 := y step — 1 until ido 
begin 
k := sigmali,j) — lam{i+1,/] + /am{i+1,j+1]; 
sigmali,j—1] := if k > sigma{i+1,/] then k 
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Algorithm 456 
Routing Problem [H] 


Zdenék Fencl [Recd. 16 Nov. 1970, 4 Oct. 1971, and 
28 Jan. 1972] 

RCA, Computer Systems Division, 200 Forest Street, 
Marlborough, MA 01752 


The algorithm was originally developed as a part of vector 
ordering procedures at the Design Automation Center, RCA, Marl- 
borough, Massachusetts, and was extended to general use in the 
traveling salesman and nonsymmetric routing problem. 


Key Words and Phrases: routing problem, shortest path, traveling 
salesman problem, Hamiltonian circuit 

CR Categories: 5.40 

Language: Fortran 


Description 

The algorithm finds the shortest serial (branchless) connection 
between nodes of a net beginning in the start node sn and termi- 
nating in the end node ev or terminating in any node. Also given 
is the m X m matrix d of distances (with zero diagonal and not 
necessarily symmetric) between all pairs of nodes, and the vector 
p containing » node numbers to be connected referring to appro- 
priate entries in the matrix d. The algorithm is constructed so that 
for one net (given by the matrix d) various connections, not neces- 
sarily exhausting all of m nodes, may be created; hence n < m. 
The case sv = en is also permitted, which actually yields a Hamil- 
tonian circuit—-traveling-salesman problem. If, in input, en = 0, 
the start-to-any connection is assumed. Also as an input is the 
number of runs r, which is discussed below. In the output, the 
original vector p is replaced by conjectured optimal sequence of 
n nodes, and / contains the connection length. The matrix d does 
not need to represent a Euclidean net nor be symmetric. Thus 
the algorithm may serve as a more general tool to solutions of 
related problems. 

Since the method is heuristic, which implies it is approximate, 
guaranty of an optimal solution is based on empiric probability. 
The algorithm uses a tour-building method combined with tour-to- 
tour improvements. 

In the first phase, the tour, or sequence of nodes, is built up 
by successively inserting not-yet-involved nodes into the tour. 
If, in the middle of tour building, the tour, for instance, consists 
of the nodes pi, p2,..., Px, the next node among the nodes 
Pk+i Petz ,+++ , Pn, and the arc (to be split by the chosen inserted 
node) among the arcs pips, PePs,..., PePi, are chosen so that 
the tour increment will be minimum; ie. i (1 < i < k) and 
i(k <j <x) are chosen in such a manner that d(p;.p;) + d(pj. pi41) 
— d(p;. p41) = min. Tour building starts with the arc pp, and ter- 
minates when all » nodes have been included. The tour-building 
approach of this kind for the traveling-salesman problem was 
originated by Karg and Thompson [1] and further developed 
by Raymond [2]. This algorithm, however, handles an open con- 
nection—start-to-end or start-to-any node. The maintenance of 
this property is ensured in the algorithm by assigning to the end- 
to-start or each-to-start distance sufficiently large negative values 
(—n X max:,{d.;]) which, in some way, firmly attach the end or any 
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of n nodes to the start node permitting a circuit to form. In fact, 
the algorithm works on a net as if it were a closed circuit and 
keeps the node configuration by modifying the distance matrix. 
In output, the distance matrix is returned to its original form. 

A tour thus built is hardly optimal and for larger nets it is 
probably far from optimum. The second phase improves the tour 
(for n > 3) by the so-called 3-opt method proposed by Lin [3]. 
Improvements consist in exchanging three arcs, or links of the 
given connection by three other links. If there are no more 3 links to 
exchange for tour improvement, the tour is said to be 3-optimal. 
In general, \-optimality can be considered. The implication of the 
3-link exchange is essentially in reinsertions. Consecutive node 
chains of length k (1 < k < n) are successively tried to be rein- 
serted (both as are and inverted) into remaining links for tour 
improvements, which actually represent 3-link exchanges (and 
also 2-link at the same time). A 3-opt tour shows a certain proba- 
bility to be an optimal one in relation to n. Different 3-opt tours 
can be achieved if different initial nodes are chosen, which allows 
us to increase the probability of obtaining an optimal solution. 

The algorithm can run r trials (as specified in input) with 
different initial nodes (p:, set automatically), thus obtaining 
different solutions while the best is saved and replaced in the vector 
p in output. For runs r > » (r < 2n) there is little chance for 
further improvement, because initial nodes repeat and the tour 
development can be affected only by previous contents of the 
vector p on which the tour is built. Probability that the 3-opt 
tour is optimal is somewhat higher in this algoiithm, than in the 
one Lin suggests. In contrast to finding a 3-opt solution from a 
given random sequence of nodes, the fast building of an appropriate 
tour in the first phase considerably reduces the number of rein- 
sertions in the second phase. The algorithm generalization to the 
noncyclic and nonsymmetric problems, in comparison to the 
traveling-salesman problem, increases computational time. 

A considerable number of test examples have been run by the 
algorithm including the three problem types mentioned and the 
non-Euclidean and nonsymmetric problems. To outline the capa- 
bility and how the “cost-approximation” factor r should be set 
for various n’s, a survey of tested problems is presented, most of 
which problems have been solved and published before. The algo- 
rithm in Fortran was run on the RCA’s SPECTRA 70/45 (fixed- 
point add time equals 8.88 ywsec), and is recommended for a high 
probability (over 95 percent) of obtaining an optimum if r = 2 
to 5forn < 10 andr = 5 to 15 for n < 30. For higher n’s, unless 
cost is out of consideration and r can be set up to 2”, the checking 
of successive results is advisable to see how improvements are 
developing (p and /; should be checked after the tour-length calcu- 
lation). These checks can also serve for getting suboptimal solutions. 

In the program, the distance matrix d is in fixed-point mode, 
which makes computation faster and does not seem to be a serious 
restriction. Decimal order range of distances is expected to be 
small enough to be represented in fixed point, and calculations 
(additions and subtractions) will, most likely, not face overflow 
problem. 

The arrays 1D and Q should have the maximum subscript set 
at least ton. 

The algorithm is believed to be applicable also to problems 
in which all connections do not necessarily exist. In terms of graph 
theory a graph representing the net to be routed need not be com- 
plete; i.e. every pair of vertices may be connected only in one of 
the two possible directions. The graph, however, must be strongly 
connected; i.e. there must be a path joining any pair of arbitrary 
distinct vertices. Nonexisting arcs might be expressed by assigning 
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Survey of tested problems 


Conjectured nh 
Ref. n sn en optimum Yopt [sec] 
Karg and Thomp- 1 2 118 1 
son [1] 5 1 0 en=5 108 1 <1 
1 1 148 1 
Raymond [2] 1 5 165 1 
7 1 0 en=4 140 1 <i 
1 1 179 1 
Barachet [4] 1 2 350 1 
10 1 0 en=7 298 1 
1 1 378 2 1.4 
1 2 308 1 
1o* 1 0 en=7 257 2 
1 1 336 2 
Author 1 2 102 1 
12 I 0 en =12 95 1 3.0 
1 1 : 114 1 
Author 1 6 117 1 
13 1 0 en=12 102 1 3.0 
1 1 130 1 
Held and Karp [5] i 25, = 1517 10 21.8 
BS. isa 0 en=25 1517 2 22.3 
| 1 +x 1711 1 29.7 
Karg and Thomp- 1 33 a 10655 2 53.6 
son [1] 33001 en=14 10585 10 53.4 
1 iad 10861 6 53.7 


* Nonsymmetric problem (two distances changed: (6, 5) = 1, and (8, 3) = 1). 


** Results obtained from 10 runs. 


to the appropriate distances d,. sufficiently large positive values, 


for instance n < max;; [dj;]. 


Symbol summary 


n number of nodes to be connected (2 <n < m). 

p vector containing m node numbers (in output, it contains 
node number sequence of conjectured shortest path). 

si Start node number (1 < sx < m; no check is provided whether 


sn is contained in p). 
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Algorithm 
SUBKOUTINE R@UTNGC(N, Ps» SNs ENs Ms De Lo Rd) 
INTEGER PCN)s DCMsM)» IDC60)s QC€60)» SNs EN» R 
N - NUMBER OF N@DES T@ BE CONNECTED 
P - NODE NUMBER VECT@R CIN OUTPUT,» OPTIMAL CONNECTION) 
SN- START N@DE NUMBER 
EN- END NODE NUMBER 
DISTANCE MATRIX Q@RDER 
D - DISTANCE MATRIX 
L - SHORTEST CONNECTION LENGTH (OUTPUT) 
R - NUMBER @F RUNS 
GET LARGE NUMBER (= N X MAX DCIsJ)) 
LARGE = 0 
D@ 20 I=1.M 
D@ 10 J2tsM 
IF (DC€IsJ)-GT-LARGE) LARGE = DCI,J) 
10 CONTINUE 
20 CONTINUE 
LARGE = LARGE*N 
DEFINE NON-EXISTING ARCS BY ASSIGNING 
THEIR DISTANCES LARGE NEGATIVE VALUES 
IF CENeNE-0) GO TO 40 
D@ 30 I=1sM 
IDCI>) = DCI» SN) 
DCIsSN) = -LARGE 
DCSNs» SN) = O 
30 CONTINUE 
40 IF (SN-EQ-EN -ORe EN-EQ-0) GO TO 5O 
IDC1)> = DCENs SN) 
DCENs SN) = -LARGE 
C RUN R TRIALS 
50 L = LARGE 
D@ 280 IRS=1tsR 
C BUILD T@UR BY SUCCESSIVE INSERTING 
C NOT-YET-INV@LVED NODES 
C INITIATE TOUR IS CONSIDERED AS 
C ARC P¢(1)> TS PC1> 
DQ 90 JS=2sN 
MININC = LARGE 
C TRACE ALL NOT-YET-INV@LVED NODES 
C T® CH@@SE THE ONE WITH MINIMUM INCREMENT 
D@ 70 J=JSsN 
JP = PCJ) 
JE = JS - 1 
C F@R EACH N@T-YET-INV@LVED N@DE TRACE ALREADY 
C BUILT-UP TOUR T@ CHO@SE THE MINIMUM INCREMENT AKC 
D@ 60 I=1s,JE 
IP = PCI) 
IP1 = PCI+1) 
IF CI«EQ-JE>) IPI = PC1) 
INC = DCIPs JP) + DCUPsIP1> - DCIPsIPIL> 
IF CINC-GE«eMININC? GO TO 60 


aAaAaAaaANRAANAN 
=z 
1 


aa 


en end node number (1 < en < m, if en = 0, start-to-any con- 


nection is assumed; en = sn is allowed, which is traveling- 
salesman problem; no check is provided whether sv is con- 
tained in p) 


m _— order of distance matrix d(m > n > 2). 

ad =m Xm matrix of distances of all node pairs (zero diagonal, 
not necessarily symmetric). 

l length of conjectured shortest path (output). 

r number of runs (trials; r < 2n). 

opt Serial run number during which optimum has been achieved. 

ft; average computational time of one run in seconds. 
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JisJ 
Ii 2! 
MINING = INC 
60 CONTINUE 
70 CONTINUE 
C STRETCH TOUR BY INSERTING THE CH@SEN NODE PC(J1) 
C BETWEEN THE N@DES PC11> AND PCI1+1) 
80 Jl = Ji- 1 
IF CJ1-EQ@-I1) GO T@ 90 
IP = PCJ1) 
PCJ1) = PCJI+1) 
PCJI+1) = IP 
G@ Te 80 
90 CONTINUE 
C CORRECT TOUR BY 3-OPT METHOD 
C VARY CONSECUTIVE CHAIN LENGTH K 
NI 3 N- 1 
IF (NeLTe3) GO TB 210 
DQ 200 K=1isN1 
ICQ@UNT = 0 
C SHIFT C@x.S5ECUTIVE CHAIN 
C THR@UGHGUT SEQUENCE @F N N@DES 
100 ICOR = 0 
D@ 190 J=1sN 
C CALCULATE CHAIN LENGTH IN FORWARD 
C AND BACKWARD DIRECTION 
Li 20 
LR = 0 
IF CKeEQ-1) G@ T® 120 
reuJ 
Ki = 1 
110 IF CIeGTeN) I = I - N 
IP = PCI) 
IPi = 1 + 1 


IF CIP1-GToN) IP1i = 1 
IP1 = PCIP1) 

L1 = Li + DCEPs IPI) 

LR = LR + DCIPIsSIP) 
Ist+i 

K1 = Ki + 1 

IF CK1-LT«K) GO T@ 110 


C F@R EACH POSITI@NED CHAIN CAS IS AND INVERTED) 
C CHECK ALL ARCS IF INSERTION IMPROVES TOUR 


120 


MININC = LARGE 
Jit=J+K-il 

IF CJEoGT*N) Jl = JI - N 
D@ 150 I=14N 


IF (JeLEeJdt «AND. CI2GE-J -AND. I-LE-J1)2) GO TO 
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* 150 
IF CJeGTeJ1 «AND. CI-LEsJI «OR- I1.GE.-J)?)? GO TO 150 
IP = PCI) 
JP = PCJ) 


JPL = PCJ1) 

IP] = 1+ 2 

IF CIP1.GTeN) IPI 
JE = IPI 


IF CIPL-EQ.J) IPL = Ji + 1 
IF CIPI-GT.N) IP} = 1 
IPi = PCIP1) 
LN = Ll 
IR = 0 
130 INC = DCIPsJP) + LN + DCJPIsIPIL) - DCIPsIPI) 
IF CEINC-GTeMININC »@R» CINC-EQ-MININC -AND. 
* CJENEsJ «OR. JE-EQeJ AND. IR-EQ-1)))? GO TO 140 
Ii eal 
IR! = IR 
MENINC = INC 
140 IF CIR-EQ-1) G@ T@ 150 
IR = 1 
LN = LR 
JS = JP 
JP = JP1 
JPi = JS 
G@ T@ 430 
150 CONTINUE 


r=) +1 
IF CI1.GT«N) I = I 
IF CL1eEQ-J «ANDe IR1-E0-0) GOB TO 190 


C REINSERT CHAIN @F LENGTH K STARTING IN J 
C BETWEEN NODES PCI1) AND PCI1t+h) 
ICOR = 1 
JS 2 J 
JE = 0 
IF CIR1-EQ@.0) GO T@ 160 
JS = Jl 
JE = -! 
160 Kl = 0 
170 Kl = KI + 2 
IF CK1-GT-K) GO TO 190 
I= JS 


JS = JS + JE 
IF (JSeLTe-1) JS =N 
180 IP =!r+ 1 
[IF CIP-GT.N) IP = 1 
JP = PCI) 
PCI) = PCIP) 
PCIP) = JP 
I=1I+t 
IF C1.-GTeN) I = 1 
IF CIP-If) 1805 170s 180 
190 CONTINUE 
IF CICGR-EQ-.9) G@ TE 200 
ICQUNT = ICQUNT + 1 
IF CICOUNT.LT-N) GE TO 100 
200 CONTINUE 
C @RIENT TEURK WITH SN IN PCL) 
210 DG 230 I=1sN 
IF (PC1L)-EQ-SN) GO TO 240 
JS = PCI) 
D® 220 J=IsNI 
PCJ) = PCU+1) 
220 CONTINUE 
PCN) = JS 
230 CONTINUE 
C CALCULATE TOUK LENGTH 
240 Lt 3 0 
DS 250 T=1sN1 


IP = rel) 
IPt = PCr+l) 
LI = Li + DOCIPs IPI? 


250 CONTINUE 
IP = PCL) 
IF (SN-EQ.EN) LI = LI + DCIPI»sIP) 

C SAVE S@LUTION»s IF SFTTFR», AND SET NEW INITIATE NODE 
IF (L1-GE.L) GO TE 270 
Lo= Lt 
DG 260 I=15N 
9CI) = PCI) 
269 CONTINUE 


e749) J = [KS + 1 
IF (JeCTeNI J=ZJI- WN 
Js = . 1) 
PCE) = PCJ) 
PC.12 = JS 
239 Covel dere 
C RESTO. © AND DGA1Y Ot TANCES 


Wi 24d T= kee 
P(t) = «TD 
290 CONTING+ 
TF Chive tee) GF 10 SIO 
OS 2). (2124 
MClsovy = Loel) 
309 Ceni lve 
SG CF COMP UCEN Une ENetUeD) GF TO 329 
DCENs SND = IDCL) 
SAI nk LUN 
ENO 
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Remark on Algorithm 456 [H] 
Routing Problem 
[Zdenék Fencl, Comm. ACM 16 (Sept. 1973), 572] 


Gerhard Tesch [Recd. 15 Oct. 1973] VFW Vereinigte 
Flugtechnische Werke GMBH, 28 Bremen 1, Hunefeld- 
strasse 1-5, Germany and Zdenék Fencl, M.L.T., De- 
partment of Urban Studies, R. 9-643, Cambridge, Mass. 


Some confusion arose from the description of the algorithm 
capability. It should have been stated that the generated tour must 
pass through each of the n nodes once and only once, although this 
is the base for the definition of the traveling salesman problem. This 
algorithm solves an extended traveling salesman problem in which 
the end node does not have to be the start node. Such connections 
may be sought in the design automation of serial printed circuits as 
well as in transportation problems. The traveling salesman problem 
is discussed in [3, p. 232] and methods of solution are surveyed in [1]. 

The users who seek the shortest paths in electric networks (the 
shortest connection between the two specified nodes in a net without 
regard to the number of nodes to be connected) are referred to 
Ford’s shortest path algorithm [2, p. 69] and Dantzig’s shortest path 
algorithm [3, p. 175]. There is a set of three efficient Algol algorithms 
by J. Boothroyd [4] handling the shortest path problem as defined 
in [2, p. 69] and (3, p. 175]. These Algol algorithms can be modified 
so that even the number of nodes may be minimized or a restriction 
of some nodes may be imposed, etc. 

Another type of shortest path algorithm is Lee’s algorithm [5 
and 6]. This algorithm is applicable for the orthogonal routing of 
printed circuit boards. 
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Algorithm 457 


Finding All Cliques of an 
Undirected Graph [H] 


Coen Bron* and Joep Kerboscht [Recd. 27 April 
1971 and 23 August 1971] 

* Department of Mathematics f+ Department of In- 
dustrial Engineering, Technological University Eind- 
hoven, P.O. Box 513, Eindhoven, The Netherlands 


Present address of C. Bron: Department of Electrical Engineering, 
Twente University of Technology, P.O. Box 217, Enschade, The 
Netherlands. 


Key Words and Phrases: cliques, maximal complete subgraphs, 
clusters, backtracking algorithm, branch and bound technique, 
recursion 

CR Categories: 3.71, 5.32 

Language: Algol 


Description 

Introduction. A maximal complete subgraph (clique) is a 
complete subgraph that is not contained in any other complete 
subgraph. 

A recent paper [1] describes a number of techniques to find 
maximal complete subgraphs of a given undirected graph. In this 
paper, we present two backtracking algorithms, using a branch- 
and-bound technique [4] to cut off branches that cannot lead to a 
clique. 

The first version is a straightforward implementation of the 
basic algorithm. It is mainly presented to illustrate the method used. 
This version generates cliques in alphabetic (lexicographic) order. 

The second version is derived from the first and generates 
cliques in a rather unpredictable order in an attempt to minimize 
the number of branches to be traversed. This version tends to pro- 
duce the larger cliques first and to generate sequentially cliques 
having a large common intersection. The detailed algorithm for 
version 2 is presented here. 

Description of the algorithm—Version I. Three sets play an 
important role in the algorithm. (1) The set compsub is the set 
to be extended by a new point or shrunk by one point on traveling 
along a branch of the backtracking tree. The points that are eligible 
to extend compsub, i.e. that are connected to all points in compsub, 
are collected recursively in the remaining two sets. (2) The set 
candidates is the set of all points that will in due time serve as an 
extension to the present configuration of compsub. (3) The set 
not is the set of all points that have at an earlier stage already 
served as an extension of the present configuration of compsub and 
are now explicitly excluded. The reason for maintaining this set 
not will soon be made clear. 

The core of the algorithm consists of a recursively defined 
extension operator that will be applied to the three sets just de- 
scribed. It has the duty to generate all extensions of the given 
configuration of compsub that it can make with the given set of 
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candidates and that do not contain any of the points in not. To 
put it differently: all extensions of compsub containing any point 
in not have already been generated. The basic mechanism now 
consists of the following five steps: 


Step 1. Selection of a candidate. 

Step 2. Adding the selected candidate to compsub. 

Step 3. Creating new sets candidates and not from the old sets by 
removing all points not connected to the selected candidate 
(to remain consistent with the definition), keeping the old sets 
in tact. 

Step 4. Calling the extension operator to operate on the sets just 
formed. 

Step 5. Upon return, removal of the selected candidate from 
compsub and its addition to the old set or. 


We will now motivate the extra labor involved in maintaining 
the sets not. A necessary condition for having created a clique is 


‘that the set candidates be empty; otherwise compsub could still be 


extended. This condition, however, is not sufficient, because if 
now zor is nonempty, we know from the definition of not that the 
present configuration of compsub has already been contained in 
another configuration and is therefore not maximal. We may now 
state that compsub is a Clique as soon as both not and candidates are 
empty. 

If at some stage not contains a point connected to all points in 
candidates, we can predict that further extensions (further selec- 
tion of candidates) will never lead to the removal (in Step 3) of that 
particular point from subsequent configurations of not and, there- 
fore, not to a clique. This is the branch and bound method which 
enables us to detect in an early stage branches of the backtracking 
tree that do not lead to successful endpoints. 

A few more remarks about the implementation of the algo- 
rithm seem in place. The set compsub behaves like a stack and can 
be maintained and updated in the form of a global array. The sets 
candidates and not are handed to the extensions operator as a 
parameter. The operator then declares a local array, in which the 
new sets are built up, that will be handed to the inner call. Both 
sets are stored in a single one-dimensional array with the following 
layout: 


|not | candidates 
index values: 1.....7€....000.cce00 Ce... 


The following properties obviously hold: 


l. ne < ce 

2. ne = ce:empty (candidates) 

3. ne = O :empty (nor) 

4. ce = 0 :empty (nor) and empty (candidates) 


= clique found 


If the selected candidate is in array position ne + 1, then the second 
part of Step 5 is implemented as ne := ne + 1. 

In version 1 we use element ne + 1 as selected candidate. This 
strategy never gives rise to internal shuffling, and thus all cliques 
are generated in-a lexicographic ordering according to the initial 
ordering of the candidates (all points) in the outer call. 

For an implementation of version 1 we refer to [3]. 

Description of the algorithn—Version 2. This version does not 
select the candidate in position ne + 1, but a well-chosen candidate 
from position, say s. In order to be able to complete Step 5 as 
simply as described above, elements s and ne + 1 will be inter- 
changed as soon as selection has taken place. This interchange 
does not affect the set candidates since there is not implicit ordering. 
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Fig. |. Random graphs show the computing time per clique (in 
ms) versus dimension of the graph (in brackets: total number of 
cliques in the test sample). 
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The selection does affect, however, the order in which the cliques 
are eventually generated. 

Now what do we mean by “well chosen”? The object we have 
in mind is to minimize the number of repetitions of Steps 1-5 in- 
side the extension operator. The repetitions terminate as soon as 
the bound condition is reached. We recall that this condition is 
formulated as: there exists a point in nof connected to all points in 
candidates. We would like the existence of such a point to come 
about at the earliest possible stage. 

Let us assume that with every point in nor is associated a 
counter, counting the number of candidates that this point is not 
connected to (#umber of disconnections). Moving a_ selected 
candidate into not (this occurs after extension) decreases by one 
all counters of the points in not to which it is disconnected and 
introduces a new counter of its own. Note that no counter is ever 
decreased. by more than one at any one instant. Whenever a counter 
goes to zero the bound condition has been reached. 

Now let us fix one particular point in not. If we keep selecting 
candidates disconnected to this fixed point, the counter of the 
fixed point will be decreased by one at every repetition. No other 
counter can go down more rapidly. If, to begin with, the fixed point 
has the lowest counter, no other counter can reach zero sooner, 
as long as the counters for points newly added to not cannot be 
smaller. We see to this requirement upon entry into the extension 
operator, where the fixed point is taken either from nor or from 
the original candidates, whichever point yields the lowest counter 
value after the first addition to not. From that moment on we only 
keep track of this one counter, decreasing it for every next selec- 
tion, since we will only select disconnected points. 

The Algol 60 implementation of this version is given below. 

Discussion of comparative tests. Augustson and Minker [1] 
have evaluated a number of clique finding techniques and report 
an algorithm by Bierstone [2] as being the most efficient one. 


1 Bierstone’s algorithm as reported in [1] contained an error. 
In our implementation the error was corrected. The error was 
independently found by Mulligan and Corneil at the University 
of Toronto, and reported in [6]. 


457-P 2- 0 


Fig. 2. Moon-Moser graphs show the computing time (in ms) ver- 
sus k. Dimension of the graph = 3k. Plotted on logarithmic scale. 
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In order to evaluate the performance of the new algorithms, 
we implemented the Bierstone algorithm! and ran the three algo- 
rithms on two rather different testcases under the Algol system 
for the EL-X8. 

For our first testcase we considered random graphs ranging 
in dimension from 10 to 50 nodes. For each dimension we gen- 
erated a collection of graphs where the percentage of edges took 
on the following values: 10, 30, 50, 70, 90, 95. The cpu time per 
clique for each dimension was averaged over such a collection. The 
results are graphically represented in Figure 1. 

The detailed figures [3] showed the Bierstone algorithm to be 
of slight advantage in the case of small graphs containing a small 
number of relatively large cliques. The most striking feature, how- 
ever, appears to be that the time/clique for version 2 is hardly 
dependent on the size of the graph. 

The difference between version | and ‘‘Bierstone’’ is not so 
striking and may be due to the particular Algol implementation. 
It should be borne in mind that the sets of nodes as they appear in 
the Bierstone algorithm were coded as one-word binary vectors, 
and that a sudden increase in processing time will take place when 
the input graph is too large for “‘one-word representation” of its 
subgraphs. 

The second testcase was suggested by the referee and consisted of 
regular graphs of dimensions 3 < k. These graphs are constructed 
as the complement of k disjoint 3-cliques. Such graphs contain 
3* cliques and are proved by Moon and Moser [5] to contain the 
largest number of cliques per node. 

In Figure 2 a logarithmic plot of computing time versus k is 
presented. We see that both version 1 and version 2 perform sig. 
nificaritly better than Bierstone’s algorithm. The processing time 
for version 1 is proportional to 4*, and for version 2 it is propor- 
tional to (3.14)* where 3* is the theoretical limit. 

Another aspect to be taken into account when comparing 
algorithms is their storage requirements. The new algorithms 
presented in this paper will need at most 4M(M-+3) storage loca. 
tions to contain arrays of (small) integers where M is the size of 
largest connected component in the input graph. In practice this 
limit will only be approached if the input graph is an almost com- 
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plete graph. The Bierstone algorithm requires a rather unpredict- 
able amount of store, dependent on the number of cliques that 
will be generated. This number may be quite large, even for mod- 
erate dimensions, as the Moon-Moser graphs show. 

Finally it should be pointed out that Bierstone’s algorithm 
does not report isolated points as cliques, whereas the new al- 
gorithm does. Either algorithm can, however, be modified to pro- 
duce results equivalent to the other. Suppression of 1-cliques in 
the new algorithm is the simplest adaption. 


Acknowledgments. The authors are indebted to H.J. Schell 
for preparation of the test programs and collection of performance 
statistics. Acknowledgments are also due to the referees for their 
valuable suggestions. 
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Algorithm 
procedure output maximal complete subgraphs 2(connected, N); 
value N; integer N; 


Boolean array connected; 
comment The input graph is expected in the form of a symmetrical 
Boolean matrix connected. N is the number of nodes in the 
graph. The values of the diagonal elements should be true; 
begin 
integer array ALL, compsub[(1 : N); 
integer c; 
procedure extend version 2(old, ne, ce); 
value ne, ce;. integer ne, ce; 
integer array old; 
begin 
integer array new|1 : ce}; 
integer nod, fixp; 
integer newne, newce, i, j, Count, pos, p, $, Sel, minnod; 
comment The latter set of integers is local in scope but need 
not be declared recursively; 
minnod := ce; i:= nod := 0; oS 
DETERMINE EACH COUNTER VALUE AND LOOK FOR 
MINIMUM: 
for i := i+ 1 while i < ce (A minnod # 0 do 


begin 
p:= oldji); count:=0; | j:= ne; 
COUNT DISCONNECTIONS: 


for / := / + 1 while j < ce A count < minnod do 
if “| connected|p, old{j\] then 
begin 
count := count + 1; 
SAVE POSITION OF POTENTIAL CANDIDATE: 
pos := j 
end; 
TEST NEW MINIMUM: 
if count < minnod then 
begin 
fixp := p; minnod := count, 
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if i < ne then s := pos 
else 
begin s := i; PREINCR: nod := | end 
end NEW MINIMUM; 
end i; 
- comment If fixed point initially chosen from candidates then 
number of disconnections will be preincreased by one; 
BACKTRACKCYCLE: 
for nod := minnod + nod step —1 until 1 do 
begin 
INTERCHANGE: 
= old[s|;  old|s| := old|ne + 1); 
sel := old{[ne + 1] := p; 
FILL NEW SET not: 
newne := i:= 0; 
fori:=i+1 while i < ne do 
if connected(|sel, old{i)| then 
begin newne := newne + 1; 
FILL NEW SET cand: 
newce := newne; =ne+1; 
for i := i + 1 while i < ce do 
if connected(|sel, old{i}] then 
begin newce := newce + 1; 
ADD TO compsub: 
c:=c+1; compsub[c] := sel; 
if newce = 0 then 
begin 
integer /oc; 
outstring(1, ‘clique = ’); 
for /oc := 1 step 1 until c do 
outinteger(1, compsub|loc}) 
end output of clique 
else 
if newne < newce then extend version 2(new, newne, newce),; 
REMOVE FROM compsub: 
c:=c-— 1; 
ADD TO not: 
ne:=net+l; 
if nod > 1 then 
begin 
SELECT A CANDIDATE DISCONNECTED TO THE FIXED 
POINT: 


new|newne| := old{[i| end; 


new[newce] := old{i] end; 


S:= ne; 
LOOK: FOR CANDIDATE: 
s:=s+1; 


if connected|fixp, old|s|| then go to LOOK 
end selection 
end BACKTRACKCYCLE 
end extend version 2; 
for c := 1 step 1 until Ndo ALL{c] := 
c:= 0; extend version 2(ALL, 0, N) 
end output maximal complete subgraphs 2; 
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Algorithm 458 


Discrete Linear Li 
Approximation by Interval 
Linear Programming [E2] 


P.D. Robers* and S.S. Robers t+ 

[Recd. 26 Feb. 1971 and 6 Oct. 1971] 

* Ernst & Ernst, 1225 Connecticut Ave., NW, Washing- 
ton, D.C. 26636. 

+ 2308 Riviera Drive, Vienna, VA 22180 


Key Words and Phrases: discrete approximation, L, 
approximation 

CR Categories: 5.13, 5.41 

Language: Fortraa 


Description 

Purpose. This subroutine finds the discrete linear Z; approxima- 
tion using the suboptimization method of interval linear program- 
ming. 

Problem. The problem is stated as: 


minimize >. | ¢; | (1) 
1 

subject to 
Fx +e=t (2) 
where the matrix F = (f:;) and the vector ¢ = (¢;) are given; the 
vectors « = (e;) and x = (x,;) are to be found (i = 1, ...,2; 7 = 1, 
sen y M). , 

Such problems arise, for instance, if a given set of data { (s; , ¢:): 
i = 1,..., nm} is to be approximated, in the sense of the Z; norm, by 


a linear combination of given functions {g;(s); 7 = 1, ..., m}. 


Work on this algorithm was done while P.D. Robers and S.S. 
Robers were employed by The Research Analysis Corporation, 
McLean, Va., and the Mitre Corporation, McLean, Va., respec- 
tively. 

The problem is then: 


n 
minimize >> | «; | 

tacl 
subject to 


m 
>» a(sdx; te = 1; »,£=1,..,4, 
1 
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which has the form of problem ((1), (2)) if we let fi; = g;(si) for 
all i and j. 

Method. The algorithm works with the dual problem of ((1), 
(2)), which may be written: 


maximize Ty 


subject to 
Fly =0, -eSySe, 
where ef = (1, 1, ..., 1). This problem could be solved by any 


linear programming algorithm. The suboptimization method of 
interval linear programming, however, is specially suited to solve 
the dual problem because of its structure. It is an iterative method 
which solves a subproblem at each stage. 

The details of applying the suboptimization method to the Li 
approximation problem are contained in {1] and will not be pre- 
sented here. A general discussion and development of the sub- 
optimization method is contained in [2]. 

Program. Subroutine APPROX is completely self-contained 
and communication to it is solely through the argument list. It can 
be used in two modes: (1) to solve a problem from scratch; and 
(2) to solve a problem using an advanced start from a previous run. 
The advanced start mode is useful if the optimal value of the objec- 
tive function is too large on a given problem (i.e. the approximation 
is too poor) and the problem is to be rerun after adding additional 
columns to the F matrix (i.e. increasing the order of the approxima- 
tion). In some applications the user may wish to construct.the calling 
program in such a way that the advanced start mode for APPROX 
is easily utilized. For example, the program might punch out in- 
formation about an optimal problem solution, on request, which 
could automatically be read in at a later time for use as an advanced 
start if the problem was resolved. The main program might also 
contain a step-wise option which provides the capability for in- 
creasing the order of approximation iteratively until either the 
program runs out of data or a desired approximation accuracy is 
reached. The Fortran listing for a general purpose calling program 
which has both of the above features is available on request from 
the authors. Entrance to the subroutine APPROX is achieved by 
using the statement 


CALL APPROX (MD, M,N, T, FT, INBASE, AINV, Y, XOPT, 
ZOPT, IER). 


The meanings of the parameters in APPROX are as follows: 

MD, the mode of operation indicator. Note that if MD = 1, 
the problem is to be solved from the beginning. If MD = 2, the 
problem is to be solved from an advanced start from a previous run. 

M, the number of columns in the F matrix Gf MD = 2, M must 
be the modified value). 

N, the number of rows in the F matrix. 

T, the right hand side vector for the problem (dimension N). 

FT, the transpose of the F matrix (dimension M X N). 

INBASE, a vector which contains indices of basic columns in 
the optimal solution to the linear program when APPROX re- 
turns control (dimension JN). 

AINV, a matrix which contains the inverse of the matrix of 
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optimal basic columns when APPROX returns control (dimension 
N X N). 

Y, a vector containing the optimal dual solution when APPROX 
returns control (dimension N). Note that no initial values are re- 
quired for INBASE, AINV, and Y when APPROX is called with 
MD = 1. However, when MD = 2, these parameters should con- 
tain the saved values that were contained in the respective positions 
when APPROX returned control on the previous run which is now 
to be used as the advanced start. 

XOPT, a vector containing the optimal x-values when 4A PPROX 
returns control. 

ZOPT, the optimal value of the objective function when 
APPROX returns control. 

JER, error indicator. Note that JER = 0 at return is normal. If 
JER = | at return, a singular matrix was generated. If JER = 2, 
APPROX exceeded the iteration limit (10 & (m + 1)). The latter 
two conditions are abnormal returns, and the contents of INBASE, 
AINV, Y, XOPT, and ZOPT are unpredictable. 

As presently dimensioned, the size limitations for 4PPROX 
are M < 15 and N < 50. The dimension statements could clearly 
be changed to accommodate larger problem or ones with different 
proportions. Core storage and running time requirements for 
APPROX are modest. Since LZ; approximation will typically be 
“‘moderate”’ in size, the authors’ experience indicates that APPROX 
should adequately solve all problems of practical interest, although 
specific tests directed at determining size limitations have not been 
performed. The ultimate size limitation will probably depend on the 
conditioning of the particular coefficient matrix, which is indeed 
an interesting area of study in itself. 

Test Results. All tests have been performed on a CDC 6400 
computer. No breakdown in the method has occurred, and in general 
very accurate results have been obtained. 

Some examples: 


(i) fF = ©.5, 1.0, 2.0, 3.0) 


(10 10 10 1.0) 
FT=|00 10 20 3.0 
(00 10 40 9.0 


The optimal solution found by APPROX in three iterations is 
x? = (0.5000000, 0.6666667, —0.1666667), 

and the minimum value of (1) is 

z* = 0.8333333. 


(ii) ¢? = (1.52, 1.025, 0.475, 0.0100, —0.475, — 1.005) 


1.0 1.0 1.0 1.0 1.0 1.0} 
0.0 1.0 2.0 3.0 4.0 5.0 


The optimal solution found by APPROX in two iterations is 
xT = (1.520000, —0.5033333), 

and the minimum value of the objective function is 

z* = 0.07333333. 


(iii) #7 = (0.0, 1.5, 4.0, 3.0, 4.5, 5.0, 3.0, 7.0, 10.0) 

(10 10 10 10 10 10 10 10 1.2) 
0.0 10 20 30 40 50 60 70 80 
0.0 1.0 40 9.0 160 25.0 360 49.0 64.0 
(0.0 1.0 8.0 27.0 64.0 125.0 216.0 343.0 512.0 


The optimal solution found by APPROX after eight iterations is 


x? = (7771561 X 10-', .3333333 X 101, 
— 8437500 , .7291667 X 1074) 


and the minimum objective function value is 
z* = 5.250000. 


The above set of three problems was solved on the CDC 6400 
using APPROX in less than four seconds of central processor time. 
This estimate is the complete running time including Fortran com- 
pilation time of a main program and APPROX. 


FT = 


FT = 
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Algorithm 


SUBROUTINE APPROX (MDsMs No Ts FTs INBASEs AINVs Ys XOPT» 
* ZOPT»IER> 


Q 


THIS SUBROUTINE S@LVES THE DISCRETE LINEAR LI! 
APPROXIMATION PROBLEM USING THE SUBOPTIMIZATIGN METHOD OF 
INTERVAL LINEAR PROGRAMMING. THE PROBLEM T@ BE SOLVED IS 


MINIMIZE Z = ABSCEC1)) + oe + ABSCECN)) 
SUBJECT TO 
FX + E=T 


WHERE F IS A GIVEN N BY M MATRIXs T IS A GIVEN N VECTOR,» 
X AND E ARE VECT@RS OF VARIABLES HAVING DIMENSION M AND N 
RESPECTIVELY. 


SUBROUTINE APPROX IS DESIGNED T@ BE USED IN TWO M@DES~ 
(1) T@ SOLVE A PROBLEM FR@M SCRATCH» AND 
(2) T® SOLVE A PROBLEM USING AN ADVANCED START FROM A 
PREVIOUS RUN- 
THE ADVANCED START M@DE IS USEFUL IF THE OPTIMAL VALUE OF 
Z IS T@@ LARGE ON A GIVEN PROBLEM (I-E+ THE APPROXIMATION 
1S T@@ P@O@R>) AND THE PROBLEM IS TO .BE RERUN AFTER ADDING 
ADDITIONAL COLUMNS T@ THE F MATRIX (I-E+ INCREASING 
THE ORDER @F THE APPROXIMATION) - 


SUBR@UTINE APPROX IS COMPLETELY SELF-CONTAINED AND 
COMMUNICATI@N IS ACHIEVED S@LELY THR@UGH THE ARGUMENT 
LIST. THE MEANING @F THE PARAMETERS ARE AS FOLLOWS- 

MD = THE MODE OF @PERATI@N INDICATOR. 

IF MD = 1» THE PROBLEM IS S@LVED FROM THE BEGINNING. 

IF MD = 2. THE PR@BLEM IS TO BE S@LVED USING AN 

ADVANCED START FROM A PREVIOUS RUN. 

M = THE NUMBER @F C@LUMNS IN THE F MATRIX CIF MD = 2, 

M MUST BE THE M@DIFIED VALUE-) 

N = THE NUMBER OF ROWS IN MATRIX F. 

T = THE RIGHT HAND SIDE VECTOR FOR THE PROBLEM. 

FT = THE TRANSP@SE OF MATRIX F. 

INBASE = A VECTOR WHICH CONTAINS INDICES @F BASIC COLUMNS 
IN THE OPTIMAL SOLUTION T@ THE LINEAR PR@GRAM WHEN 
APPROX RETURNS CONTR@L. 

AINV = A MATRIX WHICH CONTAINS THE INVERSE OF THE MATRIX 
@F BASIC COLUMNS WHEN APPROX RETURNS CONTROL. 

Y = A VECT@R CONTAINING THE OPTIMAL DUAL SOLUTION WHEN 
APPROX RETURNS CONTROL. 

X@PT = A VECTOR CONTAINING THE GPTIMAL X-VALUES WHEN 
APPROX RETURNS CONTR@L. 

ZOPT = THE OPTIMAL VALUE @F THE @BJECTIVE FUNCTION 
WHEN APPR@X RETURNS CONTR@L~ 

IER = ERROR INDICATOR WHEN APPROX RETURNS CONTROL. . IER30 
INDICATES NORMAL RETURN. 


N@ INITIAL VALUES ARE REQUIRED F@R INBASEs AINVs AND 
Y WHEN APPROX IS CALLED WITH MD = le WHEN MD # 2s AN 
ADVANCED START IS INDICATED. THESE VARIABLES MUST THEN 
CONTAIN THEIR FINAL VALUES FROM THE PREVI@US RUN- THE 
USER WILL THUS WANT TO MAKE PROVISIOUS FOR SAVING THESE 
VALUES IN THE CALLING PR@GRAM S@ THAT THEY CAN BE REUSED 
IF NEEDED. 


THE CALLING PR@GRAM AND APPR@X SHOULD C@NTAIN THE 
FOLLOWING DIMENSION STATEMENT- 
DIMENSION TCN)sFTCMsN)» INBASECN) sAINVONe ND)» YON) sXOPTCM) 


APPROX MUST ALS@ CONTAIN. THE FOLLOWING DIMENSION 
STATEMENT- 
CDIMENSI@N BPCN)sBMCN)sARCN)» ARAINVCN) » QCN) »s GAMMACN) » DELCN) 
C »TEMPCN) 
DIMENSION T(50)sFTC15s 50)» INBASEC 50) sAINVC 50 50)2¥ C50) 
*»XOPTCIS) 
DIMENSION BP(50)sBMC50) sARC50) sARAINV( 50) QC 50), 
* GAMMAC SO)» DEL( S50)» TEMP( S50) 
INTEGER ENTsQQs ADBASE>s QP 
EQUIVALENCE (GAMMAs DEL) 
EPSI IS THE SINGULAR MATRIX ERROR MESSAGE CRITERION. 
THE VALUE @F EPSI CAN BE REDUCED FOR ILL CONDITIONED 
PROBLEMS. 
EPSI = -0000001 
IF (MD-EQ-2) GO TO 70 
PROBLEM T@ BE SOLVED FROM THE BEGINNING. 
DEFINE INITIAL SUBPR@BLEM. 
ItTz=t 
ADBASE = N+1 
D@ 20 I=1sN 
BPCI> = 1-0 
BMCI> = -1-0 
ARCID = FTC1sTI) 
INBASECI) = I 
C INITIALIZE AINV AS THE IDENTITY MATRIX. 
D@ 10 J=1,N 
10 AINVCIs J) = 0-0 
20 AINVCI,ZI) = 120 
C FIND THE INITIAL Y VECTOR. 
D@ 60 I=1tsN 
IFCTCID) 302 40550 
30 YCI>D = -1-0 
G@ T@ 60 
40 YCI> = O- 


ARQAQAAAAAAAAAAAARARAAAAUARAAAAARAANQAAUAARARQAAAAARAARAAAAAARGNAAARAANRRAAA 


aaa 


aa 
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G6 Te 60 
50 YCI) = 100 
60 CONTINUE 
G@ Te 100 
CPROBLEM T@® BE SOLVED FROM AN ADVANCED START. 
70 ADBASE = M+N 
DS 90 I1,N 
IF CINBASECI) LEN) GO TA 80 
BPCI) = 020 
BMCI) = 0.0 
G8: To 90 
80 BPCI) = 120 
BMC(1I) 2-1-0 
90 ARCI) # FTCMsI) 
IT = IT+!t 
100 BRM = 0-0 
BRP = 0-0 
C BEGIN GENERAL ITERATION. 


C DETERMINE DEL (THE AMOUNT OF INFEASIBILITY IN THE BOTTOM 
‘C CONSTRAINT OF THE CURRENT SUBPROBLEM). 


110 CONTINUE 
3 = 0.0 
D@ 120 I=1.N 
120 S = S + ARCI)*YC1) 
D = S-BRP 
IF €D.GT-0-) G@ TO 130 
Dz S - BRM 
IF €DeGE-0-.) GO TB 430 
130 CONTINUE 
D@ 140 I#1lsN 
ARAINVCI) = 0.0 
D@ 140 J=leN 
140 ARAINVCT) = ARAINVC I) ARC J) *AINVCO Js TD 
CALCULATE GAMMA VECTOR (THE VECTOR OF MARGINAL COSTS FOR 
MOVING T@WARD FEASIBILITY). 
DQ 170 It1sN 
TEMPCI)D = 0-0 
D@ 150 J=1s5N 
150 TEMPCI) = TEMPCI)+TCJ)*AINVC Je I) 
IF CARAINVCL) «NE«O0.0)G0 TO 160 
GAMMACI) 2 -1.0 
G@ Ta 170 
160 GAMMACI) 2 TEMPCI)/ARAINVCI) 
IF (DeLTe0¢0) GAMMACI)=-GAMMACI) 
170 CONTINUE 
FIND @ VECTOR (THE VECTOR @F INDICES WHICH INDICATE THE 
VARIABLES WHICH CAN BE CHANGED T@ MOVE T@WARD FEASIBILITY). 
Q@=0 
De 210 LalsN 
D@ 180 I=1.N 
IF CGAMMACI)-LT-0-0) G8 T@ 180 
S = GAMMACI) 
J=I 
G@ Te 190 
180 CONTINUE 
G@ TS 215 
190 D@ 200 I=1tsN 
IF CGAMMACI)«LT.O0 «O@Re GAMMACI)-GE-S)GO TB 200 
S = GAMMACI) 
J=I 
200 CONTINUE 
Qaag=aQ+l 
QacL> = J 
210 GAMMACJ) 2 -1.0 
CALCULATE DELTA VECT@R (THE VECT@R INDICATING THE MAXIMUM 
PERMISSABLE CHANGES IN THE VARIABLES). 
215 D@ 260 121,00 
K=QC1) 
S = 0.0 
Il = INBASECK) 
IF CII-LEeN) GOB TO 230 
Lo II-N 
D@ 220 Ja1sN 
220 S = S$ + FTCL»J #¥ >) 
G@ Ta 240 
230 S 2 YCII) 
240 IF CD*ARAINVC(K) -LE-00.) G8 T@ 250 
DEL¢CK) = BMCK)-S 
G@ T@ 260 
250 DELCK) = BPCK)-S 
260 CONTINUE 
DETERMINE P CTHE NUMBER OF VARIABLES CHANGED 
THIS ITERATION). 
D@ 280 I=1,40 
Psat 
S 2 0-0 
D@ 270 Jzleol 
K = OC) 
270 S = S*DELCK)*ARAINVCK) 
IF CABS(S).GE-ABS(D)) G@ T@ 290 
280 CONTINUE 
CALCULATE THETA (THE AMOUNT WHICH THE PTH VECTOR IS 
CHANGED). 
290 L = P-1 
$70.0 
IF CL eLTe 1) GO TO 310 
D@ 300 Jeiol 


K 2 Q¢J) 
300 S = 5+DELCK) *ARAINVCK) 
310 K=QCP) 


THETA = =(D+S)/ARAINV(K) 
UPDATE ¥ VECT@R CTHE OPTIMAL SOLUTION T@ THE CURRENT 
SUBPROBLEM). 

D@ 320 I21,N 
320 TEMPCI) = 0-0 

IF CL eLTe 1) GO TO 340 

D@ 330 I=1lel 

K 2 QcI) 
330 TEMPCK) = DELCK) 
340 K = QCP) 
TEMPCK) = THETA 
D@ 360 I=12N 
S = 0-0 
D@ 350 J=isN 
350 S = StAINVC Ls, J) *TEMPCJ) 


Cc 
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360 YCID = YCT)4S 
K = @CP) 
BPCK)>) = BRP 
BMCK) = BRM 
INBASE(K) = ADBASE 
CALCULATE NEW AINV MATRIX. 
D@ 370 I=1sN 
TEMPCI) = 0-0 
D@ 370 L=isN 
370 TEMPCI) = TEMPCI) +ARCL*AINVOLS IT) 
IF CABSCTEMPCK))»GT« EPSI ) G@ T@ 380 
SINGULAR MATRIX INDICATED- SET ERROR TAG AND TERMINATE. 
IER = 1 
RETURN 
380 D@ 390 I21.N 
390 AINVCI,ZK) = AINVCIsK) /TEMPCK) 
D@ 420 J=1sN 
IFC JeEQ-K) GO TO 410 
D@ 400 I21sN 
AINVCIsJ> & AINVC Ls JI-AINVC I» K) *TEMPC J) 
400 CONTINUE 
410 CONTINUE 
420 CONTINUE 
FIND S (THE LARGEST INFEASIBILITY)» AND ENTCTHE INDEX OF 
THE CORRESPONDING CONSTRAINT) « 
430 TEMPC!) = 0. 
L = M+tN 
D@ S10 Telsel 
D@ 440 J=1sN 
IFCINBASE(J)+«EQ-1)> GOB TO 510 


440 CONTINUE. 
IFCI-LE.N) G@ T@ 470 
S=0. 

Il = I-N 
D@ 450 J=1.N 
450 Sa S$ + FICIIsJ*YCD 


IFC(S-EQ-0-) G@ TO 510 
460 IFCABS(S) «LE. TEMP(1)> GO T@ S10 
TEMP(1) = ABSCS) 
ENT = I 
G@ Te 510 
470 S = YCI) 
IF €S=te) 490+ 490+ 480 
480 5S = S-l. 
G@ TO® 460 
490 IF €S#1t-.) 500.510.2510 
500 S = S+le 
G@ T@ 460 
510 CONTINUE 
S = TEMP(1) 
IF €SeEQ-0.) GO T@ 560 
PRESENT SOLUTION INFEASIBLE. START THE NEXT ITERATION. 
IT = IT+1 
DEFINE THE NEXT SUBPROBLEM. 
IF CENT-LE-N) G@ TO 530 


BRP = 0-0 
BRM = 0.6 
L = ENT-N 


D@ 520 J=loN 
520 ARCJ) = FTCLsJ) 


G@ T@ 550 
530 BRP = 1.0 
BRM = -1.0 


DO 540 J=1sN 
$40° ARCJ) = 0-0 
ARCENT) = 1-0 
550 ADBASE = ENT 
IF CIT+sLE«10*(M+1>)> GB TA 110 
ITERATION LIMIT EXCEEDED. SET ERR@R TAG AND TERMINATE 
IER = 2 
RETURN 
@PTIMAL DUAL S@LUTI@N FOUNDe CALCULATE PRIMAL SOLUTION. 
560 D@ 600 J=1s™ 
L = J+Nn 
D@ 570 I=1tsN 
IFCINBASECI)-EQ@-L) G@ TO 580 
570 CONTINUE 
XO@PTCJ) = O« 
G@ T@ 600 
580 TEMPC(I) = O-« 
D@ 590 L=1sN 
590 TEMPCI) = TEMPCI)+TCL*AINVCL,S TD 
XOPTCJ) = TEMPCI) 
600 CONTINUE 
ZOPT = O- 
D@ 610 I=1lsN 
610 ZOPT = ZOPT + YCI)*TCID 
RETURN 
END 


0 
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Algorithm 459 


The Elementary Circuits 
of a Graph [H] 


Maciej M. Syslo [Recd. 30 Apr. 1971 and 15 Aug. 1972] 
Department of Numerical Methods, University of 
Wroclaw, Wroclaw, pl. Grunwaldzki 2/4, Poland 


Key Words and Phrases: algorithm, graph theory, circuit 
search algorithm, path search algorithm, searching 

CR Categories: 3.74, 4.22, 5.32 

Language: Algol 


Description 

This algorithm investigates the existence of elementary circuits 
of a directed graph G. 

Data: n is the number of vertices; arc(i,j) is the Boolean pro- 
cedure with two parameters i, / of type integer, which is equal to 
true if (i, /) € G, and false otherwise. 

Results: 

(a) If the graph has no circuits, then the following sequence of 
symbols will be printed: 

Graph without elementary circuits. 

Ordered numeration of vertices i iy is --* in 

where (i, i2,..., én) is the permutation of numbers (1, 2, ..., 

n), and a new numeration of vertices such that if (j, i) € G, 

then j< i. 

(b) In the other case the following sequence of symbols will be 
printed: 

Graph contains the circuits: 

Circuit i te i 

Circuit jj oe) Ah 


Every elementary circuit will be printed once and only once. 

Method. This Algol program is based on the well-known 
method used while searching for cycles (circuits) in oriented graphs 
({1, 2]). However, before the beginning of this method, vertices 
which do not belong to any circuits are labeled (s{i] = 12). The 
process uses only two arrays: nodes{1:n], which contains either the 
ordered numeration of vertices or the vertices of the elementary 
path of the move; and s{1:7], the ith element of which denotes the 
investigation phase of vertex /. 

If the incidence matrix is stored one bit per entry, the process 
needs n[n/w] ++ 2n machine words, where w is the number of bits 
in a machine word. 

The program has been run on the ODRA-1204 computer and 
numerous examples were tested, including complete graphs. 


References 

1. Tiernan, J.C. An efficient search algorithm to find the elemen- 
tary circuits of a graph. Comm. ACM 13 (Dec. 1970), 722-726. 

2. Vantrusov, Ju.I. About the Analysis of Finite Graph, in Mathe- 
matical Programming (in Russian). Moscow, 1966, pp. 68-77. 
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Algorithm 

begin 
integer 1; 
ininteger (2, 1); 
begin 


integer array s, nodes [1: #1); 

integer i, j,k, k1, k2, k3, k4, m1, n2, sj; 

Boolean /; 

comment The body of procedure arc and all other declara- 
tions connected with it should be inserted here; 

al:= —-n— |; n2:= —n — 2; 


J := true; 


for i := 1 step | until 1 do s{i] := 0; 
fork := 1,k + 1 whilek <2 A fdo 


begin 
for i := 1 step 1 until 1 do 
if s[i] = O then 
begin 


for j := | step 1 until » do 
if s{j] = O then 
begin 

if arc(j, i) then go to nexti; 
end s[j] = 0, /; 
nodes{k] := i; s{i] := n2; 
go to nextk; 


next: 


end s|[i] = 0, i; 
J := false; 


nextk: 


end k; 

if f then 

begin 
outstring (1, ‘Graph without elementary circuits.’); 
outstring (1, ‘Ordered numeration of vertices’); 
outarray (1, nodes) 


end f 
else 
begin 


rep: 


for j := 1 step 1 until 1 do 
if s{j] = O then 
begin 
for i := 1 step 1 until n do 
if s[i] = O then 
begin 
if arc(j, i) then go to nextj 
end s[i] = 0, i; 
s{j] := n2; 
go to rep; 


next): 


scan: 


end s[j] = 0, /; 
outstring(\, ‘Graph contains the circuits:’); 
k2 := 1; 


for k3 := s[k2] while (A3 = n2 VV k3 = nl) A k2 <ndo 
begin 
if k3 = m1 then s[k2] := 12; 
k2:=k2+1 
end k3; 
for k := k2 + 1 step 1 until » do 
if s[k] = nl then s{k] := 72; 


COLLECTED ALGORITHMS (cont.) 


if k3 = 0 then 
begin 
i:= 1; kl := nodes({1) := k2; 
cd: 
i:=i+1; 
cdl: 
for j := abs(s[k2]) + 1 step | until 1 do 
begin 
si = sll; 
if sj + n2 then 
‘begin 
if arc(j,k2) \ (k3 = OV sj = nl V sj > O) then 
begin 
s[k2] := if k3 = 0 then / else —/; 
if sj = n1 then 
begin 
if k3 = O then k3 := k2; 
s[j] := 0 
end sj = ni; 
if s[/] > O then 
begin 
outstring(1, ‘ Circuit’); 
k4:=O0;k := i; 
outinteger(\, j); 
for k := k — 1 while 44 + j do 
begin 
k4 := nodes{|k]; 
outinteger(1, k4) 
end k; 
g0 to cdl 
end s[j] > 0 
else 
begin 
k2 := nodes|i| := 7; 
go to cd 
end sj < 0 
end arc(j, k2)--- 
end sj ~ 2 
end /; 
s|k2| := nl; 
if k2 + k1 then 
begin 
f:=i-t; 
if k3 = i — 1 then k3 := 0; 
k2 := nodes[i — 1]; 
go to cal 
end k2 # k1, 
go to scan 
end k3 = 0 
end — f 
end 
end 


459-P 2- Rl 


Remark on Algorithm 459 [H] 
The Elementary Circuits of a Graph [Maciej M. Syslo, 
Com. ACM 16 (Oct. 1973), 632-633] 


Maciej M. Syslo (Recd 11 Feb. 1974) Department of 
Numerical Methods, University of Wroclaw, pl. Grun- 
waldzki 2/4, 50384 Wroclaw, Poland 


Corrections are needed in the algorithm: 
(i) Insert: 


k3 += s[k2); 


after the statement end k3; 
(ii) The 9th line from the end of the algorithm 


if k3 = i — 1 then k3 := 0; 
and insert the line 

if k3 = k2 then k3 := 0; 
before the statement go to cdl. 
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Algorithm 460 


Calculation of Optimum 
Parameters for Alternating © 
Direction Implicit Procedures 
[D3] 


Paul E. Saylor ee 
Department of Computer Science, University of Illinois, 
Urbana, IL 61801 

and 

James D. Sebastian 

Boeing Computer Services, Seattle, WA 98124 

[Recd. 26 May 1971 and 12 Nov. 1971] 


Key Words and Phrases: elliptic difference equations, Peaceman- 
Rachford, Douglas-Rachford, W. B. Jordan, optimum parameters, 
alternating-direction-implicit, ADI 
CR Categories: 5.14, 5.17 
Language: Fortran 


Description 

Purpose. Let Gz = s be a system of simultaneous equations, 
where G is a positive-definite matrix, s isa known vector, and z an 
unknown vector. Such systems arise, for example, as the result of 
the discretization of an elliptic boundary value problem. Beginning 
with an initial approximation z), one version of the Alternating 
Direction Implicit (ADJ) method [2] determines successive ap- 
proximations to the true solution, z, from two iterative formulas, 


Zkeq1/2 = (H + weyl)s _ (H + wppl)— (VV — we H1)Z% 
and 
Zen = V+ wevl)ts — V + wry) (A — wrvl)Zesr , 


k = 0,1, ...,m — 1, where A and V are symmetric matrices such 
that G = H+ V, Jis the identity matrix, wx and wxy are param- 
eters chosen to accelerate convergence, and m is the number of 
iterations. When H and V commute, i.e. HV = VH, the parameters 
that yield fastest convergence for fixed m, the optimum parameters, 
are the solution to a min-max problem that has been completely 
solved by W.B. Jordan using techniques of elliptic function theory 
[1, App. and 2]. 

An algorithm for computing optimum parameters based on 
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Jordan’s solution does not appear to be generally available, and it 
is our aim to provide one here. 

Method. The formulas used in the subroutines are taken from 
the solution of Jordan as presented in [2]. We refer to [2] for their 
derivation, and observe here only that, given either m or u,, , but not 
both, where uw», is the spectral norm of the m-step error propagation 
matrix 

m—1 


Ty, = I] V + weyl) (A — weyl)(H + weal tV — wenl), 
=9 


the subroutine computes the parameters w; y and w; 7 that minimize 
the value of m or u,, , Which is not given, while satisfying the in- 
equality 


I Z— Zn lle — Kull Z2— 2 lle : (1) 


This makes the Jordan algorithm more flexible than alternative 
methods of computing parameters, due to Peaceman and Rachford 
[4] and Douglas and Rachford [5]. These methods compute an in- 
teger m and a satisfactory but not optimal sequence of m param- 
eters such that (1) is satisfied, given yz, . Unlike the Jordan algorithm, 
it is impractical to specify the number m of iterations then 
compute a sequence of m parameters and an estimate of u, . 

For m = 2‘, where k is a nonnegative integer, another algorithm 
for computing an optimum sequence of m parameters and an 
estimate of 4», is due to Wachspress [2]. Again, the greater flex- 
ibility of the Jordan algorithm is apparent. However, it employs 
truncated infinite series, whereas, for m = 24, the Wachspress 
algorithm only requires the approximation of square roots. 

Program. The number of iterations, m, and the spectral radius, 
Mn, are represented in the argument list by JT7TNS and DMU re- 
spectively. Iteration parameters w,4 and w,y are the kth entries of 
the arrays OMEH and OMEV respectively. The dimension of each 
array is the value of NV. The program parameter /O PT, specified on 
entry, determines one of two options: 

(i) If JOPT has the value 1, then J7NS must be specified on entry. 
Optimum parameters OMEH(1), OMEV(1), ..., OMEH(ITNS), 
OMEV(ITNS) are computed together with the value of DMU. 
(ii) If JOPT has the value 2, then DMU must be specified on entry. 
A value of ITNS is then computed with optimum parameters 
OMEH(1), OMEV (1), ..., OMEH(ITNS), OMEV(UITNS) such 
that J7NS is the minimum number of iterations for which y,, is less 
than or equal to the value of DMU. 

In option (ii), if 7NS.GT.N is satisfied, then /7NS is set equal 
o N, corresponding optimum parameters are computed, and the 
error flag JER is set to 2. Other possible values of JER are 0 and 1. 
These indicate that computation was normal or that some input 
parameter was improper. 

Estimates of the minimum and maximum eigenvalues of H are 
assigned on entry to parameters 4 and B respectively. Estimates 
of the minimum and maximum eigenvalues of V are assigned to C 
and D. Gerschgorin’s theorem yields satisfactory estimates of B 
and D, whereas estimates of A and C may be computed from an 
algorithm suggested by Wachspress [3]. 


COLLECTED ALGORITHMS (cont.) 


Machine dependent constants. The constants —90, — 10, 10, and 
30 in the three JF statements following the last comment card in 
the program are machine dependent. At the point where this com- 
ment occurs, DMU is to be computed from the formula 


DMU = (2.D0*«DEXP(TEMP)/ 
(1.D0 + 2,D0*DEX P(TEM P)**4)) «2 


but for greater efficiency and to avoid underflow, overflow, or 
argument out of range conditions on the IBM 360, the formula 
actually used to compute DMU is chosen according to the value of 
TEMP. These constants are used as follows: If TEMP < —90 or 
TEMP > 30, then DMU < 10*+*—77, as may be verified from the 
above formula, and the program simply sets DMU = 0. Let dfl(X) 
denote the IBM 360 Fortran internal double precision floating 
point representation of X. It is easily verified that if TEMP < —10 
then 


dfl(1.DO + 2.D0*«DEXP(TEMP)«*4) = dfl(1.D0), 
and if TEMP > 10, then . 
dfi(2.D0 + DEXP(TEMP)*«—4) = djl(2.D0). 


Thus DMU is computed to full machine precision from 
DMU = 4.D0«DEX P(2.D0*TEMP) when TEMP < —10, and from 
DMU = DEXP(—6.D0*TEMP) when TEMP > 10. Finally, 
DMUuU is computed from the formula given at the beginning of 
this section when —10 < TEMP < 10. 

Tests. The program has been tested on the 360/75 by applying 
the parameters to the solution by ADI of Gz = s, withz = s = 0. 
In each test, G is a 900 by 900 matrix obtained from discretizing 
ad*/dx? + Bd?/dy*, a and B constants. Therefore, G = aH + BY, 
where H and V are discrete analogs of 9?/dx? and 9?/dy? respectively. 
The initial vector, zo , was chosen to have a nonzero component in 
the direction of each of the eigenvectors of H or V. 

To test option (i), two pairs of values of a and 8 were used. For 
a = 4, B = 2, called the model problem /7NS was assigned the 
values [TNS = 1, 2, ..., 20. For a = 4,8 = 200, called the gen- 
eralized model problem, and considered a more difficult problem 
for ADI, ITNS was assigned ITNS = 15, ..., 20. In each case 
Z1, «., Z7¢ng Were computed and the validity of E.LT.DMU was 
tested where E is the /, relative error defined by 


E = |[z — Zypys |le/l|z — 20 lla. (2) 


With a = 3,8 = 2, the comparison E.LT.DMU was satisfied for 
ITNS = 1, ..., 29, whereas for ITNS = 30, it failed. Performance 
of the program may nevertheless be considered satisfactory since 
for ITNS = 30,DMU was less than .9 D—17, a value beyond 
practical interest and sufficiently small that one may expect to ob- 
serve roundoff. In the second case for a = 3,8 = 200, E.LT.DMU 
was satisfied for JTNS = 15, ..., 18, whereas for 19 and 20 the 
comparison fails. For each failure, DMU was less than .2 D—25. 

To test option (ii), parameter DMU was assigned the values 
DMU = 107‘ fori = —1, —3,..., —15, then J7NS and the optimum 
parameters for TNS iterations were computed and the validity of 
E.LT.DMU checked. For each value of ITNS, E.LT.DMU was 
satisfied for both problems. 

Observe that tests of this kind are not in fact objective and do 
not test whether the iteration parameters are optimal; they verify 
that values of DMU or ITNS, depending on the option, are con- 
sistent with the results obtained by solving actual problems with 
the computed iteration parameters. To evaluate the accuracy of 
the program more objectively, we compared AD/P as follows to a 
FORTRAN version of the Wachspress algorithm for computing 
exact parameters when the number of iterations is a power of 2. 
Values of the optimum parameters and the spectral radius of the 
iteration matrix were computed from each program for 2, 4, 8, 16, 
and 32 iterations, with other input data taken from the model 
problem and generalized model problem described above. In addi- 
tion each set of optimum parameters was applied to the solution of 
the model problem and generalized model problem. 

Comparisons between the output of each program were made by. 
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computing the relative difference of the spectral radii and each pair 
of optimal parameters. (The relative difference of a; and ay is 
defined to be | a; — ay|/a; where a; is computed from AD/JP and 
aw is computed from the Wachspress algorithm.) 

For the model problem the relative difference of each quantity 
was bounded by 10- for 2, 4, 8, and 16 iterations. For each number 
of iterations, the /, relative errors (2) of each pair of computed 
solutions were in agreement to five significant digits. In each case 
the relative error of the computed solution as computed from ADIP 
parameters was larger (in the sixth decimal place or higher) than 
the relative error computed from the Wachspress exact parameter 
program. This confirms the expectation that the Wachspress al- 
gorithm is more accurate, although the difference is slight, since this 
is a comparison of relative error. 

For 32 iterations in the solution of the model problem, the dif- 
ferences between the two algorithms were somewhat greater, but 
with the performance of AD/P superior. The /, relative error in the 
solutions as computed by ADIP and the Wachspress program were 
respectively .71 D—18 and .75 D—18. The difference in these 
values is not significant. For, 32 is an unrealistic number of itera- 
tions. Also, any difference in relative error does not imply the same 
difference in accuracy of the computed solutions. Here, each ap- 
proximation agrees with the exact solution to 18 significant figures 
in the /, sense. Rather than this, the significant feature of the com- 
parison is that ADIJP is more reliable when input parameters are 
nontypical. This is also evident in testing with the generalized model 
problem. 

In runs of the generalized model problem for 2, 4, and 8 itera- 
tions, the differences between corresponding /, relative errors of the 
approximate solution were greater than for the model problem 
but still insignificant. For 2 and 4 iterations, the /, relative error 
obtained by AD/P parameters was smaller than that obtained by 
parameters from the Wachspress exact parameter program. Param- 
eters from the Wachspress program yielded more accurate results 
only for 8 iterations. For 16 and 32 iterations, the response of the 
Wachspress exact parameter program was bizarre. Certain param- 
eter values returned were negative whereas exact parameters are 
positive. In each case, the spectral radius was assigned the value 
zero. Of course the conditions of the runs are extreme. They represent 
an attempt to reduce the /; relative error to unrealistically small 
values. The results again indicate that 4D/P performs more reliably 
under adverse conditions. In fact, ADIP is self-consistent for 16 
iterations in reducing the relative error to less than the computed 
value of the spectral radius, although for 32 iterations, the self 
consistency test fails. 

In conclusion, these tests indicate that the Wachspress exact 
parameter program yields more accurate values under ideal condi- 
tions, but that the difference is of no practical significance. When 
the requirements of the problem are severe or fanciful, ADIP is 
more reliable than the Wachspress exact parameter program. 
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Cc 


Cc. 


c 


c 


SUBROUTINE ADIPCAs Bs Cs Ds I@PT» Ns ITNSs DMUs OMEHs 
* @MEVs, IER) 


D@UBLE PRECISION As ALFAs Bs BETAs BMDs BPDs C» CMAs CPAs 


* Ds DEL» DEXPs DKPRs DLOGs DMs DMUs, DRJs DSQRTs BJ» 
* BMEHCN)s OMEVCN)s PISQs TEMP» TEMPAs TEMPBs TEMPC 
DATA PISQ/9 «869 604401089359D0/ 
GIVEN A MATRIX EQUATIGN GZ=Ss WHERE G IS A REAL POSITIVE 
DEFINITE MATRIX» S IS A KNOWN» AND Z THE UNKNOWN» VECTOR. 
LET H AND V BE SYMMETRIC COMMUTING MATRICES SUCH THAT 
G=H+V.- BEGINNING WITH AN INITIAL APPROXIMATION 2(0)» LET 
SUCCESSIVE APPROXIMATIONS TO Z BE GENERATED FROM 
ZCK+E/2)= CH+OMEHCK #L RC = 1 CS = CV OMEHCK) ID *ZCK) D 
ZCK+1) = CVFOMEVCK) KID ¥RC R12 CS- CH-OMEVOK) €ID*ZCK4+1/2) 25 
WHERE I IS THE IDENTITY MATRIX. FINALLYs LET ONE OF ITNS 
AND DMU BE GIVEN. THEN THIS SUBROUTINE COMPUTES THE PAR- 
AMETERS QMEHCK)s OMEVCK) THAT MINIMIZE THE VALUE OF 
DMU AND ITNS WHICH IS N@T GIVEN WHILE SATISFYING THE 
INEQUALITY /2-ZCITNS)/+LE+DMU#/Z-Z(0)7s WHERE 7/7 DENOTES 
THE EUCLIDEAN NORM. 
THE SUBROUTINE ARGUMENTS HAVE THE FOLLOWING MEANING. 
A AND B ARE LOWER AND UPPER BOUNDS, RESPECTIVELYs @N THE 
EIGENVALUES @F He C AND D ARE LOWER AND UPPER BOUNDS» 
RESPECTIVELYs @N THE EIGENVALUES OF V» THE VALUES OF AsBs 
C» AND D MUST SATISFY THE INEQUALITIES 0-LT»-AsLEsBs AND 
OLTCoLEsDe 
I@PT DENOTES THE INPUT OPTION. IF I@PT=1 THEN THE VALUE OF 
ITNS MUST BE SPECIFIED ON ENTRY AND DMU WILL BE COMPUTED. 
IF I@PT=2 THEN THE VALUE @F DMU MUST BE SPECIFIED @N ENTRY 
AND ITNS WILL BE COMPUTED. 
IF I@PT=1 THEN THE INEQUALITY 1-+LE-ITNS-LE-N MUST BE 
SATISFIED, WHILE IF I@PT=2 THEN THE INEQUALITIES N-GEo1 
AND DMU.GT-O MUST BE SATISFIED. 
N IS THE DIMENSION OF THE ARRAYS OMEV AND @MEH. 
ITNS {S THE NUMBER OF ITERATIONS TO BE PERFORMED. 
DMU I5 A BOUND ON THE SPECTRAL NORM OF THE ITERATION 
MATRIX TO THE ITNS PQ@WER. IF 18PT=2 A VALUE F@R DMU MUST 
BE SPECIFIED @N ENTRY» AND THIS VALUE MAY BE CHANGED BY 
ADIP (SEE IER» BELOW). 
THE VALUES @F THE REQUIRED PARAMETERS ARE CONTAINED IN THE 
LOCATIONS OMEVCK)s OMEHCK)s K=lseeesTTNS ON EXIT FROM 
ADIP. 
IER I5 A VARIABLE WHOSE VALUE @N EXIT FROM ADIP HAS THE 
FOLLOWING MEANING 
TER=0 SIGNIFIES COMPUTATION @F THE PARAMETERS HAS BEEN 
PERF@RMED WITH N@ CHANGE @F THE VALUES SPECIFIED ON ENTRY-+ 
LER=1 SIGNIFIES THAT SOME INPUT VALUE VI@LATES THE 
CONSTRAINTS GIVEN AB@VEs AND HENCE THE PARAMETERS HAVE NOT 
BEEN COMPUTED. :* 
IER=2 (POSSIBLE ONLY IF I@PT=2) SIGNIFIES THAT FOR THE 
GIVEN VALUE @F DMU» THE C@MPUTED VALUE @F ITNS WOULD BE 
GREATER. THAN Ns S@ THAT ITNS HAS BEEN SET EQUAL TO N AND 
DMU HAS BEEN REC@MPUTED AS FOR I@PT=1. 
TEST INPUT VALUES FOR RANGE ~ 
IER = 1 : 
IF € «N@Te CAsGT.0.D0 «AND. A-LE-.B «AND. C-GT-0-DO -AND- 
* CeLE-D)) GO TO 90 
IF € .N@T. CI@PT.EQ-1 «OR» IOPT»EQ.2)) GO TS 90 
IF CI@PT-EQ.2) GO TO 10 
IF ¢ sN@T.s CITNS-GEol eANDs LTNSeLEeNI) GO TO 9O 


10 IF € sN@Te (NeGE.1 «AND. DMU.GT.0-D0)) GO TO 90 
PRELIMINARY COMPUTATIONS COMMON TO BOTH OPTIONS 


ise) 
<4 
w=] 
ununnas 
aaquwo 
Ptr 


>rPoUT 


DM = 2-DO*((D-C)*(€B-A))/¢(BPD*CPA) 
DKPR = 14D0/C€1-D0+DM+DSQRTCDM*¢ DM+2-D0))) 
DEL = 0.DO 
IF (BMD.EQ-0-D0 »-AND. CMA+«EQ-0-D0) G® TO 30 
TEMP = BPD*DKPR | 
DEL = 26D0*( TEMP-CPA) /(CPA*BMD+ TEMP*CMA) 
30 ALFA = DKPR*(CMA+2.DO*DEL*A*C) /CPA 
BETA = (€2-D0+DEL*BMD> /BPD 
TEMP = DKPR/4.D0 
END @F STAGE 1 ~ COMPUTE ITNS FOR OPTION 2 
IF CIOPT-EO.1) G@ T@ 40 
ITNS = (€DL@GCDMU/4.D0)*DLOGCTEMP))/PISQ + 1-D0 
IF CIINS-LE«N) GO TO 40 
ITNS = N 
IER = 2 
STAGE 2 - COMPUTATION OF THE OPTIMAL PARAMETERS 
40 TEMPA = 2#*ITNS 
TEMPB = TEMP*TEMP 
D@ SO Jats I TNS 
DRJ = 2*J - 1 
DRJ = DRJ/TEMPA 
TEMPC = TEMP**DRJ 
OJ = 2-DO0*¢( TEMPC+TEMPB/TEMPC) /( 1 ¢DO0+TEMPCXTEMPC ) 
TEMPC = DEL*@J 
OMEV(J) = (OJ-ALFA)/(BETA-TEMPC) 
Q@MEHCJ) = CQOJ+ALFA)/(BETA+TEMPC) 
50 CONTINUE 
IF CIOPT-EQ-2 eANDe IER«EQ-0) GO TO 90 
END @F STAGE 2 ~ C@MPUTE DMU FOR OPTI@ON 1 
TEMPA = ITNS 
TEMP = PISQ*TEMPA/DL@GC TEMPB*( 1 -DO+8 «DO*TEMPB) > 
CHOOSE PROPER FORMULA TO AVOID UNDERFLOW OR OVERFLOW 
IF (CTEMP+LE+«-90-D0 «OR- TEMP-GE+30+D0) GB TO 60 
IF (TEMPeLEe-10-D0) G8 TS 70 
IF CTEMP-LT.10-D0) GO TO 80 
DMU = DEXP(-6¢DO* TEMP) 
G@ T@ 90 ~ j 
60 DMU = 0-DO 
G@ To 90 
70 DMU = 4+¢DO*DEXP(2+DO*TEMP > 
G@ TB 90 
80 TEMP = DEXPCTEMP) 
DMU = €C€2-eDO*¥TEMP) /€1-DO04+26DO*TEMP** 4) )*k2 
90 RETURN 
END 
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Cubic Spline Solutions to a 
Class of Functional 
Differential Equations [D2] 


F.J. Burkowski and W.D. Hoskins [Recd. 3 June 1971 
and 27 Apr. 1972] 

Department of Computer Science, University of Mani- 
toba, Winnepeg, 19, Manitoba, Canada 


Key Words and Phrases: differential equation, spline 
approximation 

CR Categories: 5.17 

Language: Fortran 


Description 

Purpose. The subroutine SPNBVP calculates a piecewise con- 
tinuous approximation to the solution of the boundary value 
problem 


X(t) = PX) + QAMX(G)) + RW) (1) 


on the interval |A, B]. The existence of such a solution has been 
demonstrated by Grimm and Schmitt [5], and it should be noted 
that the boundary values take the form of two continuous functions 
U(t) and V(r) specified on the two intervals [a, A] and [B, Q] re- 
spectively where 


and Q = max 
te( A, B] 


a= min 
te( A,B] 


{G(r), A} {G(t), B}. 
Boundary value problems of this type can arise in the study of 
variational problems in control theory where the problem is com- 
plicated by the effect of time delays in signal transmission. For 
example, one may wish to determine extrema of the functional 


Ja F(t, x(t), x(g(t))) de 
under the conditions 
x) = ~(),t < a, x(d) = B. 


Under suitable hypotheses on F, this problem leads to a boundary 
value problem of the above type. Such problems have been treated 
by El’sgol’ts [3]. Other related works are the survey papers [6, 7, 8, 
9]. 

Method. SPNBVP utilizes an iterative scheme where each 
iterate is a cubic spline {4, p. 1] serving as an approximation to the 
true solution. Burkowski and Cowan [2] have demonstrated that 
such an iterative procedure will converge to an approximation of 
the solution if the condition 


max {| P(t) | + 8) 1 QM |} < 8/((B — A)? + 6H?) 
ASI<B 


is satisfied where A is defined below and 
g(t) = 1 if G(r) €[A, Bl, 
O if G(t)¢[A, B]. 
The interval (A, B] is partitioned into N equal subintervals of 
length H = (B — A)/N. That is we have a sequence of “‘knots”’ 


il 


ii 


A=W<th<st+ Sty =B 
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such that t; — t;.1 = H forj = 1, 2,..., N. For our purpose the 
equation of the cubic polynomial in the interval [7;1, 7;] may be 
written as 


S() = xG-a(Uy — 1/6A) + 2," — 1)-1)8/6H) 
+ (xj-1 — (H?/6)x5-) (4; — )/A) 
+ (x; — (H?/6)x,") (it — t5-1)/F) (2) 
where x; = X(t;) and x;” = X"(t,). 
In order to ensure that the spline has the necessary continuity 


conditions at the knots, the x; and x;” values are subject to the 
following “continuity equations” 


Xjat — 2x; + xy = (A2/O) [Xj + 4x;” + x5) (3) 
valid for j = 1, 2, 3,..., MN — 1. Using the central difference opera- 


tor 6 this can be rewritten as 
dex; = (8/6) + 1)x;” 7 =1,2,3,..,N~] (4) 


In [2], it is also demonstrated that the accuracy of the spline ap- 
proximation is proportional to H?. 

The difficulty in constructing solutions to such equations as (1) 
arises in having to evaluate terms such as X(G(r;)) in order to 
calculate the value of X”(r) at a point 7, . By using splines a con- 
tinuous rather than discrete approximation to the solution is 
generated and hence a value for X(G(r;)) can be determined even if 
G(t;) does not correspond to a value s; for some j. Since a cubic 
spline is used, the method is superior to any algorithm which 
simply evaluates X(G(/;)) by using a linear interpolation. 

The basic strategy used in SPNBV P is to calculate a sequence 
of successive splines or essentially a sequence of vectors each con- 
taining the values x;, i = 1, 2,..., MW — 1. Once a set of x; values 
is calculated, we may use the continuity equations and boundary 
values to evaluate the x,” values and hence determine the cor- 
responding spline. 

The x;, i = 1, 2,..., MN — 1 are treated as unknowns in the 
system of equations 


Bx; = H?((6/6) + 1) P(j)xy + Q0j)X(GG)) + RW)} 
ek eee ee | 


derived from (1) and (4). The solution of (5) is obtained by setting 
up the matrix equation 


(MAT)(X) = (VM) (6) 


where the vector (X) contains the unknowns x; ,/ = 1,2,---°,N—- 
l and the matrix (MAT) contains the coefficients of the x; un- 
knowns. The vector (VM) contains values arising from the func- 
tion R(t) and also other quantities discussed below. In the calcula- 
tion of a spline, the iterative character of the algorithm arises from 
the fact that the values X(G(r;)) are calculated from the previous 
spline or from the current spline depending upon the nature of 
G(t;). More precisely, if for a certain r; we have G(r;)¢[A, B], then 


X(GQj)) = UGG))) if G@;) S A 
ViG;)) if G(t;) 2 B. 


Since the value of this term is independent of any x; , an appro- 
priate entry is made in the vector (VM). If G(t;) = #, for some 7; , 
then X(G(r;)) = x. , and in this case (MAT) is accordingly modi- 
fied. If neither of these last two conditions prevails, we set 
X(G(t;)) = S(G(t;)) in eq. (2), and hence X(G(r;)) is expressed in 
terms of two unknowns x; and x;_1 (for some k) and also in terms 
of x,” and x/z-1, two values which are taken from the previous 
spline. Thus we use only the x;” values of any spline when we 
calculate the next successive spline. To start the iteration we assume 
an initial spline with x;” = 0, 7 = 0, 1, 2,..., N. 


(5) 
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Program Call. Parameters in the call statement for SPNBVP 
include the following: 

A, B are the endpoints of the inverval under consideration. 

NP is the number of knots in [A, B], and hence NP = N + 1. 

NK is the number of interior knots in [A, B], and so NK = 
N—1. 

X will contain the values of x;, / = 1, 2, 3,...,. M — 1 on 
return to the calling program. 

XD P will contain the values (H?/6)x;” , 7 = 0,1,2,...N. 

EP SPNBVP returns to the calling program when convergence 
has progressed so far that 
N-1 N-1 
D> Ix el < EPX | x. 
i-1 i=1 


Thus, if EP is set to the value 10-“"+)), convergence of the iteration 
to the approximation has been attained if the x; have m persistent 
figures in successive iterates. Hence this may be considered as a 
machine dependent constant. The term x, denotes the value of 
x, In the previous spline. 

The remaining eight variable names have been included in the 
parameter list in order to achieve execution-time dimensioning of 
arrays. The user need only concern himself with the dimension and 
type of each of these arrays as explained in the comment cards. 

SPNBVP requires six function subprograms defining the func- 
tions U(r), V(t), P(), Q(O, R(t), and G() as defined above. Four 
other subroutines are required. GAGB is used when x» and xy are 
calculated. These quantities require rather special treatment since 
the continuity equations apply only to the internal knots 7; , / = 
1, 2,..., N — 1. SOLVE is simply a special routine which when 
given x; values quickly calculates x;”, 7 = 1, 2,...,N — 1 by using 
the continuity equations. Finally, the user is responsible for the 
provision of routines which compute the solution of the matrix 
system (6). In this case the routine LUDCMP replaces (MAT) by 
its LU decomposition. The routine LUSUB uses this new matrix 
and the vector (VM) to compute the next iterate (XY). The descrip- 
tion and analysis of such routines are given in [1, pp. 93-110]. 
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SUBROUTINE SPNBVPCAs Bs NP» NK» Xs XDPs EPs GTs KGs VPs 
* VQ», VR» VG» MATs VM) 
THIS ALGORITHM COMPUTES ITERATIVELY A CUBIC SPLINE 
APPROXIMATION TO THE SOLUTION OF THE DIFFERENTIAL EQUATI®@N 
M#XCTISPCTIXCTI +00 TIXCGCTII+RCT) ON THE INTERVAL CAsB> 
WITH B@UNDARY CONDITIONS GIVEN BY UCT) IF T-LE-A AND 
VCT> IF T.GE.Be 


aagagaa 


NQMAQaANNAaANAANAANAANANNAANAAAANAAaAAAAAAANANAAAMAAAANAAAANQ 


Cc 


C ADD INTO (MAT) 
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A AND B ARE TWO REAL VARIABLES DEFINED AS ABOVE. 

NP AN. INTEGER VARIABLE SPECIFYING THE NUMBER @F KNOTS 
ON THE INTERVAL (AsB)- 

NK AN INTEGER VARIABLE SPECIFYING THE NUMBER @F INTERIOR 
KN@TS- THUS NK=NP-2- IT IS USED T®@ ESTABLISH. THE 
DIMENSION OF CERTAIN ARRAYS MENTI@NED BELOW. 

x ON RETURN T@ THE CALLING PROGRAM X WILL CONTAIN THE 
VALUES OF THE APPROXIMATION TO THE SOLUTION AT THE 
NK INTERI@R KNOTS-e THIS IS AN ARRAY OF DIMENSION 
NK AND TYPE REAL. 

XDP @N RETURN» XDP CONTAINS THE QUANTITIES H*H/6+0 
MULTIPLIED BY THE SECOND DERIVATIVE VALUES AT ALL THE 
KNOTS- XDP IS A REAL ARRAY @F DIMENSI@N NP. 

EP THIS REAL VARIABLE IS SET T@ THE VALUE 1+OE-M IF WE 
REQUIRE M-1 IDENTICAL FIGURES IN SUCCESSIVE ITERATES.~ 

GT AN INTEGER ARRAY @F LENGTH NP WHICH ASSIGNS T@ EACH 
KN@T T SUB J AN INTEGER VALUE BETWEEN 1 AND 6. THIS 
VALUE DESIGNATES RESPECTIVELY THE CASES WHEN 
GCT SUB J) IS. 1) «LE+« A» 2) -GEeBs 3) WITHIN EP OF 
SOME KN@T VALUE, 4) IN THE FIRST SUBINTERVAL» 

5) IN THE LAST SUBINTERVALs AND 6) IN ANY @THER 
SUBINTERVAL+ GTCI+1) CORRESPONDS TO KNOT T SUB I. 

KG AN INTEGER ARRAY @F LENGTH NP WHICH ASSIGNS TO EACH 
KN@T AN INTEGER BETWEEN 2 AND NP-1l.e IF GTCI+1)9=3 
THEN KGCI+1) CONTAINS THE SUBSCRIPT OF THE KNOT 
AT THE POINT G(T SUB 1). IF GTCI+1)=6 THEN KGCI+1) 
CONTAINS THE SUBSCRIPT OF THE KNOT AT THE RIGHT HAND 
ENDPOINT OF THE SUBINTERVAL IN WHICH GCT SUB I) LIES. 

VPs» VQ» VR» AND VG ARE ALL REAL AKRAYS OF DIMENSI@N NP AND 

CONTAIN THE VALUES @F THE FUNCTI@NS Ps» QO» R AND G 

RESPECTIVELY EACH EVALUATED AT THE NP KNOTS. 

IS A REAL NK BY NK ARRAY USED IN THE MATRIX EQUATION 

(MAT) CX)=CVM) SET UP T@ S@LVE FOK THE X SUB J VALUES 

ST@RED IN ARKAY Xe 

vM AN ARRAY @F LENGTH NK AND TYPE REAL USED AS 
DESCRIBED ABOVE. 

THE USER MUST SUPPLY REAL FUNCTION SUBPROGRAMS T@ COMPUTE 

THE FUNCTIONS UCT)sVCT)sPCT),0CT)4K¢T) AND GOT) DEFINED AS 

ABOVE. HE MUST ALS@ SUPPLY SUBPR@GRAMS WHICH SOLVE THE 

SYSTEM (MAT) (X)5CVM) 6 THE ROUTINE LUDCMPCMATs NK) IS TO 

REPLACE MAT BY ITS LU DEC@MPOSITION. THE ROUTINE 

LUSUBCVMsMATsX»NK) IS TO COMPUTE X WHEN VM AND THE LU 

F@RM OF MAT IS GIVEN. 

INTEGER GTYPs GTEs GTCNP)» KGCNP)s GTi» GTNP 
REAL XDPCNP)» VPCNP)» VQCNP)s» VRONP)» VGCNP) 
REAL MATCNKsNK)s VMCNK)s XCNK) 

KPR IS PRINTER DEVICE NUMBER 

DATA KPR/6/ 

CCT) = T#(T*T-1.) 
INITIALIZATION 

N = NP - 1 

RN =N 

NK = N- 1 

D®@ 20 K=15NK 

D@ 10 J=isNK 
MATCKs J) = 060 

10 CONTINUE 
20 CONTINUE 

XA = UCA) 

XB = V(B) : 

INITIALIZE XDP TO ZER@ CINITIAL SPLINE) 

D2 30 K=1sNP 
XDPCK) = 0-0 
30 CONTINUE 
SET UP PsQ@sRsG VECTORS 
H = (B-A)/RN 


MAT 


HS = H*H/6-0 

HR = 1+/H 

D@ 40 K=1»NP 
RK = K - |} 
™ = A + RK#¥H 
VPCK) = PCTM) 
V@CK) = QCTM) 
VRCK) = ROTM) 
VGCK) = GCTM) 


40 CONTINUE 
SET UP *TYPE OF G VALUE* ARRAY AND KG ARRAY 
APLSE = A + EP*ABSCA) 
BMINE = B - EP*ABS(B) 
D@ 70 K=1s,NP 
GTE = 6 
VGE = VGCK) 
IF CVGE-+LTeA+H) GTE 
IF CVGE.GT»B-H) GTE 
IF CVGE-LE»APLSE) GTE 
IF (VGE.GE-BMINE) GTE 
VDH = (VGE-A)/H 
KNOT = VDH + EP 
RKNOT = KNOT 
IF CCKNOT«LT«1) «OR- CKNOT-GT.NK))? GO TO 50 
IF CABSCVDH-RKNOT)+«GT-EP) GO TO 50 
GTE = 3 
KGCK) = KNOT 
GY T@ 60 
SO  KGC(K) = KNOT + 1 
60 GTC(K) = GTE 
70 CONTINUE 
PUT XSUBJ COEFFICIENTS INT®@ (MAT) AND INITIALIZE X TO ZERO 
DO 90 J=1sNK 
XCJ) = 060 
IF (JeEQe1) G2 TO 80 
MATCJsJ-1) = 16 = HS*VPCU) ; 
BO = MATC Se J) = -20%#C1o+26#HSKVPC +1)? 
IF (JeEQ-NK) G2 TO 90 
MATCJsJ+1) = Le - HS*VPCJ+2) 
90 CONTINUE 


nas 
~ 


X SUB G SUB T COEFFICIENTS 
D@ 150 J=lsNK 
DO 140 JJ=123 

JZ = JJ - 1 

JJZ = J + JZ 

COEF = HS*VQCJJZ> 

IF €JZeEQ-1) COEF = COEF*4. 

GTYP = GTCJJZ) 


0 
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120 


130 


140 


GO T@ €14051405100.1105120.130)» GTYP 
KNOT = KGCJJZ) 
MATCJsKNOT) = MATCJsKNOT) - COEF 


Ge TO 140 

MATCJs1) = MATCJe1) - COEF*C( VGC JJZ)-A)*HR 
G@ Te 140 

MATC J»sNK) = MATC J» NK) - COEF*(B~VGCJJZ) )*HR 
GO T® 140 


KN@T = KGCJJZ) 

RKN@T = KNOT 

CCC = RKNOT + CA-VGCJJZ))*HR 

MATC Je KNOT-1) = MATCJsKNOT-1) - COEF*CCC 

CCC = (VGC UJIZ>-A*FHR = RKNOT + 1 

MATCJsKNOT) = MATCJsKNOT) - COEF*CCC 
CONTINUE 


150 CONTINUE 
C REPLACE (MAT) BY ITS LU DECQ@MPOSITION. 


CALL LUDCMP(CMATs NK) 


C A SEQUENCE OF SPLINES CUP T@ 20) IS N@W GENERATED 


VPA = VPC(1) 


VPB = VPCNP) 
DO 250 NNN=1.220 


C VECTOR VM 1S NOW SET UP 


160 


170 


180 


190 
200 


C THE 
C THE 


210 


220 


D@ 200 J=1,NK 
VMCJ) = CVRC JD + 4¢kVROJ+1) AVRO I42) HHS 
D@ 190 JJ=123 
JZ= JS 21 
—JJZ = J + JZ 
GTYP = GT¢JJUZ) 
CGEF = HS*VQ(JJZ) 
IF (JZ+EQs1) COEF = COEFKA. 
IF (GTYPeEGe1) VMCJ) = VMCJ) + COEFRUCVGCJJZ)) 
IF (GTYP+EQ«2) VMCJ) = VMCJ) + COEFAVCVGCJUZ)) 
GO TO €190+190519051605170,180). GTYP 
TM = CVGCJJZ)-A)*HR 
[TJ = 16 = TM 
CCC = TURXA + GCTM)*&XDPC2) * CCTUD*XDPC1) 
VMCJ) = VMCJ> + COEF*CCC 


G@ T@ 190 
. TS = CB“VGCJIZ)) *HR 
T = 1.6 - TJ 


CCC = TM*XB + CCTM) *XDPCNK+2) + CCTJ)*XDPCNK+1) 
VMCJ) = VMCJ) + COEF*CCC 
GO TO 190 
KN@T = KGCJJZ) 
RKNOT = KNOT 
TJ CA-VGCJJZ))*HR + RKNOT 
™ 1. - TJ 
CCC = COTM) *XDPCKNGT+1) + CCTJ)*XDPCKN@T) 
VMCJ) = VMCJ) + COEF*CCC 
CONTINUE 
CONTINUE 
VMC1) = VMCL) = Cle sHS*VPA) *UCA)D 
VMCNK) = VMCNK) = €1«"-HS*VPB)*VCB) 
NEW X ARRAY IS N@W COMPUTED. 
ARRAY. VP SERVES AS A W@RK AREA. 
D@ 210 JF=1sNK 
VPCJF) = XCJUF) 
CONTINUE 
CALL LUSUBCVMs, MATs Xs NK) 
TSTVLI = 0-0 
TSTVL2 = 0.0 
D@ 220 JF=1.,NK 


TSTVLIL = TSTVL1 + ABSCVPCJF)-XCJF)) 
TSTVL2 = TSTVL2 + ABSCXCJF)) 
CONTINUE 


C CALCULATION OF XDP AT A AND B 


GTt. = GTC1) 

GTNP = GTCNP) 

IF CGT1.EQ-1) XGAA = UCVGC1)) 

IF (GT1.EQ.2) XGAA = VCVGC1)) 

IF €GTNP»EQ.1) XGAB = UCVGCNP)) 

IF (GTNP.«EQ»2) XGAB = VCVGCNP)) 

CALL GAGBC(GTI» XGAAs KGC1)s VGC1)s X» XDPs As Bs NPs 


CALL GAGBCGTNPs XGABs KGCNP)» VGCNP)s Xs» XDPs As Bs NPo 


XDPA = (VPA*XA+VOC1L)*XGAA4VRC1))*#HS 
XDPB = (VPB*XB+VQCNP)*XGAB+VRCNP) > *HS 


C S@LVE FOR XDP VALUES OF CURRENT SPLINE USING CONTINUITY 
C EQUATI@NS-« VM AND VP ARE USED AS WORKING AREAS. 


230 


240 


1000 
250 


10 
20 
30 


VMCTD = XA + KC2Y = Be*XC1) = XDPA 
NK1 = NK - 2 
VMCNK) = XB + XCNKI) = 2¢*XCNK) - XDPB 
D@ 230 J=2sNKt 
VMOJS> = XCIeEd + XCTHLD - Bo¥*XCI) 
CONTINUE 
CALL SOLVECVMs, NKs VPs» NP) 
XDP¢(1) = XDPA 
XDPCNP) = XDPB 
08 240 J=1sNK 
XDPCJ+1) = VMCJ) 
CONTINUE 
IF CTSTVLIeLE+«TSTVL2*EP) RETURN 
IFCNNN-+EQ+20) WRITECKPR» 1000) 
FORMATC 32H NO@ CONVERGENCE IN 20 ITERATIONS ) 
CONTINUE 
RETURN 
END 


SUBROUTINE GAGBCGTYPs ANS» Ks GVs X» XDP» As Bs NPs NK) 
REAL X(NK)s XDPCNP) 

INTEGER GTYP 

CCT) = Te (T*T-1.) 

RNKD = NK + 1 


RK = K 
XA = UCA) 
XB = VCB) 


H = (BA) /RNKD 

GO T@ €10+20530s 40550, 60)s GTYP 
RETURN 

RETUKN 

ANS = XCK) 

RETURN 


40 


50 


60 


20 


™ = (€GV-A)/H 
TJ = 16 - TM 
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ANS = TM&XC1) + TU*XA + CCTM)*XDPC2) + CCTJ>*XDPC1) 


RETURN 
TJ = (B=-GV)/H 
™ = 16 - TJ 


ANS = TM*XB + TJ*XONK) + COTM) *XDPCNK+2) + CCTJ)*XDPCNK+1) 


RETURN 
TJ = CA-GV)/H + RK 
™ = le - TJ 


ANS = TMeXCK) + TU*XCK-1) + CCTM) *XDPCK+1) + CCTJ*XDPCK) 


RETURN 
END 


SUBROUTINE SOLVECDs, NKs Ms NP) 
REAL DCNK)»s MCNP)? 


MCSD = Le/C4e-MC S419) 

DCJS) = DEJ - DCS+1>*MC S41) 
CONTINUE 
DCLIISDCLI#MCL) 
D@ 20 122s,NK 

DCI) = CDCT)-DCI-1)2*MCT) 
CONTINUE 
RETURN 
END 
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Bivariate Normal 
Distribution [S15] 


Thomas G. Donnelly [Recd. 9 July 1971] 

Department of Biostatistics and Center for Urban and 
Regional Studies, University of North Carolina at 
Chapel Hill, Chapel Hill, NC 27514 


Key Words and Phrases: bivariate, normal Gaussian, 
frequency distribution 

CR Categories: 5.5 

Language: Fortran 


Description 

Purpose. Tables of the bivariate normal distribution are avail- 
able [1] for H, K = 0(..1)4 and R = +.0(.05)0.95(.01)1 to six 
decimal places for positive R and to seven decimal places for nega- 
tive &. A valuable section in the preface to [1] by D. B. Owen de- 
scribes a wide variety of problem areas in which the tables can be 
applied. 

The advantages of being able to access these data in a computer 
are many. Frequently the values of (H, K, R) in which one is inter- 
ested will have been produced through computer calculations, and 
it is much more convenient if the user can produce the corresponding 
probability immediately and continue his calculations. Secondly, 
use of tables ordinarily involves the user in three-dimensional hand 
calculated interpolation, and the risk of errors here can be eliminated 
by use of a functional subprogram. Finally, a functional subprogram 
is a Starting point for additional refinements, such as confidence 
regions and tetrachoric correlations. 

Method. The methods employed in the program were basically 
those described in [2, eqs. 3.5, 3.8, 3.9], and comments in the pro- 
gram have been reduced to a minimum because the relations be- 
tween the program and the equations should be self-evident. Because 
the expression used (2, eq. 3.9] in evaluating 7(A, a) is an alternating 
convergent series, it was possible to provide controlled precision in 
the algorithm. As written, it provides accuracy to 15 decimal places, 
but the parameter controlling this, DJG, may be adjusted to suit 
the computer environment in which the algorithm is to be used. Of 
course, the value selected must conform to the precision obtainable 
from the univariate error function used, such as Algorithm 304, [3] 
and the other standard subroutines used, as well as to the computer 
characteristics. 

The lower-left tail values of the distribution, if desired, are 
obtained by reversing the signs of H and K. 
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1. National Bureau of Standards, Tables of the Bivariate Normal 
Distribution and Related Functions, N.B.S., Applied Math. Series, 
No. 50, 1959. 
2. Owen, D.B. Tables for computing bivariate normal probabili- 
ties. Ann. Math. Stat. 27 (1956), 1075-1090. 
3. Hill, I.D., and Joyce, S.A. Algorithm 304. Normal curve 
Integral. Comm. ACM 10 (June 1967), 374. 
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DOUBLE PRECISI@N FUNCTION BIVN@RCAHs AKs Kk) 
BIVN@R IS A CONTROLLED PRECISION 
FORTRAN FUNCTION T@ CALCULATE THE 


THE PROBABILITY FOR TWO NORMAL 
VARIATES X AND Y WHOSE CORRELATION 
IS Rs THAT X «GTe AH AND Y «GTe AKe 
DOUBLE PRECISIGN TWOPIs Bs AH» AKs KR» GH» GK» RR» GAUSS,» 
* DERFs H2s A2s H4»s DEXP» EXs W2s AP» S2@s SP» Sls» SN» SOR» 
* DSQORTs C@N»s DATANs WHs WKs GWs SGN» Ts DABSs G2e CONEXs 
* CN 
GAUSS(T) = (€1-O0DO0+DERF(T/DSQRTC2-000)))/2-0D0 
GAUSS IS A UNIVARIATE L@WER NORMAL 
TAIL AREA CALCULATED HERE FROM THE 
CENTRAL ERROR FUNCTION DERFe 
1T MAY BE REPLACED BY THE ALGORITHM IN 
HILL» IeDe AND JOYCEsSeAe ALGORITHM 304, 
NORMAL CURVE INTEGRALCS15)5 COMMeA-CoMe C10) 
CJUNEs 1967) »P 0374s 
SOURCE? OWEN» DeBe ANNoMATHeSTAT~ 
V@L~- 27€1956)5 P-1075- 
TWOPI = 2. x PI 
TWOPI = 6-283185307179587D0 
B = 0-.0D0 
IDIG = 15 
THE PARAMETER ‘IDIG* GIVES THE 
NUMBER OF SIGNIFICANT DIGITS 
TO® THE RIGHT OF THE DECIMAL POINT 
DESIRED IN THE ANSWER» IF 
IT IS WITHIN THE COMPUTER'S 
CAPACITY OF COURSE. 
GH = GAUSS(-AH) /2-0D0 
GK = GAUSS(-AK)/2-0D0 
IF CR) 105 30+ 10 
10 RR = 1-0D0 - R*#R 
IF CRR) 20+ 405 100 
20 WRITE (3299999) R 
C ERROR EXIT FOR ABSCR) -GT» 1-0D0 
99999 FORMATC12H BIVN@R R ISs D26016) 
STOP 
30 B = 4-0D0*GH*GK 
G@ TS 350 
40 IF €R> 50s 705 70 
50 IF CAH+AK) 60% 350» 350 
60 B = 2.0D0*(GH+tGK) - 1-0D0 
G@ T@ 350 
70 IF CAH-AK) 80s 905 90 
80 B = 2-0D0*GK 
G@ Te 350 
90 B = 2-0D0*GH 
G@ Te 350 
100 SQ@R = DSQRTCRR) 
IF CIDIG-15) 1205 110. 120 
110 CON = TWOPI*1-D-15/2.0D0 
G@ T@ 140 
120 CON = TWOPI/2-0D0 
D@ 130 1=1,1DIG 
CON = CON/10.0D0 
130 CONTINUE 
140 IF CAH) 1705 150s 170 
150 IF CAK) 1905 160% 190 
160 B = DATANCR/SQR)I/TWOPI + 0-25D0 
G@ To 350 
170 B = GH 


IF CAH*AK) 1802 200s 190 
180 B = B - 0.SDO 
B= 


Cc 
Cc 
C BIVARIATE NORMAL UPPEK RIGHT AREA» VIZ- 
c 
Cc 
c 
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190 B + GK 

IF CAH) 2005 340s 200 
200 WH = ~-AH 

WK = CAK/AH~R)/SOR 

GW = 2.0D0*GH 

Is = -1 
210 SGN = -1.0D0 

T = 0-0D0 


IF (WK) 220. 320+ 220 
220 IF CDABSCWK)=-1-0D0) 270+ 230» 240 
230 T = WK*GW*C1-O0D0-GW)/2-0D0 


GO T@ 310 
240 SGN = -SGN 
WH = WH«&WK 
G2 = GAUSSCWH) 
WK = 1-0D0/WK 
IF CWK) 250s 260s 260 


250 B = B + 0-5D0 
260 B = B - (GW+G2)/2-0D0 + GW*Ge 


270 H2 = WH*WH 
A@ = WK*xWK 
H4 = H2/2-0D0 
EX = DEXP(-H4) 
W2 = H4*EX 
AP = 1-0D0 
S2 = AP - EX 
SP = AP 
Si = 0.0D0 
SN = Sl 
CONEX = DABSC(CON/WK) 
GB TO 290 
280 SN = SP 
SP = SP + 1-0D0 
$2 = $2 - we 
W2 = W2*H4/SP 


COLLECTED ALGORITHMS (cont.) 


290 


~AP*A2 

AP*S2/CSN+SP) 

Si + CN 

DABS(CN)-CQ@NEX) 300» 300. 280 
CDATANCWK)-WK*S1L>/TWOPI 


auwtuu 


IF €1S) 330, 350, 350 
IF €AK) 3405 3505 340 
WH = -AK 

WK = CAH/AK=-R)/SOR 

GW = 2-0D0*GK 

Is = 1 

G@ T® 210 

IF €B) 360s 370 370 
B = 0-0D0 

IF (B-1-0D0) 390+ 390, 380 
B = 1%0D0 

BIVNOR = B 

RETURN 

END 
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Algorithm 463 


Algorithms SCALE1, 
SCALE2, and SCALE3 for 


Determination of Scales on 
Computer Generated Plots [J6] 


C.R. Lewart (Recd. 6 Aug. 1971 and 28 Jan. 1972) 
Bell Telephone Laboratories, Incorporated, Holmdel, 
NJ 07733 


Key Words and Phrases: plotting, scaling for plotting 
CR Categories: 4.41 
Language: Fortran 


Description 

Introduction. It is often desirable to plot computer generated 
output or obtain discrete distribution functions such as histograms 
automatically. In general the raw data does not lend itself directly 
to an easily readable presentation. The three related algorithms as 
presented here obtain readable linear or logarithmic scales with 
uniform interval sizes for users of various plot routines. 

Readability. A readable linear scale is defined here as a scale 
with interval size a product of an integer power of 10 and 1, 2 or 5, 
and scale values integer multiples of the interval size. 

A readable logarithmic scale on a display with uniform plotting 
intervals is defined here such that the ratio of adjoining scale values 
DIST = aaa where K and ZL are integers, with 1 < ZL < 10; 
scale values are equal to DIST , where M is a set of successive 
integers. 

The definition of readability used for SCALE 1 and SCALE 2 
permits scale values such as: 


—0.5, 0.0, 0.5, 1.0, ... 
1.24, 1.26, 1.28,... 
100.0, 200.0, 300.0, .. . , etc. 


It prohibits the following examples: 


—1.0, 4.0, 9.0, ... 
1.2, 1.31, 1.42,... 
0.0, 4.0, 8.0, 12.0, ..., ete. 


The definition of readability for logarithmic plots would permit 
scale values of 1, +/10, (x/10)’, 10, ..., but disallow 1, /5, 5, 
ST On ZO hee 


Usage. A call of the form 
CALL SCALE| (XMIN, XMAX, N, XMINP, XMAXP, DIST) 


where XMIN and XMAX are the minimum and maximum, respec- 
tively, of a given array and N a requested number of grid intervals 
will return a new minimum and maximum XMINP and XMAXP 
such that the range [YMINP, XMAXP} is the smallest range which 
will embrace the range [YMIN, XMAX] and simultaneously result 
in approximately N grid intervals, each of the length DIST. Interval 
DIST is selected by SCALE] as the product of an integer power of 
10 and 1, 2, or 5. XMINP and XMAXP are integer multiples of 
DIST. 
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In certain cases the number of plot intervals N has to be fixed. 
In particular, for plots generated by devices with relatively large 
pen increments, e.g. line printers or teletypewriters, N is restricted. 
For such cases SCALE2 for linear plots and SCALE3 for logarith- 
mic plots have to be used. 

SCALE2 with the same arguments as SCALE1 differs from 
SCALE in that XMINP and XMAXP are determined such that 
exactly N grid intervals will result; as a consequence the range 
[XMINP, XMAXP] will in general be less economical than that 
obtained by SCALE1. Parameters DIST, XMINP, and XMAXP 
will still satisfy requirements specified for SCALE1, namely DIST 
will be an integer power of 10 times 1, 2, or 5; and XMINP and 
XMAXP will be integer multiples of DIST. 

SCALE3 with the same arguments as SCALE] will set XMINP 
and XMAXP such that N logarithmic uniformly spaced grid inter- 
vals will cover the range [XMIN, XMAX]. DIST will be the ratio of 
adjacent grid line values. 

SCALE3 selects DIST as 10 , where K and L are integers 
and 1 < L < 10. XMINP and XMAXP are selected so that 
XMINP = DIST’, and XMAXP = DIST’ where j and / are inte- 
gers. 

Calling SCALE1, SCALE2, or SCALE3 will approximately 
center the range [XMIN, XMAX] between XMINP and XMAXP. 
SCALE, having determined DIST, selects the most economical 
limits, i.e. (XMIN — DIST) < XMINP < XMIN and XMAX < 
XMAXP < (XMAX + DIST). SCALE2 and SCALE3 select 
limits to minimize (YMAXP — XMAX) and (XMIN — XMINP) 
without necessarily satisfying the previous inequalities, but subject 
to the constraints of a fixed number of intervals. 

The actual number of intervals N, , determined from the out- 
puts returned by SCALE 1 is as follows: 


Na = (XMAXP — XMINP)/DIST. 


N. may be slightly larger or smaller than N as shown by the fol- 
lowing inequality: 


(N/V/2.5) < Na < (N X V2.5 + 2). 


N, will always equal N if SCALE2 or SCALE3 is called. 

Round-off considerations. The three algorithms compensate for 
the computer round-off to assure that XMIN and XMAX are 
within the range [XMINP, XMAXP}. A normalized parameter 
DEL is introduced to serve as a narrow gate around the minimum 
XMIN and the maximum XMAX to avoid an unnecessarily large 
range [XMINP, XMAXP] caused by computer round-off. For 
example, if DEL = 0.0001, N = 3 and SCALEI1 or SCALE2 is 
called, XMINP of 1.0 and XMAXP of 4.0 will result for 0.9999 < 
XMIN < 1.0001 and 3.9999 < XMAX < 4.0001. DEL is nor- 
malized to the interval size and should satisfy the following in- 
equality: 


A < DEL < (BX N)/C, 


where A is the round-off expected from a division and float opera- 
tion, B is the minimum increment of the plotting device in inches, 
N is the number of intervals on the plot, and C is the plot size in 
inches. For example, using single precision REAL+*4 variables 
(IBM 360): A ~ 0.0000002; for a precision flat bed plotter: B = 
0.002, C = 50.0. Assuming N = 10 the following inequality is 
obtained: 


0.0000002 < DEL < 0.0004. 


It is obvious from this inequality that in practical cases the range of 
permissible values of DEL is so large that DEL is quite insensitive 
to the type of plotter and the type of computer used. 


(1/L+K) 
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Examples 
SCALE} Actual 
No. of 
Intervals 
XMIN XMAX N XMINP XMAXP DIST 
—3.1 11.1 § —4.0 12.0 2.0 8 
5.2 10.1 5 5.0 11.0 1.0 6 
—12000  -—100 9 —12000 0 1000 12 
SCALE2 Actual 
No. of 
Intervals 
XMIN XMAX N XMINP XMAXP_ DIST 
—3.] 11.1 5 —5.0 20.0 5.0. 5 
2 10.1 5 4.0 14.0 2.0 5 
—12000 —100 9 —14000 4000 2000 9 
SCALE 3 Actual 
No. of 
Intervals 
XMIN XMAX N XMINP XMAXP DIST 
1.8 125.0 10 1.58 158.49 1.58 10 
(=~V10) 
0.1 10.0 2 0.1 10.0 10.0 2 
0.1 1500.0 4 0.077 2154.4 12.92 4 
(= 100+1/9)) 
Algorithm 
SUBROUTINE SCALEICXMIN, XMAX» N» XMINPs XMAXPs DIST) 
C ANSI FORTRAN 
C GIVEN AMIN» XMAX AND N SCALE1 FINDS A NEW KANGE XMINP AND 
C XMAXP DIVISIBLE INTO APPROXIMATELY N LINEAK INTEKVALS 
C OF SIZE DIST 
C VINT- IS AN ARRAY OF ACCEPTABLE VALUES FOR DIST CTIMES 
C AN INTEGER POWER OF 10) 
C SOR IS AN ARRAY @F GEOMETRIC MEANS OF ADJACENT VALUES 
C OF VINTs IT IS USED AS BKEAK POINTS TO DETERMINE 
C WHICH VINT VALUE T@ ASSIGN TO DIST 


DIMENSION VINTC4)5 SQRC3) 
DATA VINTC1)s VINTC2)59 VINTC3)e VINTC4)/1e> Q2ee Ses 1007 
DATA SQRC1)s SORC2)» SQRC3BI/d- 4142142 321622782 720710687 
C CHECK WHETHEK PROPER INPUT VALUES WEKE SUPPLIED 
IF CXMINoLT*¢XMAX «ANDs NeGT-0) GO TG 10 
WRITE €6s99999) 
99999 FORMATC34H IMPRGPER INPUT SUPPLIED TG@ SCALE1) 
RETURN 
DEL ACCOUNTS F@R GOMPUTER ROUND~OFF 
DEL SHOULD BE GREATER THAN THE KGUNO-GFF EXPECTED FROM - 
A DIVISION AND FLOAT OPERATION,» IT SHOULD BE LESS THAN 
THE MINIMUM INCREMENT @F THE PLOTTING DEVICE USED BY 
THE MAIN PROGKAM CIN.) DIVIDED BY THE PLOT SIZE CIN.) 
TIMES NUMBER OF INTERVALS N 
10 DEL = .00002 
FN = N 
C FIND APPROXIMATE INTERVAL SIZE A 
A = CXMAX-AMIN) /FN 
AL = ALOGIOCA) 
NAL = AL 
IF CAsLT+l«) NAL = NAL - 1 
€ A IS SCALED INT@ VAKIABLE NAMED 8 BETWEEN 1 AND 10 
B = A/10.**NAL 
C THE CLOSEST PERMISSIBLE VALUE FOn B IS FOUND 
‘ DO 20 1-133 
IF (BeLT-SanC1)>)? GO TS 30 
20 CONTINUE 
I= 4 
C THE INTERVAL SIZE 1S COMPUTED 
30 DIST = VINTCL)&10.*4NAL 
FMt = XMIN/DIST 
ML = FMI 
IF CFMI-LT«O.) MI = MI = 1 
IF CABSCFLGATCMII+1e-FM1)eLT.DEL) MI = 41 + 1 
C THE NEW MINIMUM AND MAXIMUM LIMITS AKE FOUND 
XMINP = DIST#FLOATCMI). 
FM2 = XMAX/DIST 
M2 = FM2 + 1. 
IF CFM2.LT«C-362) M2 = M2 - 1 
IF CABSCFM2+1.-FLOAT(M2))+LT«DEL) M2 = M2 - 1 
XMAXP = DIST*FLOCAT(M2) 
C ADJUST LIMITS TO ACCOUNT FOK ROUND-GFF IF NECESSARY 


oaqgaaa 


IF CXMINP «GTeXMIN) XMINP = XMIN 
IF CXMAXP«LTeXMAX) XMAXP = KMAX 
RETURN 

END 


SUBROUTINE SCALE@CXMIN» XMAXs Ns» XMINPs AMAXPs DIST) 
ANSI FORTRAN 
GIVEN XMINsXMAX AND N SCALE2 FINDS A NEW RANGE XMINP AND 
XMAXP DIVISIBLE INTG EXACTLY N LINEAK INTERVALS GF SIZE 
DIST» WHERE N I[S GREATEK THAN 1 

DIMENSI@GN VINTC 5S) 

DATA VINTC1)s VINTC2)6 VINTC3)s VINTC 425 VINTC5)/L os Qos 

* Ses 100e3 2007 


aaaa 


Cc 
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CHECK WHETHER PROPER INPUT VALUES WERE SUPPLIED 
IF CXMINeLTsXMAX «ANDe NeGT-1)>,GG TS 10 
WKITE €6299999) 


99999 FORMATC34H IMPRKOPEK INPUT SUPPLIED TO SCALE2) 


qaaaaqa 


c 


RETURN 
10 DEL = .00002 
FN = N 


FIND APPROXIMATE INTERVAL SIZE A 
A = CAMAX-XMIN)/FN 
AL = AL@G10CA) 
NAL = AL 
IF CAeLTe1-)> NAL = NAL - 1 
A IS SCALED INT@ VARIABLE NAMED B BETWEEN 1 AND 10 
B = A/10¢**NAL 
THE CLOSEST PERMISSIBLE VALUE FGR B IS FOUND 
DO 20 {f=133 
IF (BeLTeCVINTCI)+DEL2) GO TO 30 
20 CONTINUE 
I1=4 
THE INTERVAL SIZE IS COMPUTED 
30 DIST = VINTCI)*10.*kNAL 
FMIL = XMIN/DIST 
M1 = FMI 
IF CFML1eLT-O.) MI = MI - 1 
IF CABSCFLGATCMI)+bLe-FMI)-LT~-DEL) M1 = MI + 1 
THE NEW MINIMUM AND MAXIMUM LIMITS ARIE FOUND 
XMINP = DIST*FLOATCM1) 
FM2 = XMAX/DIST 
M2 = FMA + 1. 
IF CFM2.LT.€C-12)9) M2 = M2 - 1 
IF CABSCFM2+1.«.-FL@ATCM2))-LT-DEL) M2 = M2 - 1 
XMAXP = DIST*FLOATCM2) 
CHECK WHETHER A SECOND PASS 15 REQUIRED 
NP = M2 - M1 
IF (NPeLEe«N) GO TQ 40 
t=t+*1 
GO TO 30 
40 NX = (N-NP)72 
XMINP = XMINP - FLOATCNX)*DIST 
XMAXP = XMINP + FL@ATCN)*DIST 
ADJUST LIMITS T@ ACCOUNT FOR R@UND-GFF IF NECESSARY 


IF CKMINP+GTeXMIN) XMINP = XMIN 
IF CXMAXP eLT/eXMAX) XMAXP = XMAX 
RETURN 

END 


SUBROUTINE SCALESCXMIN, XMAXs Ns XMINPs XMAXPs DIST) 
ANSI FORTRAN 
GIVEN XMINsXMAX AND Ns WHERE N IS GREATER THAN 1s SCALES 
FINDS A NEW RANGE XMINP AND XMAXP DIVISIBLE INT@ EXACTLY * 
N L@GARITHMIC INTERVALS» WHEKE THE RATIO OF ADJACENT 
UNIFORMLY SPACED SCALE VALUES IS DIST 
DIMENSION VINTCI1) 
DATA VINTC1)s VINTC2)5 VINTC3)s VINTC4)9 VINTCS)s VINTC6)> 
* VINTCT), VINTC8)s VINTC9)s VENTC1025 VINTCLIDS1IOs Ges 
* Bex Ter Bex Sev Ges Boo Zen Lev 25S 
CHECK WHETHER PROPER INPUT VALUES WERE SUPPLIED ; 
IF CXMIN-LT+sXMAX -ANDs NeGT-ot eANDe XMIN-GT-0-) GO TO 10 
WRITE € 6399999) 


“99999 FORMATC34H IMPROPEK INPUT SUPPLIED TO SCALE3) 


RETURN 
10 DEL = .090002 
VALUES ARE TRANSLATED FROM THE LINEAR INTO LGGARI THMIC 
REGION 
XMINL = AL@GIOCXMIN) 
XMAXL = ALGG1OCXMAX) 
FN = N 
FIND APPROXIMATE INTERVAL SIZE A 
A = (AMAXL~XMINL)/FN 
AL = ALOGIOCA) 
NAL = AL 
IF CAeLT«t-e) NAL = NAL - 1 
A IS SCALED INT@ VARIABLE NAMED B BETWEEN 1 AND 10 
B = A/10.*+NAL 
THE CLOSEST PERMISSIBLE VALUE FOK B iS FOUND 
D@ 20 I=129 
IF (BeLTeCLOs/VINTCI>+DEL)) GO TO 30 
20 CONTINUE 
I = 10 
THE INTERVAL SIZE IS COMPUTED 
30 DISTL = 10¢*#*C(NAL+1I)/VINTCI) 
FM1 = XMINL/DISTL 
Mi = FM1 
IF CFM1.LT«O.) Mi = MI - 1 
IF CABSCFLOATCMID+1--FMI1)-«LTeDEL) MI = MI + 1 
THE NEW MINIMUM AND MAXIMUM LIMITS AKE FOUND 
XMINP = DISTL*FLOATCMI) 
FM2 = XMAXL/DISTL 
M2 = FM2 + Ile 
IF (FM2-LTeC-15e)) Me = M2 - 1 
IF CABSCFM2+1.-FLGATCM2))-LT-DEL) M2 = M2 = 1 
XMAXP = DISTL*FL@ATCM2) 
NP = M2 = Ml 
CHECK WHETHER ANOTHER PASS IS NECESSARY 
IF CNP+«LE«N) GO TQ 40 
I=tI+t 
Ge TO 30 
40 NX = (N-NP) 72 
XMINP = XMINP - FLOATCNX) #DISTL 
XMAXP = XMINP + FLOATCN)*DISTL 
VALUES ARE TRANSLATED FROM THE LOGARITHMIC INTO THE LINEAR 
REGICN 
DIST = 10-**DISTL 
XMINP = 10+%*XMINP 
AMAKP = 10¢**kXMAXP 
ADJUST LIMITS TO ACCOUNT F@R ROUND-OFF IF NECESSARY 
IF (XMINP+GTeXMIN) XMINP = XMIN 
IF CXMAXP«LT«XMAX) XMAXP = XMAX 
RETURN 
END 
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Eigenvalues of a Real, 
Symmetric, Tridiagonal 
Matrix [F2] 


Christian H. Reinsch [Recd. 11 Mar. 1971] 
Mathematisches Institut der Technischen Universitat, 
8000 Miinchen 2, Arcisstra 21, Germany 


Key Words and Phrases: eigenvalues, QR Algorithm 
CR Categories: 5.14 
Language: Algol 


Description 

This algorithm uses a rational variant of the QR transformation 
with explicit shift for the computation of all of the eigenvalues of a 
real, symmetric, and tridiagonal matrix. Details are described in 
{1]. Procedures tred1 or tred3 published in [2] may be used to reduce 
any real, symmetric matrix to tridiagonal form. Turn the matrix 
end-for-end if necessary to bring very large entries to the bottom 
right-hand corner. a 


References E 
1. Reinsch, C.H. A stable, rational QR algorithm for the com- 
putation of the eigenvalues of an Hermitian, tridiagonal matrix. 
Math. Comp. 25 (1971), 591-597. 

2. Martin, R.S., Reinsch, C.H., Wilkinson, J. H. Householder’s 
tridiagonalization of a symmetric matrix. Numer. Math. 11 (1968), 
181-195, 


Algorithm 
procedure fg/rat (n,macheps) trans: (d,e2); 
value 1, macheps; 
integer ; real macheps; array d, e2; 
comment 
Input: 
n order of the matrix, 
macheps the machine precision, i.e. minimum of all x such that 
1 + x > 1 on the computer, 
d{1:n) represents the diagonal of the matrix, 
e2[1:n] represents the squares of the sub-diagonal entries, 
(e2[L] is arbitrary). 
Output: 
a{1:n] the computed eigenvalues are stored in this array in 
ascending sequence, 
e2(1:n] is used as working storage and the original informa- 
tion stored in this array is lost; 
begin 
integer i, k, m; real b, b2, f, g,h, p2, r2, s2; 
for i := 2 step 1 until n do e2[i—1] := e2[i]; 


e2(n] : 
for k : 
begin 


contl: 
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b:= 62:= f:= 0.0; 
1 step | until do 


lI 


h += macheps X macheps X (d[k]{?2 + e2[k]); 
if b2 < Athen ~ 
begin b := sgrt(h); b2 := hend; 

comment Test for splitting; 

for m := k step 1 until n do 

if e2(m] < 52 then go to conti, 


if m = k then go to root; 
comment Form the shift from leading 2 2 block; 


nextit: 


g i= dk}; p2 := sgrt(e2[k)); 


—h= (d[k+1]—g)/(2.0Xp2); r2 := sgrt(hxh+1.0); 


d[k) := h := p2/(if h<0.0 then A—1r2 else h+12); 
hi=g-Ahyfi=ft+h,; 
for i := k + 1 step 1 until ” do d[i] := d[{i] — A; 
comment Rational QL transformation, rows k through m; 
:= d[m); if g = 0.0 then g := 5b; 
h := g;52:= 0.0; 
for i := m — 1 step —! until k do 
begin 
p2:= @ Xh;r2:= p2 + efi; 
e2(i+1] := s2 & r2; 52 := e2[i]/r2; 
djit 1] := A+ s2 < (h+d[i)); 
g := di] — e2[i)/g; if g = 0.0 then g := 5; 
h:= g XX p2/r2 
end i; 


- ed(k]) := s2 XX eX hy, dk] :=h,; 


if e2[k] > 62 then go to nextit; 


root: 
-hs=dki t+ 


comment One eigenvalue found, sort eigenvalues; 
for i := k step —1 until 2 do 

if h < d{i—1] then d[i] := d{i—1] else go to cont2; 
i:= 1; 


cont2: 


di] :=h 


end k 
end fqlrat; 
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Algorithm 465 
Student’s ¢ Frequency [S14] 


G.W. Hill [Recd. 24 Aug. 1971, 23 Feb. 1972, 10 
July 1972] 

C.S.1.R.O., Division of Mathematical Statistics, Glen 
Osmond, South Australia 


Key Words and Phrases: Student’s t statistic, density function, 
series approximation 

CR Categories: 5.12, 5.5 

Language: Algol 


Description 
The frequency function for Student’s ¢ distribution, 


Gn + 4) 
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fln) = 
is evaluated for real ¢ and real n > 0 to a precision near that of 
the processor, even for large values of 1. 

The factor involving ¢ is evaluated as exp(—14b) where 5 is 
computed as (n + 1)in(l + 2/n) if 2/n = c is large (>cmax, 
say) or, to avoid loss of precision for smaller c, by summing 
the series for b = (2 + c)(1 — ¢/2 + 2/3 — /44 ---+) until 
negligible terms occur, i.e. c’/(r + 1) < «, where « is the relative 
magnitude of processor round-off. The relative error up to e/ 
cmax in evaluating /n(1 + c) and the.accumulated round-off 
error of order «./R in summing a maximum of R terms of the 
series can be limited to about the same low level by choosing 
cmax = R~* where R7!®/R =~ e. Thus for R = 12, 16, 23, or 32, 
values of cmax - 0.2887, 0.25, 0.2085, or 0.1762, respectively, 
correspond to processor precision where « = 274, 2736) 2-56, 
or 2-*, respectively. 

Evaluation of the ratio of gamma functions by exponentiating 
the difference of almost equal values of their logarithms would 
involve considerable loss of precision for large n. This is avoided 
by use of the asymptotic series obtained by differencing the Stirling 
approximations, changing the variable to a = » — 4, and ex- 
ponentiating the result (see also [1]): 
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where Cy = C;, = 1, C2 = —19/2, C3 = 631/2, Cy = —174317/8, 
C,; = 204 91783/8, Cs = —73348 01895/16, Cr: = 185 85901 
54455/16, Cs = —5 06774 10817 68765/128, Cy = 2236 25929 
81667 88235/128, Cio = —24 80926 53157 85763 70237/256. 

The relative error of the sum of the first s terms is negligible 
for n > nmin where | C.| X [4 (amin— 4)]-* & ¢, e.g. for s = 5 
and « = 27%4 or 2736, nmin ~ 6.271 or 13.76, respectively, and for 
s = 10 and e = 27 or 2-%, nmin ~ 15.5 or 40.89, respectively. 
For smaller 1 the ratio of gamma functions is obtained from the 
ratio for some N > nmin by the relation: 
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For large , processor underflow at line 21 is avoided by use of the 
normal approximation, which is adequate for values of n > 1/e, 
whose representation is unaffected by subtraction of 0.5. Protection 
against negative or zero n is provided by returning the distinctive 
value, —1.0, which may be supplemented by an error diagnostic 
process, if required. 

For double precision calculations speed is improved by evalu- 
ating higher order terms of the gamma ratio series using. single 
precision operations. Comparison of double precision (« = 27*) 
results with single precision results (¢ = 2-36, nmin = 13.76, cmax = 
0.25) for a Control Data 3200 indicated achievement generally of 
about ten significant decimal digits, dropping to about eight sig- 
nificant decimals for arguments beyond the 10~* probability level. 

Valuable comments from the referee are gratefully acknowl- 
edged. 
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Algorithm 

real procedure ¢ frequency (1, #); 
value f, 1; real f, 7; 

if » < 0.0 then ¢ frequency := —1.0 

else 

begin 
real a, b, c, d, e, nmin, cmax; 
comment for 36-bit precision processor; 
nmin := 13.76; cmax := 0.25; 
b:=tXte:=b/nwa:=d:=b+e; 
if c > cmax then b := (n+1.0) X In{(1.0+c) 


else 
for e:= 2.0, e + 1.0 while b # ddo 
begin a := —a Xc;b:=d;d:= a/e + dend; 


a:=n;c := 0.3989422804, 

comment I /sqrt(2r) = 0.3989422804014326779399461 ... ; 
for e := awhile e < nmin do 

begin c := c X a/(a+1.0); a := a 4- 2.0 end; 


a:=a-—0O.5; 
ifa ~ n then 
begin 


c:= sqrt(a/n) X c;a:= 0.25/a;a:=aXa; 
c := ((((—21789.625 Xa+ 315.5) Ka—9.5) Xa+1.0) Xa+1.0) 
xXc 
end; 
t frequency := exp(—0.5Xb) X ¢ 
end Student’s t-frequency 
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Four Combinatorial 
Algorithms [G6] 


Gideon Ehrlich [Recd. 25 Aug. 1971, 4 Jan. 1972, and 
12 Dec. 1972] 

Department of Applied Mathematics, Weizmann In- 
stitute of Science, Rehovot, Israel 


Key Words and Phrases: permutations and combinations 
CR Categories: 5.39 
Language: PL/I 


Description 
Each of the following algorithms produce, by successive calls, 

a sequence of all combinatorial configurations, belonging to the 

appropriate type. 

PERMU Permutations of N>3 objects: X(1), X(2),..., X(N). 

COMBI Combinations of M natural numbers out of the first N. 

COMPOMIN Compositions of an integer P to M + 1 ordered 
terms, INDEX(k), each of which is not less than a given mini- 
mum MIN(k). 

COMPOMAX The same as COMPOMIN but each term has its 
own maximum MAX (k). 

The four algorithms have in common the important property 
that they use neither loops nor recursion; thus the time needed for 
producing a new configuration is unaffected by the “‘size’” (N, N 
and M, P and M respectively) of that configuration. 

Each algorithm.uses a single simple operation for producing a 
new configuration from the old one, that is: 

PERMU A single transposition of two adjacent elements. 

COMBI Replacing a single element x by a y having the property 
that there is no element between x and y belonging to the com- 
bination. 

COMPOMIN(MAX) Changing the values of two adjacent terms 
(usually only by 1). 

The algorithms are written in PL1(F). 

Special instructions for the user and notes. 

PERMU (i) The mean work-time is actually a decreasing func- 
tion of N since, on (V — 1)/N of the calls, it returns by the 
first RETURN. (2) The procedure operates directly on any 
object vector x{1:N]. (3) For the first permutation one must 
cali FIRSTPER; for other permutations PERMU must be 
used. (4) Together with the last permutation, which is the 
original one, we will get DONE = '1'B. If we continue to call 
PERMU, the entire sequence will repeat indefinitely. If at any 
stage we set DONE = '0'B, then at the end of the appropriate 
sequence it will become '1'B. (5) The entire resulting sequence 
is the same as that of Johnson [1] and Trotter [2]. 

COMBI Every combination is represented in two forms: (1) As a 
bit array of M '1's and N — M '0's which is identical to A(1), 
A(2),..., A(N). (2) As an array C of M different integers not 
greater than N. The M elements are ordered according to their 
magnitude. If the second representation is not needed one can 
omit Z, H and C together with the last line of the procedure. 
For the first combination we can use the following initializa- 
tion (for other initializations see [3]): 
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DECLARE A(0:N) BIT (1), (X, Y, T(N), FO:N), 

I, L, Z, H(N), C(M)) FIXED; 

DO K =0TOWN — M; A(K) = '0'B; END; 

DOK =N—M+1T0ON;A(K) = '1'B; END; 

DOK =1TOM;C(K) =N—-M+4+K;A(N-—-M+K) = K; 
END; 

T(N — M) = -—1;T(1) =0;F(N) = N-M+1;1=N—M; 
L=N; 

(The initialization was not done in the body of the procedure 
COMBI only in order to simplify the procedures COMPO MIN- 
MAX..) 

Instead of using such a large number of parameters it is pos- 
sible to retain only A, I, L as parameters of the procedure and 
declare and initialize the other present parameters in the body of 
the procedure (as is done in PERMU). In sucha case N, 7, F, L, H 
must be declared as STATIC or CONTROLLED (‘own’ in AL- 
GOL). 

COMPOMIN Each of the M + 1 MIN(k), as well as P, can be 
any integer (positive, negative, or zero), but the sum S of all 
those minima cannot be greater than P. 

For the first composition set INDEX(1) = P — S + MIN(1) 
INDEX(k) = MIN(k), for k > 1. 

Set N = P — S + M, and declare and initialize all variables 
that also appear in COMBI in the same way as was done for 
COMBI. 

Together with the last composition, we will get J = 0 asa 
signal to halt. 

COMPOMAX The instructions for COMPOMIN are valid for 
COMPOMAX provided: (1) MIN is replaced by MAX (S 
> P); and (2) Nis initialized toN = S—- P+ M. 

The vector C (but not H!) has no use in COMPOMIN(MAX), 
so one can omit all statements in which it appears. A justification 
for the four algorithms and for some others can be found in [3]. 


Acknowledgment. I would like to thank Professor Shimon Even 
for guidance and encouragement. 
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Algorithm 
FIRSTPER: PROCEDURE (X,DONE); 
DECLARE (X(*), (XN,XX) STATIC) DECIMAL, DONE BIT(1) 


(N,S,V,M,L,1,DI,IPI) BINARY STATIC, 
(P(O:N),IP(N-1)D(N-1),T(N)) BINARY CONTROLLED; 

N=DIM(X,1) 3 

IF ALLOCATION (P) THEN FREE P,IP,D,T; ALLOCATE P,IP,D,T; 
DO M=1 TO N-1; P(M),IP(M)=M; D(M)=-1; END; 

XN=X(N); V==13 S,P(0),P(N)=N; M,L=1; 

T(N)=N-1; T(N-1)=-2; T(2)=2; 

DONE='0'B; 

PERMU: ENTRY (X,DONE); 

IF Sa=M THEN DO; X(S)=X(S+V); S=S+V3; X(S)=XN; RETURN; END; 
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I=T(N); DI=D(I); 

IP(T) ,IPI=IP(I)+DI; M=P(IPI); IP(M)=IPI-DI; 
P(IPI-DI)=M; P(IPI)=I; M=IPI+L; 
XX=X(M) 5 X(M)=X (M-DI) ; X(M-DI) =XX; 
L=1-L; V=-V; M=N+1-S; 


IF P(IPI+DI) < I THEN 
DO; IF I=N-1 THEN RETURN; 
T(N)=N-1; T(N-1) = -I3 RETURN; 
END; ; 
D(1)=-D1; 
IF T(I) < 0 THEN 
DO; IF T(E)-v=l-1 THEN T{I-1)=T(I); T(1)=1-1; END; 
IF I— =N-1 THEN DO; T(N)=N-1; T(N-1)=-I-1; END; 
T(I+1)=T(T): 
IF 1=2 & P(2)=2 THFN DONE='1'B; 
END; 
COMBI PROCEDURE (A,N,X,Y,1,F,1,L,Z,H,C)s 
DECLARE A(*)BIT(1), (N,X,Y,T(*),F(*),1,L,Z,H(*),C(*)) FIXED; 
IF T(I) <0 THEN 
DO; IF -T(I)~=I-1 THEN T(I-1)=T(I); T(1)=I-1; END; 
IF — A(I) THEN 
DO; X=#I; Y=F(L); 
IF A(I-1) THEN F(I)=F(I-1); ELSE F(I)=I; IF F(L)=L THEN 
DO; L=I; 1=T(1); GOTO CHANGE; END; 
IF L=N THEN 
DO; T(F(N))=-I-1; T(I+1)=T(I); I=F(N); 
F(N)=F(N)+1; GOTO CHANGE;. 
END; 
T(L)=-I-1; T(1+1)=T(1); 
F(L)=F(L)+1; I=L; GOTO CHANGE 
END; 
Y=I; 
IF I-=L THEN 
DO; 
F(L),X=F(L)-1; F(1-1)=F(I); 
IF L=N THEN 
DO; IF I=F(N) -1 THEN DO; I=T({1); GOTO CHANGE; END; 
T(F(N)-1)=-I-1; T(I+1)=T(1); 
I=F(N)-1; GOTO CHANGE; 
END; 
T(L)=-I-1; T(I+1)=T(1); I=L; GOTO CHANGE; 
END; 
X=N; F(L-1)=F(L)3 F(N)=N; L=N; 
IF I=N-1 THEN DO; I=T(N-1); GOTO CHANGE; END; 
T(N-1)=-I-1; T(I+1)=T(1); I=N-1; 
CHANGE ; 
A(X)='1'B; A(Y)='0'B; 
H(X) ,Z=H(Y); C(Z)=X; 
END COMBI; 
COMPOMIN: PROCEDURE (INDEX,A,N,X,Y,1,F,1,L,Z,H,C); 
DECLARE A(*) BIT(1), 
( INDEX(*) ,N,X,Y,T (*) ,F(*),1,L,Z,H(*) ,C(*)) FIXED; 
CALL COMBI (A,N,X,Y,1,F,1,L,Z,H,C); 
INDEX (Z)=INDEX(Z)+X~Y; INDEX (Z+1 )= INDEX (Z+1)+Y-X: 
END COMPOMIN; 
COMPOMAX: PROCEDURE (INDEX,A,N,X,Y,1,F,1,L,Z,H,C); 
DECLARE A(*) BIT(1), 
( INDEX(*) .N,X,Y,1(*),F(*),1,L,Z,H(*) ,C(*)) FIXED; 
CALL COMBI (A,N,X,Y,1,F,1,L,Z3H,C); 
INDEX (Z)=INDEX(Z)-X+Y3 INDEX (Z+1)=INDEX(Z+1)-Y+X; 
END COMPOMAX; 
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Matrix Transposition in 
Place [F1] 


Norman Brenner [Recd. 14 Feb. 1972, 2 Aug. 1972] 
M.I.T., Department of Earth and Planetary Sciences, 
Cambridge, MA 02139 


Key Words and Phrases: transposition, matrix operations, 
permutations, primitive roots, number theory 

CR Categories: 3.15, 5.14, 5.39 

Language: Fortran 


Description 

Introduction. Since the problem of transposing a rectangular 
matrix in place was first proposed by Windley in 1959 [1], several 
algorithms have been used for its solution [2, 3, 7]. A significantly 
faster algorithm, based on a number theoretical analysis, is de- 
scribed and compared experimentally with existing algorithms. 

Theory. A matrix a, of n; rows and ne columns, may be stored 
in a vector v in one of two ways. Element a;; (O0-origin subscripts) 
may be placed rowwise at vw, k = im, + /, or columnwise at vz. , 
k’ =i+jn,. Clearly, letting 1 = 1, and m = mnz — 1, 


k! = nk (mod m). (1) 


Transposition of the matrix is its conversion from one mode 
oi storage to the other, by performing the permutation (1). This 
permutation may be done with a minimum of working storage in 
a minimum number of exchanges by breaking it into its subcycles. 
For example, for a 4 X 9 matrix, one subcycle representation is 


(0) (1 4 16 29 11 9) (34 31 19 6 24 26) 
(22 18 2 8 32 23) (13 17 33 27 3 12) 
(5 20 10) (30 15 25) (7 28) (14 21) (35). 


The notation for the sixth subcycle, for example, means that 
V5 <~ Yoo <— Vio — V5. Z 

For a subcycle starting with element s, the elements of the 
subcycle are sn” (mod m), for r = 0, 1,.... The following theo- 
rems are easily established. 

THEOREM 1. All the elements of the subcycle beginning with s 
are divisible by d = (s, m), the largest common factor of both s and 
m. They are divisible by no larger divisor of m., 

Proor. Both m and s are divisible by d, and therefore so is any 
subcycle element sn” (mod m). But n and m have no common 
factors (since m = nnz — 1), so no divisor of m larger than d can 
divide sn".(7 

THEOREM 2. For every subcycle beginning with s, there is 
another (possibly the same) subcycle beginning with m — Ss. 

Proor. The elements of the second subcycle are just —sn’ 
(mod m). It is the same subcycle if for some r, n” = —1 (mod m’), 
for m! = m/(s, m).O 

The next theorem gives the group representation of the integers 
modulo m. 

THeorem 3. Factor m into powers of primes, m = py) +++ pr. 
Let r; be a primitive root of p;; that is, the powers ri* (mod p;) for 
k = 0,1,...,p — 2, comprise every positive integer less than p;: . 
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Define the generator g; = 1 + Rm/p{', where R = (rn, — 1) 
(m/pf*)— (mod p$*). Define the Euler totient function ¢(1) = 1; 
otherwise ¢(k) = the number of integers less than k having no com- 
mon factor with it. Then, for any integer x less than m, there exist 
unique indices j; for which 0 < ji; < o(pSi/(x, pf*)) and 
x = (x, m)gi? see gi" (mod m). 

Proor. In [4]; if any p; = 2, replace g?* by +577, where 0 < 
je GF 27, 2). 8 

_ For example, for m = 35, as in our example above, x = 
22713172 (mod 35) for (x,35) = land forO<j/, < 4and0< jz. < 6. 

Index notation is analogous to logarithmic notation in that 
multiplication modulo m becomes merely addition of indices. 

The following theorem solves the problem of the subcycle 
starting points. It is similar to the algorithm in [6]. 

THEOREM 4. Let n and m be defined as for (1). Then, for any 
integer x less than m, upper bounds J; may be found so that unique 
indices j; exist in the range Q < ji < Ji and x = +(x,m) 
nogit see gy! (mod m). 

Proor. Express n and —1 in index notation. Then, compute 
from the indices of the smallest e such that n® = 1 (mod mm). 
Initially, set each J; = o(p?*/(x, Pe*)). Next, doing only index 


arithmetic, examine each power +7 for nontrivial relations of the 
form gii = n’gi} oe git (mod m/(x, m)) where 0 < jy < Jk 
for each k. Then set J; = /;. Stop when the product of the J; and e 
equals ¢(m/(x, m)), which is the number of integers in subcycles 
divisible only by (x, m).O 

Notice that the choice of J; by this method is not unique. For 
example, continuing from above, for (x, m) = 7,n = 4, 
x = 7-4/022/1 (mod 35), for 0 < jp < 2andO <j; < 2. The rela- 
tions found were (— 1)! = 4! (mod 5), 22? = 4! (mod 5) and 31! = 4° 
(mod 5). 

Theorem 4 is more important in theory than in practice. The 
tremendous labor in finding primitive roots for large primes (since 
a table of roots is very bulky) and in finding the index representa- 
tion of 1 is not compensated for by time savings afterward; see the 
timing tests below. The same practical objection holds against the 
algorithm in [6]. 

Algorithm. An efficient program breaks naturally into two 
parts. First determine starting points for the subcycles. and then 
move the data. In each part, the program below is significantly 
faster than Algorithm 380 in [3]. 

For each divisor d of m, the subcycles beginning with d and with 
m — d are done. If the number of data moved is still less than 
¢(m/d), further subcycle starting points of the form sd are tried, 
for s = 2, 3,.... The most general test is that sd is acceptable if 
no element in its subcycle is less than sd or greater than m — sd. 
Since this test requires much time-consuming computation, it is 
much faster to look for sd in a table where marks are made to 
indicate that an element has been moved. In some applications, a 
bit within each datum may be used. For example, if the data are all 
biased positive, the sign bit may be used; or, for normalized, non- 
zero, binary floating point data, the high bit of the fraction is always 
one and so may be used. In general, a special table of length 
NWORK is used. As in [3], NWORK = (m + nz)/2 was found to 
be sufficient for most cases. However, when m has many divisors, 
Algorithm 380 must perform the time-consuming general test for 
many possible starting points when the new algorithm need not. ' 

The inner loop of the algorithm computes (1), moves data, 
marks in the table, and checks for loop closure. Since the major 
part of the time of the inner loop is calculating (1), time is saved 
over Algorithm 380 by moving elements v, and v,,-; Simultaneously. 
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Timing Tests 


ny Ny m (all times in msec) 

Alg.302 Alg.380 Alg.380 XPOS XPOS 

IWRK=0 IWRK= NWORK=0 NWORK= 
(m1+12)/2 (m+-M2)/2 

Ti T: T3 T,4 Ts T/T, T2/T. T3/Ts 
45 50. 13-173 350 317 167 133 67 2,62 2,38 2,50 
45 60 2699 558 123 117 90 100 6,20 1,37 1,17 
46 50 112-19 367 339 VAL 106 83 3,46 3,21 2,60 
46 60. 31-89 425 350 250 133 83 3,19 2,63 3,00 
47 50. 34-29 383 378 267 ops 67 5,18 5,23 4,00 
47 60 2819 483 127 133 90 100 5,36 1,41 1,33 
45 180° 7-13-89 1200 1050 816 517 300 2,25 2,03 re Ps 
45 200 8999 1767 408 416 283 300 6,25 1,44 1,39 
46 180 17-487 1816 1233 583 267 267 6,41 4,63 2,19 
46 200 9199 1700 508 417 383 317 4,44 1,33 1,32 
47 180 11-769 1450 1133 667 383 267 3,78 2,96 2,50 
47 200 3-13-24] 983 1150 1067 550 467 1,69 2,09 2,29 
Un apedial cases, farther savings may be made. For EER me ea eee te ete iy aon 
divisible by 2 only when both m and , are odd. Then the subcycles C M@VED IS A LOGICAL WORK ARRAY @F LENGTH NWOKKe 


beginning at m/2 — s and m/2 + s may be done simultaneously LOGICAL MOVED 
DIMENSIGN ACNI2)s MOVEDCNWORK) 


with the subcycles from s and m — s, thus reducing the number of C REALLY ACN1,N2)» BUT NI2 = Ni*N2 


times (1) is computed. DIMENSION IFACTC3)s IPQWER(8)s NEXP(8)5 LEXPC8) 
IF (NIeLT2 OR. N2eLT+2) RETURN 


Timing tests. A set of test matrices were transposed on the N= NI 
360/65 with all programs written in Fortran H, OPT = 2. The new Bana - 1 aaae 
algorithm. was always faster than both Algorithm 380 [3] and Al- © SQUARE MATRICES ARE BENE CeeANATERY Fon SPEED 


I1IMIN = 2 
D@ 20 ILIMAX=NsoMsN 
12 = I1MIN +N - 1 
D@ 10 L1=EIMIN, TI MAX 
ATEMP = ACI1) 
ACI1) = ACT2) 


gorithm 302 [2] when NWORK = (m + 1)/2. When NWORK = 0, 
it was slower than Algorithm 380 (for JWRK = 0) and Algorithm 
302 only for a few cases when mn, < 100. It was especially faster 
than Algorithm 380 when m = mt, — 1 had many factors and 
there were hence many subcycles. 

An experiment was made for cases when m was prime. A known l2 = 12 

eee 10 CONTINUE 

primitive root of m was then taken from a table [5] and was used 


IIMIN = LIMIN +N +1 
to generate subcycle starting points. Since no time was wasted 20 Sen 
in finding the primitive root or in finding subcycle starting points, C M@DULUS M IS FACT@RED INT@ PRIME PGWERS. EIGHT FACTORS 


SUFFICE UP TO M = 2*34547%11413417*19 = 957675520. 
30 CALL FACTOR(Ms IFACTs IPGWER» NEXPs NP@WER) 
D@ 40 IP=1.,NPOWER 
IEXPCIP) = 0 


this test showed the maximum time savable by implementing c 
Theorem 4. For NWORK = (m + m)/2 and m > 200, no im- 
provement was found over the normal algorithm. For NWORK = 0, 


pape 40 CONTINUE 
the gain in speed was never more than 25 percent. © GENERATE EVERY DIVISOR OF M LESS THAN M72 
IDIV = 1 
50 IF CIDIV-GEeM/2) GO Ta@ 190 
C THE NUMBER @F ELEMENTS WHOSE INDEX IS DIVISIBLE BY IDIV 
C AND BY N@ @THEK DIVIS@OR OF M IS THE EULEK TOTIENT 
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NCOUNT = M/IDIV 
D@ 60 IP=1ls.NPOWER 
IF CIEXPCIP)-EQeNEXPCIP)) G@ TO 60 
NCQUNT = CNCQOUNT/IFACTCIP) )*¥CIFACTCIP)-1) 
60 CONTINUE 
D@ 70 I=1lsNWORK 
MOVEDCI) = -FALSE. 
70 CONTINUE 
THE STARTING POINT OF A SUBCYCLE IS DIVISIBLE ONLY BY IDIV 
AND MUST N@T APPEAR IN ANY OTHER SUBCYCLE. 
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SUBROUTINE XP@OSECAs Nis» N2s N12» MOVEDs NWORK) 
TRANSPOSITION OF A RECTANGULAR MATRIX IN SITU. 
BY NGRMAN BRENNERs MIT» 1/7726 CFe ALGe 3802 
TRANSPOSITION OF THE NI BY N2 MATRIX A AMQUNTS TO 
REPLACING THE FIFMENT 4T VFCTGR PASTTION T €N-ARTGIND 


agagagn 


CACMs 5/70. 


ISTART = IDIV 
80 MMIST = M = ISTART 

IF CISTART-EQ-IDIV) GO TO 120 

IF CISTART-GT-NWOKK) GO TB 91D 

IF (MOVEDCISTART)) G@ T@ 160 
90 ISOID = ISTART/IDIV 

D@ 100 IP=tsNPOWER 

IF CIEXPCIP) -EQ-NEXPCIP)) GB TO 100 
IF (CM@DCISOIDsIFACTCIP))-EQ-0) GE TO 160 

100 CONTINUE 

IF CISTART-LE-NWO@RK) GO TO 120 

ITEST = ISTART 
110 ITEST = MODCN*ITESTsM) : 

IF CITESTeLToISTART -OR» ITEST-GTeMMIST) GO TO 160 

IF CITEST-GT-ISTART «AND. ITEST«LTeMMIST) GO TO 110 
120 ATEMP = ACISTART+1) 
BTEMP = ACMMIST+1) 
IAl = ISTART 
IA2 = MODCN*IAIs™) 
MMIAL = M - TAI 
MMIA2 = M - IA2 
IF CIAL+eLE-NWORK) MOVEDCIAL) = e TRUE} 
IF (MMIAL-+LE«NWORK) MOVEDCMMIA1) = «TRUE. 
NCOUNT = NCOUNT - 2 


130 


COLLECTED ALGORITHMS (cont.) 


C MOVE TWO ELEMENTS, THE SECOND FROM THE NEGATIVE 
C SUBCYCLE-. CHECK FIRST FOR SUBCYCLE CLOSURE. 


IF CIA2.EQ-ISTART) GO T@ 140 
IF (MMIA2-EQ-ISTART) GO TO 150 
ACIAL+1) = ACTA2+1) 
ACMMIAI+1) = ACMMIAQ+1) 
IAl = IA2 
G@ TS 130 
140 ACIA1+1) = ATEMP 
ACMMIA1+1) = BTEMP 
GO T@ 160 
150 ACIAI+1) = BTEMP 
ACMMIAIL+1) = ATEMP 
160 ISTART = ISTART + IDIV 
IF CNCOUNT.GT-0) G@ TO 80 
DO 180 IP=1sNPQOWER 
IF CIEXPCIP)+«EQe¢NEXPCIP)) GO T@ 170 
IEXPCIP) = ITEXPCIP) + 1 
IDIV = IDIV*IFACTCIP) 
G@ TS SO 
170 IEXPCIP) = 0 
IDIV = IDIV/IPOWERCIP) 
180 CONTINUE 
190 RETURN 
END 


SUBROUTINE FACTOR(Ns IFACTs IPOWERs NEXPs NPQWER)D 
C FACTOR N INT® ITS PRIME POWERS» NPOWER IN NUMBER. 


© EeGes FOR N=1960=2**3 *5 #7**25 NPOWER=3>5 
C IPQWER=8» 5549s AND NEXP=3s152.- 
DIMENSION IFACTC8)» IPQWERC8)s NEXPC8) 
Ip = 0 
IFCUR = 
NPART = 
IDIV = 2 
10 IQUGT = NPART/IDIV 
IF (NPART-IDIV*IQU@T)> 60» 205 60 
20 IF CIDIV-IFCUR) 40s 40» 30 
30 IP = IP + 1 
IFACTCIP) = IDIV 
IPOWERCIP) = IDIV 
IFCUR = IDIV 
NEXPCIP) = 1 
GO T@ SO 
40 IP@WERCIP) = IDIV*IPOWERCIP) 
NEXPCIP) = NEXPCIP) + 1 
50 NPART = IQUOT 
G@ TO 10 
60 IF CIQU@T-IDIV) 100» 1005 70 
70 IF CIDIV-2) 80s 80» 90 
80 IDIV = 3 


c¢) 
N 


G@ Tg 10 
90 IDIV = IDIV + 2 
Go Te 10 


100 IF CNPART-1) 140s 1405 110 
110 IF €NPART-IFCUR) 1305 1305 120 
120 IP = IP + } 
IFACTCIP) = NPART 
IPQOWERCIP) NPART 
NEXPCIP) = 
G@ TO 140 
130 IPQOWERCIP) = NPART*IPOWERCIP) 
NEXPCIP) = NEXPCIP) + 1 
140 NPOWER = IP 
RETURN 
END 


_ 


IFACT=3» 5s 7» 
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Algorithm 468 


Algorithm for Automatic 
Numerical Integration Over 
a Finite Interval [D1] 


T.N.L. Patterson [Recd. 20 Jan. 1971, 27 Nov. 1972, 
12 Dec. 1972, 26 Mar. 1973] 

Department of Applied Mathematics and Theoretical 
Physics, The Queen’s University of Belfast, Belfast BT7 
INN Northern Ireland 


Key Words and Phrases: automatic integration, numerical 
integration, automatic quadrature, numerical quadrature 

CR Categories: 5.16 

Language: Fortran 


Editor’s note: Algorithm 468 described here is available on magnetic 
tape from the Department of Computer Science, University of 
Colorado, Boulder, CO 80302. The cost for the tape is $16.00 (U.S. 
and Canada) or $18.00 (elsewhere). If the user sends a small tape 
(wt. less than 1 Ib.) the algorithm will be copied on it and returned to 
him at a charge of $10.00 (U.S. only). All orders are to be prepaid 
with checks payable to ACM Algorithms. The algorithm is recorded 
as one file of BCD 80 character card images at 556 B.P.I., even 
parity, on seven track tape. We will supply algorithm at a density of 
800 B.P.I.-if requested. Cards for algorithms are sequenced starting 
at 10 and incremented by 10. The sequence number is right justified in 
column 80. Although we will make every attempt to insure that the 
algorithm conforms to the description printed here, we cannot guaran- 
tee it, nor can we guarantee that the algorithm is correct.—L.D.F. 
and A.K.C, 


Description 

Purpose. The algorithm attempts to calculate automatically 
the integral of F(x) over the finite interval [A, B] with relative 
error not exceeding a specified value e. 

Method. The method uses a basic integration algorithm 
applied under the control of algorithms which invoke, if necessary, 
adaptive or nonadaptive subdivision of the range of integration. 
The basic algorithm is sufficiently powerful that the subdivision 
processes will normally only be required on very difficult integrals 
and might be regarded as a rescue operation. 

The Basic Algorithm. The basic algorithm, QUAD, uses a family 
of interlacing whole-interval, common-point, quadrature formulas. 
The construction of the family is described in detail in [1]. Begin- 
ning with the 3-point Gauss rule, a new 7-point rule is derived, with 
three of the abscissae coinciding with the original Gauss abscissae; 
the remaining four are chosen so as to give the greatest possible 
increase in polynomial integrating degree; the resulting 7-point 
rule has degree 11. The procedure is repeated, adding eight new 
abscissae to the 7-point rule to produce a 15-point rule of degree 23. 
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Continuing, rules using 31, 63, 127, and 255 points of respective 
degree 47, 95, 191, and 383 are derived. The 255-point rule has not 
previously been published. In addition, a 1-point rule (abscissa at 
the mid-point of the interval of integration) is included in the family 
to make eight members in all. The 3-point Gauss rule is in fact 
formally the extension of this 1-point rule. The successive applica- 
tion of these rules, until the two most recent results differ relatively 
by ¢ or better, is the basis of the method. Due to their interlacing 
form, no integral evaluations need to be wasted in passing from one 
rule to the next. 

The algorithm has been used for some time on practical problems 
and has been found to generally perform reliably and efficiently. 
Its domain of applicability generally coincides with that of the 
Gauss formula, which is much wider than commonly supposed 
[2]. It will perform best on ‘“‘smooth” functions, but the degree of 
deterioration of performance when applied to functions with various 
types of eccentricities depends more on the harshness of these 
eccentricities than on their presence as such. Integrands with large 
peaks or even singularities at the ends of the interval of integration 
are handled reasonably well. It may be noted that none of the rules 
actually uses the end points of the interval as abscissae. Peaks in 
the integrand at the center of the interval and discontinuities in the 
integrand are less easily dealt with. Although it is recommended that 
the algorithm be applied using the control algorithms described 
Jater, if desired it can be used directly as follows. 


The algorithm is entered by the statement: 
CALL QUAD (A, B, RESULT, K, EPSIL, NPTS, ICHECK, F) 


The user supplies: 

A lower limit of integration. 

B_ upper limit of integration. 

EPSIL required relative error. 

F F(X) is a user written function to calculate the integrand. 

The algorithm returns: 

RESULT an array whose successive elements RESULT(1), 
RESULT(2), etc., contain the results of applying the succes- 
sive members of the family of rules. The number of rules ac- 
tually applied depends on EPSIJL. The array should be de- 
clared by the calling program to have at least eight elements. 

K element, RESULT(K), of array RESULT contains the value 
of the integral to the required relative accuracy. K is determined 
from the convergence criterion: 


| RESULT (K) — RESULT (K — 1) | 
< EPSIL* | RESULT (K) | 


NPTS number of integrand evaluations. 

ICHECK this flag will normally be 0 on exiting from the sub- 
routine. However, if the convergence criterion above is not 
satisfied after exhausting all members of the family of rules, 
then the flag is set to 1. 

The control algorithms. Two control algorithms are provided, 
QSUBA and QSUB, which if necessary invoke subdivision respec- 
tively in either an adaptive or a nonadaptive manner. QSUBA is 
generally more efficient than QSUB, but since there are reasons for 
believing [2] that adaptive subdivision is intrinsically less reliable 
than the nonadaptive form, an alternative is provided. 


COLLECTED ALGORITHMS (cont.) 


Table I. Test Integrals and Their Values 


I 2 
1. i Vx dx = 3 
F 5 


1 
2. [ {0.92 cosh (x) — cos (x)] dx = 0.4794282267 


1 
3: [ dx/(x4 + x? + 0.9) = 1.582232964 
a 


‘al 

2 

4, i xtdx = 3 
0 


1 
3: | dx/(1 + x4) = 0.8669729873 
0 
1 
6. / dx/(1 + 0.5 sin (31.4159x)) = 1.154700669 
0 
1 
ce i x dx/(e* — 1) = 0.7775046341 
0 


1 
8. i sin (314.159x)/(3.14159x) dx = 0.009098645256 
0.1 


10 
9. | 50 dx/(2500x? + 1)/3.14159 = 0.4993638029 
0 


3.1415927 
10. / cos (cos (x) + 3 sin (x) + 2 cos (2x) 
0 
+ 3cos (3x) + 3 sin (2x)) dx = 0.8386763234 


1 
11. / In (x) dx = —1.0 
0 
1 & 
12. | 4n’x sin (207x) cos (27x) dx = —0.6346651825 
0 


1 
13. / dx/(1 + (230x — 30)?) = 0.0013492485650 
0 


The adaptive algorithm QSUBA. QUAD is first applied to the 
whole interval. If a converged result is not obtained (that is, the 
convergence criterion is not satisfied), the following adaptive sub- 
division strategy is invoked. At each stage of the process an interval 
is presented for subdivision (initially the whole interval (A, B)). 
The interval is halved, and QUAD applied to each subinterval. If 
QUAD fails to converge on the first subinterval, the subinterval is 
stacked for future subdivision and the second subinterval imme- 
diately examined. If QUAD fails to converge on the second sub- 
interval, it is immediately subdivided and the whole process re- 
peated. Each time a converged result is obtained it is accumulated 
as the partial value of the integral. When QUAD converges on both 
subintervals the interval last stacked is chosen next for subdivision 
and the process repeated. A subinterval is not examined again once 
a converged result is obtained for it, so that a spurious convergence 
is more likely to slip through than for the nonadaptive algorithm 
QSUB. 

The convergence criterion is slightly relaxed in that a panel is 
deemed to have been successfully integrated if either QUAD con- 
verges or the estimated absolute error committed on this panel 
does not exceed « times the estimated absolute value of the integral 
over (A, B). This relaxation is to try to take account of a common 
situation where one particular panel causes special difficulty, per- 
haps due to a singularity of some type. In this case, QUAD could 
obtain nearly exact answers on all other panels, and so the relative 
error for the total integration would be almost entirely due to the 
delinquent panel. Without this condition the computation might 
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continue despite the requested relative error being achieved. The 
risk of underestimating the relative error is increased by this pro- 
cedure and a warning is provided when it is used. 

The algorithm is written as a function with value that of the 
integral. The call takes the form: 


QSUBA(A, B, EPSIL, NPTS, ICHECK, RELERR, F) 


and causes F(x) to be integrated over (A, B) with relative error 
hopefully not exceeding EPSIL. RELERR gives a crude estimate of 
the actual relative error obtained by summing the absolute values of 
the errors produced by QUAD on each panel (estimated as the 
differences of the last two iterates of QUAD) and dividing by the 
calculated value of the integral. The reliability of the algorithm will 
decrease for large EPSIL. It is recommended that EPSIL should 
generally be less than about 0.001. F should be declared EXTERNAL 
in the calling program. NPTS is the number of integrand evaluations 
used. The outcome of the integration is indicated by ICHECK: 

ICHECK = 0. Convergence obtained without invoking subdivi- 
sion. This corresponds to the direct use of QUAD. 

ICHECK = 1. Subdivision invoked and a converged result ob- 
tained. 

ICHECK = 2. Subdivision invoked and a converged result ob- 
tained but at some point the relaxed convergence criterion was 
used. If confidence in the result needs bolstering, EPSJL and 
RELERR may be checked for a serious discrepancy. 

ICHECK negative. If during the subdivision process the stack of 
delinquent intervals becomes full a result is obtained, which 
may be unreliable, by continuing the integration and ignoring 
convergence failures of QUAD which cannot be accommodated 
on the stack. This occurrence is noted by returning JCHECK 
with negative sign. 

The nonadaptive algorithm QSUB. QUAD is first applied to 
the whole interval. If a converged result is not obtained the follow- 
ing nonadaptive subdivision strategy is invoked. 

Let the interval (A, B) be divided into an panels at step N of 
the subdivision process. QUAD is first applied to the subdivided 
interval on which it last failed to converge, and if convergence is 
now achieved, the remaining panels are integrated. Should a con- 
vergence failure occur on any panel, the integration at that point is 
terminated and the procedure repeated with N increased by one. 
The strategy insures that possibly delinquent intervals are examined 
before work, which later might have to be discarded, is invested on 
well behaved panels. The process is complete when no convergence 
failure occurs on any panel, and the sum of the results obtained by 
QUAD on each panel is taken as the value of the integral. 

The process is very cautious in that the subdivision of the inter- 
val (A, B) is uniform the fineness of which is controlled by the suc- 
cess of QUAD. In this way it is much more difficult for a spurious 
convergence to slip through than for QSUBA. The convergence 
criterion is relaxed as described for OSUBA. 

The algorithm is used in the same way as QSUBA and is called 
with the same arguments as QSUBA. One of the possible values of 
ICHECK has a different interpretation: 

ICHECK negative. If during the subdivision process the upper 
limit on the number of panels which may be generated is 
reached, a result is obtained, which may be unreliable, by con- 
tinuing the integration ignoring convergence failures of QUAD. 
This occurrence is noted by returning JCHECK with nega- 
tive sign. 

Tests; The algorithms have been found to perform reliably on 
a large number of practical problems. To give a feeling for the 
performance, results for a number of contrived examples are given 
using the adaptive control algorithm, QSUBA. It would be dif- 
ficult to justify these examples as acid tests of any method, but they 
have the advantage of having being quoted at various times in the 
literature. 

For comparison a number of automatic procedures were used, 
which include SQUANK [3] (adaptive Simpson), as well as the 


COLLECTED ALGORITHMS (cont.) 


Table II. Relative Error Requested, 10~3 


Integral Neapre Naosvea Toeanre/Tosusa 
1 17 15 1.8 
2 17 7 2.9 
3 33 15 4.4 
4 9 7 1.9 
5 9 7 2.2 
6 175 127 3.2 
7 9 7 1.8 
8g 1137 255 8.5 
9 97 127 2.4 

10 107 63 2.2 

11 137 31 9.9 

12 252 63 6.3 

13 129 787 52 


N and T with appropriate subscripts give respectively the num- 
ber of integrand evaluations and the time taken for the com- 
putation. 


Table III. Relative Error Requested, 10-® 


1 33 63 75 
Z 33 15 2.6 

3 49 31 3.0 

4 129 31 5.0 

5 17 15 2.0 

6 401 255 2.9 

7 9 7 1.8 

8 2633 253 18. 

9 281 255 2.4 

10 193 63 3.8 

1] 233 795 .74 
12 532 127 6.4 

13 305 1001 .90 


Table IV. Relative Error Requested, 1078 


1 65 255 . 36 
2 33 15 2.7 
3 97 31 4.9 
4 545 31 20. 

5 65 31 3.6 
6 569 255 3.8 
7 17 1) 1.6 
8 4001 255 24. 

9 337 255 2.8 
10 305 127 2.8 
11 297 2415 .28 
12 932 127 10. 
13 481 1017 tl 


modified Havie integrator [4] and CADRE [5] (both based on the 
Romberg scheme). The latter algorithm, which attempts to detect 
certain types of singularities using the Romberg table, was found, 
on the examples tried, to be the best overall competitor to QSUBA, 
and only this comparison is quoted. The Havie algorithm was par- 
ticularly poor and had the disturbing feature of converging spu- 
riously on periodic integrands. Thacher [6] has described the short- 
comings of Romberg integration, and Algorithm 400 appears to 
exhibit them. SQUANK was found to be quite good when used 
at low accuracy, but the performance deteriorated as the demand for 
accuracy increased. It also gave trouble on some of the more awk- 
ward integrals such as 8 and 11. SQUANK also computes the in- 
tegral in the context of absolute error, and since this is meaningless 
unless an estimate of the order of magnitude of the integral is 
known, the algorithm can hardly be described as automatic. 
CADRE allows a choice of absolute or relative error. A criticism 
sometimes levied at relative error is that should the integral turn 
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out to be zero a difficulty will arise. The only advice that can be 
offered in this respect is that, should a user suspect that this is likely 
to happen, a constant should be added to the integrand reflecting 
some appropriate quantity such as the maximum of the integrand. 
The constant which will be integrated exactly can be removed after 
the algorithm has done its work. 

The test integrals are listed in Table I, and the results obtained 
for various required relative accuracies in Tables II, IJ, and IV. 
Generally QSUBA is superior by a substantial margin. The methods 
are compared in terms of the number of integrand evaluations 
needed to obtain the required accuracy and also in terms of the 
times required. For simple integrands the bookkeeping time of 
some methods can be significant, and QUAD can obtain a con- 
siderable advantage by its relative simplicity. Integrals 11 and 13 are 
interesting examples of this. The number of integrand evaluations 
exceeding 255 indicates that QSUBA invoked subdivision to obtain 
the resuJt. In Tables III and IV QSU8A returned ICHECK = 
2 on integral 11, but the requested tolerance was achieved. 

Integral 8 caused special difficulty to CADRE, and for Tables 
Ill and IV a converged result could be obtained only after a relatively 
large investment of computer time. The feature of CADRE to 
detect certain singularities should show up in integrals 1 and 11, 
but the gain does not emerge until high accuracy is requested as in 
Table IV. For harsher singularities the gain would likely become 
apparent earlier. 
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Algorithm 


SUBROUTINE QUADCAs Bs RESULT» Ks EPSILs NPTS»s ICHECKs F) 
DIMENSIGN FUNCT(127)s5 PC381)» RESULTC8) 

THIS SUBR@UTINE ATTEMPTS T@ CALCULATE THE INTEGRAL OF F(X) 
GVER THE INTERVAL *A* T@ *B¥ WITH RELATIVE EKROK NOT 
EXCEEDING *EPSIL*.- 
THE RESULT IS OBTAINED USING A SEQUENCE OF 15387915531 63% 
1272 AND 255 POINT INTERLACING FORMULAECNO INTEGRAND 
EVALUATIONS ARE WASTED) OF RESPECTIVE DEGREE 1552115235 
472953191 AND 383+ THE FORMULAE ANE BASED @N THE OPTIMAL 
EXTENSION GF THE 3=P@INT GAUSS F@KMULAs DETAILS OF 
THE FORMULAE ARE GIVEN IN *THE @PTIMUM ADDITION OF PGINTS 
T@ QUADRATURE FORMULAE' BY TeNeLe PATTERSON» MATHS +COMP. 
VOL 228 47-856s 19686 

*ee INPUT *** 
A LOWER LIMIT OF INTEGRATION. 
UPPER LIMIT OF INTEGRATION. 
RELATIVE ACCURACY KEQUIRED. WHEN THE RELATIVE 
DIFFERENCE OF TW2 SUCCESSIVE FORMULAE DGES NOT 
EXCEED *EPSIL* THE LAST FORMULA COMPUTED IS TAKEN 
AS THE RESULT. 
F FCX) IS THE INTEGRAND. 

*** QUTPUT *** 


B 
EPSIL 


RESULT THIS ARRAYs WHICH SH@ULD BE DECLARED T@ HAVE AT 
LEAST 8 ELEMENTSs H@LDS THE KESULTS OBTAINED BY 
THE 1537s ETCes POINT FORMULAE THE NUMBEK OF 
FORMULAE COMPUTED DEPENDS @N *EPSIL*. 

K RESULTCK) H@LDS THE VALUE @F THE INTEGRAL T@ THE 
SPECIFIED RELATIVE ACCURACY. 

NPTS NUMBER INTEGRAND EVALUATIONS - 

ICHECK ON EXIT N@RMALLY ICHECK=0- HOWEVER IF CENVERGENCE 


TO THE ACCURACY REQUESTED IS N@T ACHIEVED ICHECK=1 
ON EXIT. 
ABSCISSAE AND WEIGHTS @F QUADRATURE RULES AKE STACKED IN 
ARRAY *P* IN THE @RDER IN WHICH THEY ARE NEEDED. 
DATA 
PC bdsPC 2dePC 3dePC 4)ePC SdeFC 6)5PC 7)5 
PC BdsPC APIs*PCIOIsSPCLLIsPCLAISFCIII» PCL 4)» 
PCLSePCI6ISPCLTISPCIBISPCIIIZSP C20) sPC21 5 
PC22)5PC23)ePC(24)9PC25) 9 PCO6)sF C27) s PC28dS 
0677459 6669241 48337704E 00506¢S5555555555555555556E 00s 
0 -8888888S8888E8888E889E 00s 0.268 488089868333 44073E 00> 
096049 126870802028342E 0020-10465622602646726519E 00s 
0 «4342437493 4680255800E 0050+-40139741477596222291E 00» 


gQaaagangaaangagaagananagaagaaaaagaaaaanaaa 


HERHH HH EK 


A 


4, 


COLLECTED ALGORITHMS (cont.) 


ERE HEH HEE 


0. 45091 65386584741 4235E 0090+613441525524378422036E 00>» 
051 603282997079739 69 7E-01 20 «200628 52937698902 103E 00» 
0 -99383196321275502221E 00.0-170017196299 40260339E-01>5 
0 «888 45923287225699889E 00s0-92927195315124537686E-015 
066211029 467372264029 4E OOsOelLTISILIIOIISESIISBOTIE 00s 
0 022338 66864289 6688163E 0020-21915685840158749640E 00s 
0 -22551049979820668739E 00s0+67207754295990703540E~-O0l» 
0 2580759809 6176653 565E-0140-10031 427861179557877E 00+ 
0 08 43456573932 11062 463E-0250 + 46462893261 75798 6541 E-Ols 
0 «857559200 49990351 154E-0150+109578 42105592 463824E 00/7 


DATA 


HRHRERKRHKEEKNRHKHHKEK HE 


PC29)eP030)5sPC31 9 PC 32)59P633)oPC34)5PC35)> 
P636)9P637)2PC38)5PC39)sPC 4009 PC 419 PC 42), 


P(643)9PC44)9PC45)9PC46)9PC 47) sPC 48 a PCI)» 
P50) PC519sPC52)9PC53)5PC54)9PC55)59PC56)/ 


0+99909812496766759766E 
0-9815311495537 4010687E 
02929654857 4297 400 5667E 
0+836725938 1 6886873550E 
0-702 49 6206491 527078 61E 
0+53131974364437562397E 
0+33113539325797683309E 
041124889 4313318662575E 
061127552567207 68 691 61E 


00s 062544780791 56187441 S4E-025 
0050+164460 49854387810934E-01>5 
0050 +35957103307129322097E-O15 
00s 0-56979509 49 4123357412E-01>» 
0020-76879 620 499003531043E-01>5 
0020 +93627109981264473617E-Ols 
0020-10566989358023480974E 00, 
0050-11195687302095345688E 00> 
0050336038771 48207730542E-015 


0-129038001003512 6562 6E=-0120+5015713930589953741 4E-Ol» 
0 4217630 44155885 48391 E-02s 0623231 446639910269 443E-01,4 
0 ¢ 428779 6002500773 449 3E-0150+-547892105279 628 65032E-01» 
012651 5655623006801 1 4E-02s 0 -8223007957235929 6693E-02, 
0+17978551 568 126270333E-0 150.28 4897547 45833548 613E-01/ 


DATA 
PO57)9PC58)sPC59)5PC60)sPC61)» PC62)5PC63)> 
PC64)5PC65)2PC66)sPC67) 5 PC 68)sPC693sPC70)» 
POCT12sPC72)9PC73) 2 PCTAD SPOTS Oo PCTE9sPCTT)> 
PC78)sPC79)sPC80)sPC81)sPC82)2PC83)sPCB 4S 


RRHRRHHHEKHHEHHEKEHKEHHEE 


0-38 439810249 455532039E- 
065283 49467901165198 62E- 


0 -9998728881203576119 4E 
0.99720625937222195908E 
0-988 68 4757547 429 4799 4E 
0-97218287474858179658E 
069 463 428 583734029051 5E 
0-91037115695700429250E 
048 6390793819369047715E 
0-80694053195021761186E 
0+7397560 4435269 4758 68E 
066290966002 478059 546E 


0120+ 46813554990628012403E-01,5 
0150-55978 436510476319 408E-O1ls 
0020363221 4818 45530659 69E-03,5 
0050 2257904979 468 5688272 4E-02s 
0020-6115506822117246339T7E-02;5 
002 0+-104982 46909 621321898E-O1>5 
005 0+154067 50 466559 497802E-O0ls 
00s0-2059423391 59127111 49E-Ot>» 
0030-25869 67932721 4746911 E-Ol>» 
0050-31073551111687964880E-01>5 
0050 +36064432780782572640E-Ol> 
0020-4071 551011 6944318934E-015 


0+57719571005204581 48 4E 000+ 44914531 65363219741 4E-O1s 
0+ 4836180269458 4102756E 0020+ 48564330 40667319871 6E-017 


DATA 


HRHRHRHKEHRKRKRHHHHHEH HH E 


RHR KHHKHKHHEHRA HRHHKHRHE*E 


HKREKHRHKEKRKRKRHRHKREE EE 


PC 85)sPC 86)ePC 87)2PC B8BIsPC B9IePC 9OI2PC 91> 

PC 92)sPC 93)sPC 94)2PC 95)sPC 96)9PC 97)2PC 98)» 

PC 99)ePC100)sPC101)5PC102)sPC103)5PC104)sPC105)> 

PC106)sPC107)sPC108>»PCLOIPISPCLIOISPCLIIIsPCLI2)/ 
0+ 38335932 419873034692E 0020+51 5839253952048 458777E-Ol>» 
0227774982202182431507E 0020+53905499335266063927E-01>5 
0.168235251552207 46498E 0020-55481 404356559363988E-01>5 
02563443130 465927899 72E-01 20 «56277 699831254301273E-01>5 
056377 62836038 471 7388E-0150-16801938574103865271E-01>+ 
0. 64519000501 757369228E-025 0225078569 6529 49 TE8707E-01>s 
0 «210881524572 66328793E-025041161 572331995513 4727E-O1>5 
0-21 4389800125038 672 46E-0150 +2739 460526398 143251 6E-O1, 
Oe 63260731936263354422E-03s 0+- 41115039786546930472E-02,5 
0 «898927578 40641357233E-025 061 42 4487737291677 4306E-015 
02192199051247277 6601 9E-010+23406777 49531 4006201 E-O1+s 
002641 747339 5058259931E-0150 «27989218255238159704E-Ol> 
0.18073956444538835782E-03. 0.128952 408261041 73921E-02>5 
0630577534101 755311361E-02s 0-52 49123 454808859 1251E-02/ 

DATA 

POLIS »SPCLI4AsPCLISIoPCLI6sPCLL7IsPCLIBIZPCI1IDs 
PC120)»PCI]1 os PC122)5PC123)sPC12 42» PCL2A5)s PC126)s 
P0127) 5P(128)5PC129)5PC130)9PC131)sPC132)sPC133)> 
PO13.4)9PC135)59P6136)9PC137)sPC138)9PC139)e PCL 407 
0-77033752332797 418 482E-023 0. 10297116957956355524E-0l» 
0 «1293 4839663607373 455E-0120+155367755558 43982 440E-01> 
0+«18032216390391286320E-0150-20357755058 472159 467E-O01, 
0622 4572658268 16098 707E-0150 «24282 1 6520333 6599358E-Ols» 
025791 62697 602 4229 3868E-015 0269527 49 667633031963E-O1>» 
06277 4070217827968199 4E-0120+281388 499156271 50636E-0l> 
0-99998243035489159858E 00»0-505360952078 6251 7625E-04s 


06999598799 67T1IIIO6B325E 002003777 4664632698 46602 7E-035 


0-998316635318 40739253E 
0299572 4104698 407188 S51E 
0991495721178 10613240E 
Oe9SS371L 499598520371 11E 
069771 4151463970571 41 6E 
0-966637851558 41 656709E 


00s 0 -9333698 4854238150079E-03>5 
0020+) 6811 4286542146990 63E-02+ 
0020+25687649 4379 40203731E-025 
0020 +3572892783517299 649 4E-025 
0020+ 46710503721143217474E-02,5 
0020-58 43 4498 758356395076E-02/ 


DATA 
PCL41)sPC142)5PC1439)59PC144)5PC145)9PC1 46.9 PCL 47)5 


PO148)2PC149)sPC1S50)sPCLS1) sPC152) 9 PC153)5PC154)5 
PO155)9PC156)sPC157)sPC1582sPC15S9»PCL60)sPC161)> 
PC162)2P6163)sPC164)oPC165)5PC166)9PC167)sPCLE8)/ 


0-95373000642576113641E 
0-93832039777959288365E 
0.9203 40025470012 42073E 
0 8997448997769 4003664E 
067651341 448 47052697 4E 
0 085064449 47 6835027976E 
0-82215625436498040737E 
079108 49337998 48361 43E 
0-757 48396638051363793E 
0-721 4230853700989 1 548E 
0+ 68298743109 107922809E 
0 642276642509759 S1377E 
0-599 4039302 422 4289297E 
0+655449513263193254887E 


0050-7072 489995433555 4680E-02+ 
0020-83 428387539 681577056E-025 
0020 -96411777297025366953E-025 
0020+ 1095573338 7837901 648E-01>+ 
00s 0-12275830560082770087E-Ol» 
00s 0-13591571009765546790E-O0l» 
0020-1 489364166481 5182035E-O1>» 
0050+161732187295777199 42E-Ol>» 
0020+17421930159464173747E-Ol» 
0020-186318 48256138 790186E-Ol» 
0020+197954950 48097499 488E-O1> 
00502209058 514458 12023852E-01>s 
0020 +21956366305317824939E-Ol> 
00s 0-229 409 64229387748 761E-017 


KRHHKKEKRKEHHKH HEHEHE HE 


468-P 4- 06 


DATA 


PC169)sPC170)oPC171)5PC172)5PCO1739)5PC174)9PC175)5 
PO176)sPCO177)9PCO178)sPC179ISPC1BOsPCIS1I»PCLB2)5 
PO183)sPC184)sPC185)5PC186)sPC187)sPC188)2PC189)» 
PO190)sPC1913sPC192)9PC193)sPOLI4)SPCIISISPCI96)S 


0+-50768775753371660215E 
0+¢45913001198983233287E 
0+ 40897982122988867241E 
0-357 40383783153215238E 
030 457644155671 40433 4E 
0 «250 67873030348317661E 
0+19589750271110015392E 
O+1 40 42 423315256017 459E 


00+ 0-23854052106038540080E-01, 
0020 -24690524744487676909E-O1> 
0050+254457 69965464765813E-O1s5 
0020 .26115673376706097680E-Ol>» 
0020-2669 6622927450359906E-Ol» 
0020627185513229 62479 1819E-O1,» 
0020027579749 566481873035E-Ols 
0050-27877251 ATH6613Z701L609E-Ot>» 


0 +8445 40 400837 10883710E-0150+28076455793817246607E-Ols 
0 +2818 46489 4974569 4339E-0150+28176319033016602131E-015 
0+2818881 4180192358 69 4E-0150+8 4009 692870519326354E-02>5 
0 «32259 500250878 68 461 4E-025 0+ 1253928 482647488 4353E-O01> 
01054407 6228 6331 67722E-02»+ 0+ 58078 61 6599775673635E-02, 
0+10719490006251933623E-0140-13697302 63199071 6258E-01/ 


DATA 


HRREKKRKEHKKKRAKHHEH EK EE 


PO197)sPC198)sPC199)5P(200)sPC201)PC202)sPC203)5 

PC204)sPC205)sP(206)2P(207) > P0208) sPC209)sPC210)5 

PC2V1sPC212) oP C213) sPC214) oP C215) sPC216) oP C217)» 

P(218)sPC219)2PC220)sPC221 » PC 222) 5PC223)9P(224)/ 

0+ 3163036608222 6447 689E~0350220557519893273 465236E-02» 
0+ 449 46378920320678 61 6E~02s 0.7122 438 68 64583871532E-02, 
0 «96099 525623638830097E-0250+11703388747657003101E-O1,5 
0+ 13208736697 529129966E-0150-1399 4609 1276190798 52E-01, 
0+9037273 46587511 49261 E-0 49 0. 6447620 41 30572.477933E-03, 
O:+ 15288 767050877 65568 4E-02» 02624561 727 40 4429 S626E-025 
0-3851 68761663987092 41 E-02,0.51 485584789781777618E-02, 
0- 6467 419831803686727 4E-02+ 0.77 6838777792199 t2200E-02» 
0-901 61081951956431 600E-02s0.10178877529236079733E-Ols» 
0-11228632913408049354E-01+0-.121 41082601 668299679E-O1>, 
0-128958134880121 1469 4E-0140-13476374833816515982E-01, 
0+13870351089 1398 40997E-015061 4069 42.4957813575318E-01, 
062515787038 4280661 489E-0 45 0+ 18887326450650 491366E-03, 
0+ 46918 492 42 478 50 4097 SE-030-8 40571 432710722 46365E-03/ 


DATA 


HEHE KHH EHR HE HRKH KH HE 


PC225)sPC226)5PC227) s PC 228 5 PC229) »PC230)sPC231)> 
P C232) 5P(233)5PC234)5P6235)5P6236) P6237)» PC238)5 
PC239)sP(240)5P(241)sP(242)5P(6243)9PC244)5PC245)5 
P6246)5P(247) P6248) sPC249) s PC250) os POSSI» PCA52)/ 
0-128 4382 4718970101768E-0250-1 786446391758 64982 47E-02,5 
0-23355251860571 608737E-0220.292172 493791 7819 7538E-025 
0635362 4499771677773 40E-025 0. 4171419397698 40788 528E-02, 
0+ 48205888 64851268347 6E-02s 0.54778 666939 1895082 40E-02>, 
0+ 61379152800 413850 435E-0220.679578 550 488277339 48E-02, 
027 446820832 407591017 4E-02+ 0.808 66093647888599710E-025 
0-871096507973208 68736E-0250 -931592 41280693950932E-02, 
0298977 4752 40 487 497 440E-0220.1045292572290601 1926E-O1,> 
0+10978183152658912470E-0150.11 47048211 469387 4380E-01, 
0 119270260530192700 40E~01s0-123452 623722 43838 455E-01>» 
061272288 4982732382906E-0150-1305783 66883530488 40E-Ot» 
0613348311 463725179953E-0150-1359275661 48123959 10E-O1; 
021378987 478 32 4093651 7E-O1s0.13938 6257383068 S080 4E-Ols 
0-1 403822789 6908 623303E-0150.1 408815951 6508301065E-017 


DATA 


KRREHKHHRHHRHHK HEHEHE HE 


PC€253)5P(254) »PC255)5PC256) 9 PC257)» PC258 de POASI)» 
PC260)sP(261)5PC262)5PC263)sPC264) »PC265)»PC266)>» 
P(267)sPC268)sPC269)5PC270) »PC2T1 ds PCLT2)» PC2T3)» 
PC274)5PC275)5P6276) PC 277)» PC278) s PC2T9)» PC2BO)S 


0699999759 63797 48 46462E 
00-9999 4399620705437 S76E 
0299976049092 44320 4733E 
0+99938033802502358193E 
0-99874561 446809511 470E 
0 -99780535449595727 456E 
069965141 459 1 489027385E 
0+99483150280062100052E 
0 «9927213 4428278861533E 
0990151370 4007701 5918E 
0+98709252795403 406719E 
0-983518657578 63272876E 
0-979 406281 6708 62 68381E 
069747344597 S2 4026677 6E 


DATA 


HHRHKKHRHHEHEKRHRKHHKHHEHEE 


HEHREEKRHRKE*E 


00s 0+ 6937936432 41082671 70E-05,5 
0020453275293 669780613125E-0 4» 
00s061357549109 4922871973E-03> 
0020 -249212 400 48299729 402E-03,> 
0020-3897 4528 447328 229322E-035 
0020255429531 493037471 492E-03, 
0050.74028280 42 44503330 46E-03, 
0030.9 45361 51 6858525382 46E-03, 
0020¢116748 4117429959 4077E-02,5 
000-1! 4049079956551 446427E-02, 
0050.16561127281544526052E-02, 
00s0-19197129T7T1I0138724125E-02, 
0020-219 44069253638 388388E-02, 
003062 47895822 66575679 307TE-02/ 


PC281)sPC282)5PC283) oP (284) »PC285)2PC286)5PC287)» 
PC288)5P6289)sPC290) »P C291)» PC292)9PC293)» PC29 4)» 
PC295)5PC296)9PC297) + P6298)» P6299) PC300)sPC301)>» 


PC302)sPC€303) P6304) P6305) »P( 306) » P6307) sP(308)7 


0-969 48 4659 502 45923177E 
0+96364062156981213252E 
095718821 61098 609627 4E 
0-95011529752129 487656E 
0-942 41156519108305961E 
0°934068 43615772578800E 
0+92507893290707565236E 
0+91543758715576504064E 
09051 4035881326159519E 
0-89 418.45683355590228 6E 
0 68825688 402 473 419068 4E 
0-8702930555481139058 SE 
085735831088 623215653E 
0-8 43766882 672708 6010 4E 


0020277219576 459345099 40E-02, 
00+0-3073018 434702578323 4E-02, 
00s 06338039799 108 69203823E-02, 
0020-36933779170256508183E-02, 
0050. 401106872 40750233989E-02>5 
00s 0+ 43326409 680929828 545E-02,5 
0020+ 465731 72997568 547773E-025 
0020-498 4364564765538 6012E-02, 
00s 0-531308 66051870565663E-02, 
0020-56428181013844441585E-02, 
0020-5972919565508 1 6580 49E-02, 
0020+ 6302773 4490857587172E-02, 
0020+ 66317812 4290188789 41E-02, 
0050-69593 61 409390 422939 4E-028/ 


DATA 


P€309)sPC310)5PC311)5PC312)5PC313)sPC314)sPC315)> 
PCO316)5PC317)5PC318)2PC319)5PC320)sPC321)5PC322)5 
P6323) 2PC324) PC 325) sPC326)5PC327)sPC328)sPC329)> 
PC330)5P(331)sP0332)5PC333) 2PC334)5PC335)2PC3396)7 
0-829522194637401 40018E 0020-72849 479805538070639E-02.5 
0081 4628787655137 41344E 0020+7607989 66571905658 32E-02,5 
0+7990922909608 41 40180E 0020-79279 4933 429 48 491103E-02, 
0+78291939411828301639E 00s0.82 443037630328 680306E-02, 
0+76611781930376009072E 0020.85565435613076896192E-02, 


COLLECTED ALGORITHMS (cont.) 


0 - 748 69 629361 693660282E 
0+73066452124218126133E 
0-71203315536225203 459E 
0.69281376977911A7T0289E 
0+673018830230418 47920E 
0-652661 66541001 749610E 
0663175643771 119423041E 
0-6103181137151864001 6E 
0-588362 434447662541 43E 
DATA 


HHH HKHE HHH 


Hee HEH 


0-56590588542365442262E 
0+542965666498311 490 49E 
0¢51955966153745702199E 
0.49570 6407918761 4601 7E 
0.471 425065871 65887 693E 
0 ¢ 44673538 7662028 4737 4E 
0. 421 65768 66261 6330006E 
0-39621280605761593918E 
0-370 4220879500782301 4E 
0+34430734159943802278E 
0317890812068 47668318E 
02911951 485182 4668196E 
0 «264243372 4109267619 4E 


00+0-8864173209 482 49 42 641E-02, 
00+0-9166711163560788 4067E-02, 
00209463 68999383006529 43F-02» 
00+0.975465653631741 14611 E-U2s 
0020-100391720440568 40798E-O1, 
000.1031 68123309 47621 682E-01,» 
0050-1058 716790 4885197931 E-Ols 
0050-108 498 4408933731 4099E-01>» 
00+0-111044611394006926537E-O017 


P6337) 2PC338)5PC339)+PC6340)9PC 9419s PC 342)92P(343)5 
P0344) »P6345)2P6346)sP6347)»PC348)5PC6349)sPC350)> 
PC351)5P03952)5PC353)2P6354)sPC 355) 2PC356)sPC 357)» 
PC35B)sPC359) P6360) P6361 2 P6362) 5 PC363)sPC364)7 


002 0-11350654315980596602E-01+ 
00201158807 40330 43952568E-01s 
0020-1181 6385890830235763E-Ols 
00,0-1203527078 5279 562630E-Ol» 
0050-1224442 4981 611985899E-Ols 
0020+1244356019071 4035263E-01% 
0050-12632 403643542078 765E-Ols 
0050-128106981 63877361967E-Ol»s 
0020-1297820223953739928 6E-Ol» 
0050+131346900919601 52836E-Ol» 
0020+13279951743930530650E-Ol,» 
0050-13413793085110098513E-Ol» 
0020 «1353603593 495621361 4E-01>» 


KR K HEHEHE KRHHK HE 


0 «2370588 4558982972721E 0020+13646518102571291 428E-O1/ 
DATA 
PC365)2PC366)sPC367)+P(368)sPC369)sPC370)sPC371)» 
PO372)sPC373)5P0374)sPC375)5P0376)sPC377)9PC978)» 
* PC3T9IsPC380)IePC 38197 
0+209 6652382 4318119477E 0020-1374509344300189 6632E-O0l»s 
0+18208649675925219825E 0020+13831631909S06428676E-Ols 
0-15434681148137810869E 00,0-13906019601325461264E-Ot» 
0+12647058 437230196685E 0050+1396815880651 693851 6E-Ol» 
0-98 4823965981 19202090E-0120+140179 680394566088 10E-Ols 
0-70 4069760 428551 79063E-0150-1 4055382072 649964277E-Ol» 
0-422 69164765363603212E-0120+ 1 4080351962553661325E-0ls 
0 «1409388 641078246261 4E-0120-1 40928 450691 60 408355E-01.s 
0+1 409440709009 61793 47E-O17 
ICHECK = 0 
C CHECK F@R TRIVIAL CASE. 
IF CA-EQ.B) GO TE 70 
C SCALE FACTORS. 
SUM = (€B+A)/2.0 
DIFF = (B-A)/2.0 
C 1-P@INT GAUSS 
FZERE = FCSUM) 
RESULTC1) = 2.0*FZERO*DIFF 


* * 


¥ 


HHH RH HK 


I =@ 
I@LD = 0 
INEW = 1 
K=2 
ACUM = 0-0 
GB TS 30 
10 IF CK-EQ-8) GG TO 50 
K = K +1 
ACUM = 0.0 


C CONTRIBUTI@N FROM FUNCTI@N VALUES ALREADY COMPUTED. 
DO 20 J=1,I@LD 
Ist+i1 
ACUM = ACUM + PCI)*FUNCTCJ) 
20 CONTINUE 
C CONTRIBUTION FROM NEW FUNCTI@N VALUES. 
30 I@LD:= I@LD + INEW 
D@ 40 J=INEW,s 1OLD 
I=t+1 
X = PCL)*DIFF 
FUNCTCJ) = FCSUM#X) + FOCSUM-X) 
I=It+1 
ACUM = ACUM + PCI) *FUNCTCU) 
40 CONTINUE 
INEW = I@LD + 1 
Is I+t 
RESULTCK) = CACUM+PC1L)*FZERQ) *DLFF 
C CHECK FOR C@NVERGENGE. 
IF CABSCRESULTCK)-RESULTCK-1) )-EPSIL*¥ABSCRESULTC(K))) 60s 
* 605 10 
C CONVERGENCE N@T ACHIEVED. 
50 ICHECK = 1 
C NORMAL TERMINATION. 
60 NPTS = INEW + I@LD 
RETURN 
C TRIVIAL CASE 
70 K.= 2 
RESULTCL) 
RESULT(2) 
NPTS = 0 
RETURN 
END 


Hou 


FUNCTI@N QSUBCAs Bs EPSIL» NPTS» ICHECKs KELERR» F) 

THIS FUNCTION ROUTINE PERFORMS AUTQ@MATIC INTEGRATION 
OVER A FENITE INTERVAL USING THE BASIC INTEGRATION 
ALGORITHM QUADs TOGETHER WITHs IF NECESSAKYs A N@N- 
ADAPTIVE SUBDIVISION PROCESS. 

THE CALL TAKES THE FOKM 

QSUBCAs Bs EPSILsNPTS»s ICHECKs RELERR» F) 

AND CAUSES FCX) T@ BE INTEGRATED QVER ¢AsB) WITH KELATIVE 
ERROR HOPEFULLY NOT EXCEEDING EPSIL+ SH@ULD QUAD CONVERGE 
CICHECK=0) THEN Q@SUB WILL RETURN THE VALUE OBTAINED BY IT 
OTHERWISE SUBDIVISION WILL BE INV@KED AS A RESCUE 
Q@PERATION IN A N@N-ADAPTIVE MANNER. THE ARGUMENT RELERR 
GIVES A GRUDE ESTIMATE OF THE ACTUAL KELATIVE ERROR 
OBTAINED. 
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c 


c 
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THE SUBDIVISION STRATEGY IS AS FOLLOWS 
LET THE INTERVAL (AsB>) BE DIVIDED INT@ 2**#N PANELS 
N OF THE SUBDIVISION PROCESS. QUAD IS APPLIED FIRS} 

THE SUBDIVIDED INTERVAL ON WHICH QUAD LAST FAILED To 

CONVERGE AND IF CONVERGENCE IS NOW ACHIEVED THE REMA. ae 
PANELS ARE INTEGRATED.« SHOULD A CONVERGENCE FAILURE _ceno 
BN ANY PANEL THE INTEGRATION AT THAT P@INT IS TERMINATED 

AND THE PROCEDURE REPEATED WITH N INCREASED BY 16 THE 

STRATEGY INSURES THAT POSSIBLY DELINQUENT INTERVALS ARE 

EXAMINED BEFQRE WORKs WHICH LATER MIGHT HAVE TO BE 

DISCARDEDs IS INVESTED ON WELL BEHAVIED PANELS. THE 

PROCESS IS COMPLETE WHEN N@ CONVERGENCE FAILURE @CCURS @N 

ANY PANEL AND THE SUM OF THE RESULTS @BTAINED BY QUAD @N 

EACH PANEL IS TAKEN AS THE VALUE OF THE INTEGRAL. 

THE PROCESS IS VERY CAUTIQUS IN THAT THE SUBDIVISION QF 
THE INTERVAL (A»B) IS UNIF@RMs THE FINENESS OF WHICH IS 
CONTROLLED BY THE SUCCESS @F QUAD. IN THIS WAY IT IS 
RATHER DIFFICULT F@R A SPUKI@US CONVERGENCE TG SLIP 
THR@UGH. 

THE CONVERGENCE CRITERI@N OF QUAD (S SLIGHTLY RELAXED 
IN THAT A PANEL IS DEEMED TQ HAVE BEEN SUCCESSFULLY 
INTEGRATED IF EITHER QUAD CONVERGES @R THE ESTIMATED 
ABSOLUTE ERR@R COMMITTED ON THIS PANEL DOES NOT EXCEED 
EPSIL TIMES THE ESTIMATED ABS@LUTE VALUE OF THE INTEGRAL 
@VER CAsB)» THIS RELAXATION IS TO TKY T@ TAKE ACCOUNT OF 
A COMMON SITUATION WHERE @NE PARTICULAR PANEL CAUSES 
SPECIAL DIFFICULTY» PERHAPS DUE T@ A SINGULARITY @F SOME 
TYPE. IN THIS CASE QUAD COULD OBTAIN NEARLY EXACT 
ANSWERS ON ALL @THER PANELS AND S@ THE RELATIVE ERROR FOR 
THE TOTAL INTEGRATIQN WOULD BE ALMOST ENTIRELY DUE TO THE 
DELINQUENT PANEL. WITH@UT THIS CONDITION THE COMPUTATION 
MIGHT CONTINUE DESPITE THE REQUESTED RELATIVE ERR@R BEING 
ACHIEVED. 

THE OUTCOME OF. THE INTEGRATION IS INDICATED BY ICHECK. 


ICHECK=0 - CONVERGENCE @BTAINED WITHGUT INVOKING 
SUBDIVISION. THIS CORRESPONDS T@ THE 
DIRECT USE @F QUAD. 

ICHECK=1 = RESULT @BTAINED AFTER INVOKING SUBDIVISION. 

ICHECK=2 - AS FOR ICHECK=1 BUT AT SOME PGINT THE 


RELAXED CONVERGENCE CRITERION WAS USED. 
THE RISK @F UNDERESTIMATING THE RELATIVE 
ERRGR WILL BE INCREASED. IF NECESSARYs 
CONFIDENCE MAY BE RESTGRED BY CHECKING 
EPSIL AND RELERR FOR A SERI@US DISCREPANCY. 
ICHECK NEGATIVE 
IF DURING THE SUBDIVISION PROCESS THE 
ALLOWED UPPER LIMIT @N THE NUMBER @F PANELS 
THAT MAY BE GENERATED (PRESENTLY 4096) IS 
REACHED A RESULT IS @ETAINED WHICH MAY BE 
UNRELIABLE BY CONTINUING THE INTEGRATION 
WITHOUT FURTHER SUBDIVISION IGNORING 
CONVERGENCE FAILURES. THIS OCCURRENCE IS 
FLAGGED BY RETURNING ICHECK WITH NEGATIVE 
SIGN. 
THE RELIABILITY OF THE ALGORITHM WILL DECKEASE FOR LARGE 
VALUES @F EPSIL« IT IS RECQ@MMENDED THAT EPSIL SHOULD 
GENERALLY BE LESS THAN ABOUT 0.001. 
DIMENSI@N RESULT(8) 
INTEGER BADs QUT 
LOGICAL RHS 
EXTERNAL F 
DATA NMAX/4096/ 
CALL QUADCAs By» RESULTs Kz» 
Q@SUB = RESULT(K) 
RELERR = 0-0 
IF CQSUB-NE-0-0) RELERR = 
* ABSCCRESULTCK)-RESULTCK=-1))/QSUH) 
CHECK IF SUBDIVISION IS NEEDED. 
IF CICHECK-EQ-0) RETURN 


EPSILs NPTSs ICHECKs F) 


SUBDIVIDE 
ESTIM = ABSCGSUB*EPSIL) 
cz! 
RHS = «FALSE. 
N=! 
H=B-A 
BAD = 1 

10 @SUB = 0.0 

RELERR = 0-0 
H = H*0.5 
N=N +N 


INTERVAL ¢AsB) DIVIDED INT@ N EQUAL SUBINTERVALS. 
INTEGRATE OVER SUBINTERVALS BAD T@ (EAD+1) WHERE TROUBLE 
HAS OCCURRED. 
M1 = BAD 
M2 = BAD + 1 
OUT = 1 
G@ T® 50 
INTEGRATE OVER SUBINTERVALS 1 
20 Mi = 1 
M2 = BAD - 1 
RHS = «FALSE. 
@UT = 2 
G@ T@ 50 
INTEGRATE @VER SUBINTERVALS (BAD+2) TO Ne 
30 Mt = BAD + 2 


TO CBAD~-1) 


M2 2 N 
OUT = 3 
G@ T@ 50 


SUBDIVISION RESULT 
40 ICHECK = IC 
RELERR = RELERR/ABS(QSUB) 
RETURN 
INTEGRATE OVER SUBINTERVALS Ml 
50 IF (M1-GT.M2) G@ TO 90 
D2 80 JJ=M1,mM2 
J = JJ 


TQ Me, 


COLLECTED ALGORITHMS (cont.) 


HEEEHEHEEHN HEE 


0+ 450916538658 4741 4235E 00,0.13441 525524378 422036E 00s 
0-51 603282997079739697E-01 502200628 52937698902103E 00> 
0-99383196321275502221E 0050-17001719629940260339E-01, 
0-888 45923287225699889E 00,0.92927195315124537686E-0l1s 
0+ 6211029 4673722640294E 0050+17151190913639138079E 00s 
0 «22338 668642896688163E 0050+21915685840158749640E 00, 
0022551049979820668739E 0050+ 67207754295990703540E-Ols 
062580759809 6176653565E-01s0+10031427861179557877E 00s 
0 «8 43 456573932 11062 463E-025 0. 46462893261757986541E-Ol»s 
06857559200 49990351 154E-0150+109578 42105592 463824E 00/ 


DATA 


HRHEKRKKEKEHRKRHEKKHRKEKKESE 


PC29)4PC30)5PC31)sPC32)59P633)9PC34)sPC35)5 
P€36)sPC37)5PC38)sPC39)sPC40dePC 41s PC42)5 
PC43)2PC44)2PC45)9PC46)9PC47) eo PC 4839 PC49)2 
P(50)sPC51)+PC52)5P053)5PC54)sPC55)5PC56)7 


0+99909812496766759766E 
069815311 495537 4010687E 
029296548 574297 4005667E 
06836725938 16886873550E 
0 +702 49 6206491527078 61E 
025313197 4364437562397E 
0-33113539325797683309E 
01124889 4313318662575E 
0+1127552567207 68 69161E 


0020 +254478079156187441 S4E-025 
0030 +164460 4985438781093 4E-01>5 
0050 +35957103307129322097E-Ol» 
0050256979509 49 4123357412E-O1,s 
0050-76879 620 499003531043E-01> 
00+0+93627109981264473617E-Ols 
00+0+10566989358023480974E OO» 
00s 0+11195687302095345688E 00s 
0020 +336038771 48207730542E-O1>» 


0+129038001003512 6562 6E-01+0+5015713930589953741 4E-O1>» 
0+ 4217630 4415588548391 E-02+ 0623231 446639910269 443E-01» 
0 ¢ 428779 6002 500773449 3E=-0150-547892105279 628 65032E-Ol> 
0212651 5655623006801 1 4E-025 0 -8223007957235929 6693E-02» 
0017978551 568 128270333E-0 140628 4897547 45833548 613E-017 


DATA 


HRHRUHRKEHRAEKRKRHEHRHREHHK EK 


PC57)2PC58)5PC59)sPC60)9PC61)sPC62)5PC 63)» 
PC64)5PC65)sPC66)PC 67) sPC 68) 2PC69)sPC70)> 
PCT71)9PC72)9PC73)9PC74)9PC75)9PC76)2PC77)» 
PC78)ePC79)sPC80)sPCB1 Is PCB82)5PCE3)sPCB84)7 
0.38 4398102 49 455532039E-0150+ 46813554990628012403E-01>5 
O.5283 49 467901165198 62E-0120-55978 436510476319 408E-O1>+ 


0.99987288812035761194E 
0.99720625937222195908E 
0-988 68 4757547429 4799 4E 
0.97218287474858179658E 
0 69 463 428 583734029051 5E 
0-91037115695700429250E 
0.8 6390793819369047715E 
0-8069 4053195021761186E 
0.7397560 4435269 4758 68E 
0+662909 66002 478059 S46E 
0-57719571005204581 48 4E 
0+ 4836180269 458 4102756E 


0020-36322 1 4818 45530659 69E-03.5 
005 00257904979 468 S6882724E-025 
0050+ 611550682211 7246339T7E-02,5 
0020+10498246909621321898E-Ol» 
003 0+15406750 466559 497802E-01s 
0050-2059 4233915912711149E-Ol> 
0020 62586967932721 474691 1E-Ol» 
0050+31073551111687964880E-01» 
0050 +36064432780782572640E-Ol> 
0020+ 40715510116944318934E-O1» 
0050+ 4491 4531 65363219741 4E-Ols 
00s 0+ 48564330 406673198716E-017 


DATA 


HRHHREKRHHKKEHRHHHKHHH HEE 


PC 85)sePC 86)sPC 87)sPC 88)sPC B9sPC GOs PC 91)5 
PC 92)sPC 93)oPC 94.sPC 95)sPC 96)sPC 97)>ePC 98)> 
PC 99)sPC100)sPC101)sPC102)5PC103)sPC104)sPC105)> 
PCLO6)sPCLO7IsPCLOBISPCLOPISPCLIOISPCLILdsPCLI2)7 
0+38335932419873034692E 0020+51583253952048 458777E-O0ls 
0627774982202 182431507E 00s0+53905499335266063927E-01>» 
0-168235251552207 46498E 0020-55481 404356559363988E-01>5 
0-563 443130 46592789972E-0150-56277699831254301273E-Ol» 
056377 62836038 4717388E-0120+1680193857 41038 65271E-O0l» 
0+64519000501757369228E-0250 225078569 6529 49768 707E-Ol» 
0221088152 45726632879 3E-0230211615723319955134727E~-Ol» 
0-21 4389800125038 672 46E-0120 +2739 4605263981 432516E-Ol>s 
0+ 63260731936263354422E-035 06 4111503978 6546930472E-02,5 
0 «898927578 40641357233E-0250¢1 424487737291677 4306E-O1» 
02192199051247277 66019E£-0150-23406777 49531 4006201 E-O0ls 
0 +-26417473395058259931E-0150«27989218255238159704E-01,5 
0+18073956444538835782E-0350-128952 4082610417392 1E-02>5 
0+30577534101755311361E-0220-5249123 4548088591251E-02/ 


DATA 


RHEHREHEHKEHKRAHA HHH E 


POLED sPCL1L 4) sPCLISISPCLI6»PCLITISPCLIBIZPCLIDI» 
PC120) sPCLALIs PC122)5PC1239)sPC124) os PC125)» PC126)5 
PC127)5PC128)5PC129)5PC130)5PC131)sPC132)5PC133)> 
PO134)9PC135)5PC136)9PC137)sPC1382oPC139)ePC1 407 
0+77033752332797 418 482E-025 06102971 1695795635552 4E-Ol» 
0-12934839663607373 455E-0150+155367755558 43982 440E-O1> 
0-1803221 639039 1286320E-0150-20357755058 472159 467E-Ol» 
0-22 4572658268 1 6098 707E-0150-2 42821 65203336599358E-Ol» 
0-2579 1 62697 602 4229 386 E-019 00269527 49 6676330319 63E-Ol» 
02277 4070217827968199 4E-0150+281388 499156271 50636E-Ol» 


0 +999982 430354891 59858E 
0+99959879967191068325E 
0.99831 6635318 40739253E 
0599572 4104698 40718851E 
00991 49572117810613240E 
00985371 499598S2037111E 
009771 4151463970571 41 6E 
009 66637851558 41 656709E 


DATA 
*® PCL41)2PC142)5PC143)5PC144)5P0145)5PC146)9PC147)5 
PCL48)5PC149)5PC150)5PC151)5PC152)5PC153)5PC154)5 
PCLS5)ePC156)oPC157)oPC158 de PC1S9IsPC160)sPC 161)» 
PCL 62)2PC163)52PC164)5PC165)5PC166)5PC167)sPC168)7 


HHHREK HK HHH HE HK EK 


0095373000642 576113641E 
0 693832039777959288365E 
0 «9203 40025470012 42073E 
0 «8997448997769 4003664E 
068765134! 448 47052697 4E 
0 «85064449 47 6835027976E 
0 -82215625436498040737E 
0+ 79108 49337996 48361 43E 
0-7574839 6638051363793E 
0.721 42308537009891548E 
0.68298743109107922809E 
0+ 64227664250975951377E 
0-599 4039302 422 4289297E 
0-55449513263193254887E 


00» 0+5053609 5207862517 625E-04» 
00203777 4664632698 466027E-035 
00s 0 29333698 4854238150079E-03>» 
0020+16811 4286542146990 63E-025 
002025687649 4379 40203731E-025 
0020 +3572892783517299 649 4E~025 
0030+ 46710503721143217474E-02>5 
0050-58 43 4498 75835639507 6E-02/ 


0040+ 7072 4899954335554680E-025 
0020-83 428387539 681577056E-02,5 
0050 -96411777297025366953E-02>5 
0020 -1095573338 783790 1 648E-Ol» 
00s 0-12275830560082770087E-Ol» 
00s0+13591571009765546790E-Ol>5 
0020+¢14893641664815182035E-01> 
0020+161732187295777199 42E-015 
O0s0-+1 74219301 594641 73747E-Ol> 
0050+18631848256138790186E-01> 
0020+197954950 48097499 488E-O14 
00s 0+209058 51 4458 120238 52E-Ol» 
0050+21956366305317824939E-01,5 
0050 +229 409 64229387748 761E-O17 


468-P 4- 0 


DATA 


KRRKHHKEHRKHRKHKHKHHRHHHHREE 


PCL69)ISPCLTOISPCITIDsPOLT2ISPCL7T3ISPCL 74)» PCIL75)» 
PC1L76)s2PCO177)sPC178)oPC179)5PC180)sPC181)5PC182),5 
PC183)5PC184)5PC185)5PC186)ePC187)»PC188)sPC1B89)» 
PCI9IOISPCLI1LISPCLI2sPCI9I3IsPOLI4ISPOIDSI»POI9IGB)S 


0-507 68775753371660215E 
0+ 45913001198983233287E 
0. 40897982 122988867241E 
0-357 40383783153215238E 
030457 644155671 40433 4E 
0-250 67873030348317661E 
0+619589750271110015392E 
014042 423315256017 459E 


00s0-238540 52106038 540080E-015 
002 0+2 469052474448 7676909E-Ol, 
0020254457 699654647658 13E-Ol» 
0020 426115673376706097680E-Ols 
0020 +2669 6622927450359906E-Ol> 
00s0¢27185513229 6247918 I9DE“O1s5 
00s 00275797 49566481873035E-Ols 
0050627877251 476613701609E-Ol» 


0-8 44540 400837 10883710E-01+0+28076455793817246607E-Ol» 
02818 46489 4974569 4339E-0120+281 7631903301 6602131E-01, 
0.281888 1 4180192358 69 4E-0150+8 4009 6928 7051932635 4E-025 
0 «32259 500250878 68 461 4E-02, 0. 1253928 482647 488 4353E-O01>» 
0+1054407 6228 6331 67722E-02s 0+ 58078 616599775673635E-02» 
0-10719490006251933623E-0150+1369730263199071 6258E-017 


DATA 


KHHRHHKHKEHRKRHAHEHRHRHHH KE 


PC197)5PC198)2PC199)5P(200),PC201)»PC202)s P6203)» 
P6204)» P(205)5 P6206) »PC(207)sPC208)sPC209)sPC210)» 
PC211)sPC212)sPC213)9PC21 4) oP C215)» PC216)sPC217)s 
PC218)sPC219)5PC220)5PC221 sPC222) »PC223)eP(224)7 
0-31 63036608222 6447 689E-03s 0«20557519893273465236E-025 
0 « 449 46378920320678 61 6E-0250-7122 438 6864583871532E-02,5 
0 696099 525623638830097E-02s0-11703388747657003101E-01, 
0+ 132087366297 529129966E-0150+1399 4609 1276190798 52E-01» 
09037273 46587511 49261 E-0 4s 0 «6447620 4130572 477933E-03>» 
0-1528876705087765568 4E-023 0-262 4561727 40 4429 5626E-02,> 
00-3851 6876166398709241E-0250¢51 48558 4789781777618E-025 
0+ 6467419831803686727 4E-02,» 0+ 77 6838777792199 12200E-02, 
0-901 6108195195643! 600E-0250+10178877529236079733E-O1,+ 
0-112286329134080 49354E-0150-121 41082601 668299679E-O1, 
0-1289581348801211 469 4E-0150+1347637 483381 6515982E-01, 
013870351089 1398 40997E-0150+1 406942 4957813575318E-O1,> 
02515787038 4280661 489E-04,0+18887326 450650 491366E-03,5 
O+ 46918 492 42 478 50 4097 SE-035 0-8 40571 432710722 46365E-03/7 


DATA 


HHRHEEKHKEHRHHEHEHHHKH HE HE 


PC225)5PC226)5 PC 227) » PC228 ) » PC229)o PC2]30)s PCA31)» 
P€232)5PC233)5P(234)2PC235)5PC236) » P6237) 2 PC238)5 
P€239)5P€240)5PC241 )»PC242)5P0243)2P(244)sPC245)5 
PC246)sPC247)sP(248)s P6249) sPC250)sPC251 ds PC252)/ 
0-128 4382 4718970101768E-02.s 0-1 786446391 75864982 47E-02> 
0233552518 6057 1 608737E-02s 042921 724937917819 7538E-02s 
0-35362 4499771677773 40E-02s 0¢ 417141937698 40768 528E-02, 
0+ 482058886 48512683 47 6E-022 0+ 54778 6669391895082 40E-02, 
0661379152800 41 3850 435E-0250+679578550 488277339 48E-O2,5 
0+ 7 4468208 32 407591017 4E-02»0-80866093647888599710E-02s 
0 -871096507973208 687 36E-0250.931592 412606939 50932E-02, 
0.98977 4752 40 487 497 440E-025 0-10 452925722906011926E-Ols 
0«10978183152658912470E-0150+1147048211 469387 4380E-O1, 
0-1192702 60530192700 40E-01,40-123452623722 43838 455E-Ols 
0-1272288 4982732382906E-0150+ 1305783 66883530 488 40E-01» 
0613348311 463725179953E-01 5021359275661 48123959 10E-O1- 
0-1378987 478324093651 7E-0150139386257383068 5080 4E-015 
0-1 403822789 6908 623303E-0150+1 408815951 6508301065E-O1/ 


DATA 


RHEHEHEHHRHEHRHKEHKRHRHEEH * 


HEH HKHKHKHKRKEHRHRKEHHHHEX 


RRREKRHH HEH 


P€253)5PC254)5PC255) oP C256)» PC257)s PC2]58 ds PC]59)» 
P(260)5PC6261)sPC262)5PC263)sPC(264) »PC265)9PC266)5 
P€267) sPC268)sP(269)5P(270)5PC271)s PC272)» PCOT3)» 
P€274)5PC275)5PC276) » PC277) » PC278) es PC2TI) » PC 280) S 


0699999759 63797 48 46462E 
009999 4399620705437 576E 
0299976049092 44320 4733E 
0 «99938033802 502358193E 
0299874561 446809511 470E 
0 «99780535449595727 456E 
0.99651 41 4591 489027385E 
0.99 483150280062100052E 
0 .99272134428278861533E 
02990151370 40077015918E 
0-98709252795403406719E 
0983518657578 6327287 6E 
0-979 40628 1 6708 62 6838 1E 
069747344597 52 402 66776E 


DATA 


00s 0+ 6937936432 41082671 70E-052 
0050253275293 669780 613125E-045 
002061357549 109 492287 1973E-035 
0020-2 49212 400 48299729 402E-03. 
0050-3897 4528 447328229322E-035 
00% 0+55429531 493037471 492E-035 
00+0+74028280 42 44503330 46E-035 
0020-9 45361 516858525382 46E-03,> 
0030+116748 4117429959 4077E-02, 
0020 +1 4049079956551 44642 7E-025 
00,0+-16561127281544526052E-02,4 
0020-1919712971013872 4125E~025 
0050219 44069253638 388388E-02,5 
00» 0 2 4789 5822 66575679 307E-02/ 


PC281)sPC282)5PC283)5PC6284)sPC285)2PC28 6)sPC287)> 


PC288)5PC289)5PC290)sPC291 2 P6292) 9 PC293)sPC29 4)» 
P€295)sPC296) »PC297) + P6298 )s P6299) P6300) s PC 301)» 
P€302)5P¢6303)sP(304)sP(305) 5 P6306) »PC307) »PC308)/ 
0 969 48 4659502 45923177E 00504277219576459345099 40E-02, 
0+96364062156981213252E 00+0-3073018 434702578323 4E-02, 
0 «9571882161098 6096274E 0050338039799 108 69203823E-02, 
0-95011529752129487656E 0020+369337791 70256508 183E-02s 
0694241156519 10830598 1E 0050+ 401106872 40750233989E-02> 
0934068 43615772578800E 0020+ 43326409 680929828 545E-02» 
0+92507893290707565236E 000+ 46573172997568547773E-025 
0+91543758715576504064E 0020 498 43645647 65538 6012E-025 
0-9051 4035881326159519E 00+0-531308 66051870565663E-02, 
0 «89 418.456833555902286E 0020+564281810138 44441 S85E-02> 
0 -8825688 402473 4190684E 0020-.5972919565508 1 6580 49E-02, 
0+87029305554811390585E 000+ 63027734490857587172E-025 
0685735831088 623215653E 00+0+66317812 4290188789 41E-02s 
0-8 4376688267270860104E 00»0+ 6959361 409390 422939 4E-02/ 


DATA 


PC309)sPC3L10)sPC311)sPC312)5PC313)sPC3L 4s PC315)5 
PC316)5PC317)5PC318)5PC319)5PC320)oPC321)5PC322)5 
P€323)5PC324)sPC325)5PC326)5PC 327) »PC328)sPC329)5 
P€330)sP(331)sPC332) 5P6333)2PC6334)5P6335)9PC336)/ 

0 -829522194637401 40018E 0050+ 72849 479805538070 639E-02>5 
081 462878765513741344E 00s0+7607989 66571905658 32E-02> 
0+79909229096084140180E 0020-79279 4933429 48 491103E-02> 
0-78291939411828301639E 00+0-82443037630328 680306E-02,5 
0+76611781930376009072E 00+0-85565435613076896192E-025 


COLLECTED ALGORITHMS (cont.) 


c 
C 


c 


Cc 


c 


aangaanargaaaaa 


0-748 69 629361 693660282E 0020-.8864173209 482 49 42641E-02,> 
0+73066452124218126133E 00.0-9166711163560788 4067E-02. 
0-71203315536225203 459E 00.0.946368999383006529 43E-02,5 
0+ 6928137697791 470289E 0020-975465653631741 14611 E-U2, 
0¢67301883023041847920E 00s 0.100391720440568 40798E-O1s 
0-65266166S541001749610E 000-1031 68123309 47621 682E-O1,5 
0+63175643771119423041E 0020.10587167904885197931E-Ol> 
0-61031811371518640016E 000.108 498 4408933731 4099E-O01>s 
0.588362 434447662541 43E 00+0-11104461134006926537E-01/ 
DATA 
P6337)P6338)2P6339)sPC340)sPC341sPC342).P(343)5 
P0344)2P0345)2PC346)3P(347)+PC348)5P6349)2PC350)5 
P€351)sP€352)4PC353)5P6354)9PC6355)2PC356)9PC357)» 
PC358)5P€359)+PC360)sPC361)»P6362)5PC6363)sPC364)7 
0-56590588542365442262E 00,0-11350654315960596602E-01,5 
0 +5429 65666498311 49049E 00s0-1158807 40330439 52568E-O1s 
0+51955966153745702199E 0030-11816385890830235763E-O01s 
0+495706407918761 4601 7E 00.0.12035270785279 562630E-Ol>» 
0+ 471 42506587165887693E 0020+1224442 4981 611985899E-Ols 
06 44673538766202847374E 0020.1244356019071 4035263E-01,2 
0+ 42165768 662616330006E 0050+12632 403643542078 765E-Ol» 
0-39621280605761593918E 00,0.12810698163877361967E-Ol,> 
0 +370 4220879500782301 4E 0050+12978202239537399286E-Ole 
0+34430734159943802278E 00.0.13134690091960152836E-01, 
0317890812068 47668318E 00s0-13279951743930530650E-O1, 
02911951 485182 4668196E 0020.13413793085110098513E-Ol» 
0 +2642 43372 4109267619 4E 0020-1353603593495621361 4E-Ol» 
02370588 4558982972 721E 0020-13646518102571291 428E-O01/7 
DATA 
PC365)sPC366) »PC367) »PC6368)sP6369)5PC370)9PC371)9 
P(372)2P6373)5P0374)2P0375)»P6376)2PC377)2PC378)>» 
P(©379)sPC380)sPC381)/ 
0+20966523824318119A477TE 0050213745093 44300189 6632E-O015 
0+ 18208649675925219825E 0050.13831631909506428676E-O1> 
001543 4681148137810869E 00,0.13906019601325461264E-Ol,» 
0+12647058 437230196685E 0050.139681 5880651 6938516E-O1, 
0 698 48239 6598119202090E-0140.1 40179 68039 456608510E-01> 
0 +7040 697 60 428551 79063E-0150.1 4055362072 649964277E-015 
0+ 42269164765363603212E-0150-14080351962553661325E-015 
0+ 1409388 6410782 46261 4E-0150.1 40928450691 60408355E-01,5 
0+14094407090096179347E-O17 
ICHECK = 0 
CHECK FOR TRIVIAL CASE. 
IF CAeEQ-B) GO TB 70 
SCALE FACTORS. 
SUM = (BtA)/2.0 
DIFF = (B-A)/2e0 
1-PGINT GAUSS 
FZER€ = FCSUM) 
RESULTCL) = 2¢O*FZEROQ*DIFF 


HERE KHKEKRK HE 


KEKE HRHEKKEKHHRHARHH EE 


SHREK HKHEKHHH*E 


I= 0 

I@QLD = 0 

INEW = 1 

K = 2 

ACUM = 0-0 

G@ TS 30 

10 IF CK-eEQ-8) GO TO 50 

Ks K+ 1 

ACUM = 0.0 


CONTRIBUTION FROM FUNCTION VALUES ALKEADY COMPUTED. 
D@ 20 J=lsI OLD 
l=.t+41 
ACUM = ACUM + PCL)*FUNCT(J) 
20 CONTINUE 
CONTRIBUTIQ@N FROM NEW FUNCTI@N VALUES. 
30 I@LD = I1@LD + INEW 
D@ 40 J=INEWs I@LD 
I=sIl+#1 
X = PCL)*DIFF 
FUNCTCJ) = FCSUM#X) + FCSUM-X) 
I=I+1 
ACUM = ACUM + PCL) kFUNCTCJ) 
40 CONTINUE 
INEW = IQLD + 1 
l=.t+41 
RESULTCK) = CACUM+PCL) *#FZERQ) *DIFF 
CHECK FOR CONVERGENCE. 
IF CABSCRESULTCK)-RESULTCK=-1) )-EPSIL*ABSCRESULTCK))) 60. 
* 60+ 10 
CONVERGENCE N@T ACHIEVED. 
50 ICHECK = 1 
NORMAL TERMINATION. 
60 NPTS = INEW + I@LD 
RETURN 
TRIVIAL CASE 
70 K = 2 
KESULTC1) 
RESULT¢2) 
NPTS = 0 
RETURN 
END 


oo 
oo 


FUNCTION QSUBCAs Bs EPSIL» NPTSs ICHECKs RELERRs F) 

THIS FUNCTI@N ROUTINE PERF@RMS AUTOMATIC INTEGRATION 
OVER A FINITE INTERVAL USING THE BASIC INTEGRATION 
ALGORITHM QUAD, TOGETHER WITHs IF NECESSARYs A NON- 
ADAPTIVE SUBDIVISIQ@N PROCESS- 

THE CALL TAKES THE FORM 

QSUBCAs Bs EPSIL»NPTS» ICHECKs RELERRs F) 

AND CAUSES FCX) T@ BE INTEGRATED @VER CAsB) WITH KELATIVE 
ERRGR HOPEFULLY N@T EXCEEDING EPSIL+ SHOULD QUAD CONVERGE 
CICHECKS0) THEN @SUB WILL RETURN THE VALUE @BTAINED BY IT 
OTHERWISE SUBDIVISION WILL BE INV@KED AS A RESCUE 
@PERATION IN A N@N-ADAPTIVE MANNER. THE ARGUMENT RELERR 
GIVES A CRUDE ESTIMATE @F THE ACTUAL KELATIVE ERROR 
OBTAINED. 


AMQAAAAAARAMQAAANAAAANAAANAAANAAAAAANAAAANAAANAANANAAANAAAAANANAAAANNAA 


Cc 


Cc 


aaa 
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THE SUBDIVISION STRATEGY IS AS FOLLOWS 
LET THE INTERVAL ¢AsB) BE DIVIDED INT®@ 2**N PANELS AT STEP 
N OF THE SUBDIVISION PROCESS- QUAD [S APPLIED FIRST TO 
THE SUBDIVIDED INTERVAL @N WHICH QUAD LAST FAILED T@ 
CONVERGE AND IF CONVERGENCE IS NOW ACHIEVED THE REMAINING 
PANELS ARE INTEGRATED. SH@ULD A CONVERGENCE FAILURE @CCUR 
ON ANY PANEL THE INTEGRATI@N AT THAT POINT IS TERMINATED 
AND THE PROCEDURE REPEATED WITH N INCREASED BY 1+ THE 
STRATEGY INSURES THAT POSSIBLY DELINQUENT INTERVALS ARE 
EXAMINED BEFORE WORKs WHICH LATER MIGHT HAVE TO BE 
DISCARDED, IS INVESTED @N WELL BEHAVED PANELS- THE 
PROCESS IS COMPLETE WHEN N@ CONVERGENCE FAILURE @CCURS ON 
ANY PANEL AND THE SUM @F THE RESULTS @BTAINED BY QUAD ON 
EACH PANEL IS TAKEN AS THE VALUE OF THE INTEGRAL. 

THE PROCESS IS VERY CAUTIQUS IN THAT THE SUBDIVISION OF 
THE INTERVAL CAsB) IS UNIFORM» THE FINENESS OF WHICH IS 
CONTR@LLED BY THE SUCCESS @F QUADe IN THIS WAY IT IS 
RATHER DIFFICULT FOR A SPURKIQUS CONVERGENCE TQ SLIP 
THROUGH. 

THE CONVERGENCE CRITERION @F QUAD IS SLIGHTLY RELAXED 
IN THAT A PANEL IS DEEMED T@ HAVE BEEN SUCCESSFULLY 
INTEGRATED IF EITHER QUAD CONVERGES OR THE ESTIMATED 
ABSOLUTE ERROR COMMITTED ON THIS PANEL DOES NOT EXCEED 
EPSIL TIMES THE ESTIMATED ABSOLUTE VALUE @F THE INTEGRAL 
OVER (AsB>-« THIS RELAXATION IS TO TRY T@ TAKE ACCOUNT OF 
A COMMON SITUATION WHERE ONE PARTICULAR PANEL CAUSES 
SPECIAL DIFFICULTY» PERHAPS DUE T@ A SINGULARITY @F SOME 
TYPE. IN THIS CASE QUAD C@ULD OBTAIN NEARLY EXACT 
ANSWERS '@N ALL @THER PANELS AND S@ THE RELATIVE ERROR FOR 
THE TOTAL INTEGRATION WOULD BE ALMOST ENTIRELY DUE TQ THE 
DELINQUENT PANEL. WITH@UT THIS CONDITION THE COMPUTATION 
MIGHT CONTINUE DESPITE THE REQUESTED RELATIVE ERR@R BEING 


ACHIEVED. 
THE OUTCOME OF THE INTEGRATION IS INDICATED BY ICHECK. 
ICHECK=0 - CONVERGENCE OBTAINED WITHGUT INVOKING 
SUBDIVISIGN. THIS C@RRESP@NDS T@ THE 
DIRECT USE OF QUAD. 
ICHECK=1 - RESULT OBTAINED AFTEK INVOKING SUBDIVISION. 
ICHECK=2 - AS FOR ICHECK=1 BUT AT SOME POINT THE 


RELAXED CONVERGENCE CRITERION WAS USED. 
THE RISK OF UNDERESTIMATING THE RELATIVE 
ERROR WILL BE INCREASED. IF NECESSARY» 
CONFIDENCE MAY BE RESTGRED BY CHECKING 
EPSIL AND RELERR FOR A SERI@®US DISCREPANCY. 
ICHECK NEGATIVE 
IF DURING THE SUBDIVISION PROCESS THE 
ALLOWED UPPER LIMIT GIN THE NUMBER @F PANELS 
THAT MAY BE GENERATED (PRESENTLY 4096) IS 
REACHED A RESULT IS @BTAINED WHICH MAY BE 
UNRELIABLE BY CONTINUING THE INTEGRATION 
WITHOUT FURTHER SUBDIVISI@N IGNORING 
CONVERGENCE FAILURES+ THIS @CCURRENCE IS 
FLAGGED BY RETURNING ICHECK WITH NEGATIVE 
SIGN. 
THE RELIABILITY @F THE ALGORITHM WILL DECKEASE F@R LARGE 
VALUES OF EPSIL- IT IS REC@MMENDED THAT EPSIL SH@ULD 
GENERALLY BE LESS THAN ABOUT 0-001. 
DIMENSI@N RESULT(8) 
INTEGER BADs GUT 
L@GICAL RHS 
EXTERNAL F 
DATA NMAX/4096/ 
CALL QUADCAs Bs RESULTs Ks EPSILs» NPTS» ICHECKs F) 
QSUB = RESULTCK) 
RELERR = 0-0 
IF (QSUB-NE-0-0) RELERR = 
* ABS(CRESULTCK)-RESULTCK=-1))/QSUBD 
CHECK IF SUBDIVISION IS NEEDED. 
IF CICHECK-EQ-0) RETURN 


SUBDIVIDE 
ESTIM = ABS(GSUB*EPSIL) 
Ic = 1 
RHS = «FALSE. 
N = 1 
H=B-A 
BAD = 1 

10 QSUB = 0-0 

RELERR = 0-0 
H = H*0.5 
N=N+4+N 


INTERVAL (A»B) DIVIDED INT@ N EQUAL SUBINTERVALS. 
INTEGRATE OVER SUBINTERVALS BAD T@ (BAD+1) WHEKE TROUBLE 
HAS OCCURRED. 
“M1 = BAD 
M2 = BAD + 1 
@UT = 1 
G@ T@ 50 
INTEGRATE ®VER SUBINTERVALS 1 T@ (BAD-1) 
20 Mi = 1 
M2 = BAD - 1 
RHS = «FALSE. 
OUT = 2 
G8 T@ 50 
INTEGRATE OVER SUBINTERVALS CBAD+2) TQ Ne 
30 M1 = BAD + 2 
M2 = N 
OUT = 3 
G@ T@ 50 
SUBDIVISION RESULT 
40 ICHECK Ic 
RELERR RELERR/ABS(CQSUB) 
RETURN 
INTEGRATE @VER SUBINTERVALS M1 TO M2. 
50 IF (M1-«GT.M2) G@ TE 90 
D@ 80 JJ=M1sM2 
J = JJ 
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Arithmetic Over a fori: = step —p utp do 
weer 7 gin 
Finite Field [A1] elfli— 11] := fli—pl; 


for 7 := i — pstep 1 until i — 2 do e[f[/}] := f[j+1] 
C. Lam* and J. McKay + [Recd. 8 Oct. 1971] sao 
* Department of Mathematics, Caltech University, Pasa- 
dena, CA 91101 ¢ School of Computer Science, McGill 
University, P.O. Box 6070, Montreal 101, P.Q. Canada 


Key Words and Phrases: algebra; CR Categories: 5.19 
Language: Algol 


Description 

The rational operations of arithmetic over the finite field F,, 
of g = p(n > 1) elements, may be performed with this algorithm. 

On entry a[i] contains a; € F, withO <a; < p,i=0,..., 
n-—- 1, and x € F, satisfies the primitive irreducible polynomial 
P(x) = x + Soho ax, fq produces e; in e[i],i = —1,...,q— 2, 
where | + x! = x°* with the convention that —1 represents * and 
x* = 0. During execution the range of the a; is altered to —p < 
a; < 0, i =0...n — 1. The storage used is 2q + n + 6 locations 
including the final array e. 

With appropriate conventions for *, multiplication and division 
are trivial, and addition and subtraction are given by x* + x° = 
x“(1 + x*) fora < band x2 — x = x*@ + xc) xb when p ~ 2. 
For small values of q, it is suggested that addition and multiplica- 
tion tables be generated by this algorithm. A description of the 
method and its generalization to a multi-step process when n is com- 
posite is in [2]. A list of primitive irreducible polynomials is given 
in [1]. Further useful information (especially for p = 2) is to be 
found in [3]. 
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Algorithm 
procedure fg(p, 1, a, ¢); 
integer p, 1; integer array a, e 


begin 
integer array c(0:1—1], f[O:p f n—1]; integer gq, i, /, d, 5, w; 
q:i= pT a; 


for i := O step 1 until » — 1 doif a{i] ~ 0 then afi] := afi] — p; 
for i := 1 step 1 until x — 1 doc{[i] := 0; 
c(0):= 1; fll] :=0; flO] := —1: 
for i: = 1 step | until g — 2do 
begin 
d:=eln— 1]; s:=0; 
for 7 := n — 1 step —1 until 1 do 
begin 
w:=cfi-l])—dxX<alj]; w:=w-—-w+pXp; 
cljj:=w; S:=pXs+w 
end; 
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EXAMINE FIRST THE LEFT @R RIGHT HALF OF THE SUBDIVIDED 
TROUBLES@ME INTERVAL DEPENDING ON THE @BSERVED TREND- 


IF CRHS) J = M2 + ME = JJ 

ALPHA = A + HeCJ-1) 

BETA = ALPHA + H 

CALL QUADCALPHAs BETAs RESULTs Ms EPSILs NF» ICHECKs F) 


C@MP = ABSCRESULT(CM)-RESULTCM-1)) 
NPTS = NPTS + NF 
IF CICHECK-NE.1) G@ TO 70 
IF CCOMP-LEe«ESTIM) G6 T@ 100 
SUBINTERVAL J HAS CAUSED TROUBLE. 
CHECK IF FURTHER SUBDIVISION SHOULD BE CARKIED @UT. 
IF (NeEQ@eNMAX) G2 TO 60 
BAD = @*J - 1 
RHS = »FALSE. 
IF €C€J*2*€J/2))-EQ@-0) RHS = 
G@ TO 10 
60 Ic = -IABSC(IC) 
70 QSUB = Q@SUB + RESULT(M) 
80 CONTINUE 
RELERR = RELERK + COMP 
90 G@ TO (20330240)» OUT 
RELAXED CONVERGENCE 
100 IC = ISIGNC2sI1C> 
G@ Te 70 
END 


«TRUE. 


FUNCTI@N QSUBACAs Bs EPSIL» NPTSs ICHECKs RELERRs F) 

THIS FUNCTION R@UTINE PERFORMS AUTOMATIC INTEGRATION 
@VER A FINITE INTERVAL USING THE BASIC INTEGRATION 
ALG@RITHM QUAD T@GETHER WITHs IF NECESSARY AN ADAPTIVE 
SUBDIVISION PROCESS. IT IS GENERALLY MOKXE EFFICIENT THAN 
THE N@N-ADAPTIVE ALGORITHM QSUB BUT IS LIKELY T@ BE LESS 
RELIABLECSEE COMP +Jes14.18921971)+ 

THE CALL TAKES THE FORM 

@SUBACAsB. EPSILs NPTSs IGHECKs RELERK» F) 
AND CAUSES F(X) T@ BE INTEGRATED GVER (AsB) WITH RELATIVE 
ERROR H@PEFULLY N@T EXCEEDING EPSIL. SH@ULD QUAD CONVERGE 
CICHECK=0) THEN @SUBA WILL RETURN THE VALUE @BTAINED BY IT 
OTHERWISE SUBDIVISI@N WILL BE INV@KED AS A RESCUE 
OPERATION IN AN ADAPTIVE MANNER. THE AKGUMENT RELERR GIVES 
A CRUDE ESTIMATE @F THE ACTUAL RELATIVE ERKGR OBTAINED. 

THE SUBDIVISION STRATEGY IS AS FOLLOWS 
AT EACH STAGE OF THE PROCESS AN INTERVAL IS PRESENTED FOR 
SUBDIVISION CINITIALLY THIS WILL BE THE WH@LE INTERVAL 
€AsB))- THE INTERVAL IS HALVED AND QUAD APPLIED T@ EACH 
SUBINTERVAL. SH@ULD QUAD FAIL @N THE FIRST SUBINTERKVAL 
THE SUBINTERVAL IS STACKED FOR FUTURE SUBDIVISION AND THE 
SECOND: SUBINTERVAL IMMEDIATELY EXAMINED. SH@QULD QUAD FAIL 
@N THE: SECOND SUBINTERVAL THE SUBINTERVAL IS 
IMMEDIATELY SUBDIVIDED AND THE WH@LE PR@CESS REPEATED. 
EACH TIME A CONVERGED RESULT IS @BTAINED IT IS 
ACCUMULATED AS THE PARTIAL VALUE @F THE INTEGRAL. WHEN 
QUAD CONVERGES ON B@TH SUBINTERVALS THE INTEKVAL LAST 
STACKED IS CHOSEN NEXT FOR SUBDIVISI@N AND THE PROCESS 
REPEATED. & SUBINTERVAL IS N@T EXAMINED AGAIN ONCE A 
CONVERGED RESULT IS @BTAINED FOR IT S® THAT A SPURIQUS 
CONVERGENCE IS MORE LIKELY T@ SLIP THROUGH THAN F@R THE 
NON-ADAPTIVE ALGORITHM QSUB. 

THE CONVERGENCE CRITERI@N OF QUAD IS SLIGHTLY RELAXED 
IN THAT A PANEL 1S DEEMED T®@ HAVE BEEN SUCCESSFULLY 
INTEGRATED IF EITHER QUAD CONVERGES @R THE ESTIMATED 
ABSOLUTE ERR@R COMMITTED @N THIS PANEL DOES NOT EXCEED 
EPSIL TIMES THE ESTIMATED ABS@LUTE VALUE @F THE INTEGRAL 
@VER (As8). THIS RELAXATION IS TO TRY T@ TAKE ACCOUNT OF 
A COMMON SITUATION WHERE GNE PARTICULAR PANEL CAUSES 
SPECIAL DIFFICULTY, PERHAPS DUE T@ A SINGULARITY OF SOME 
TYPE. IN THIS CASE QUAD COULD OBTAIN NEARLY EXACT 
ANSWERS @N ALL OTHER PANELS AND S@ THE RELATIVE ERROR FOR 
THE TOTAL tNTEGRATIGN WOULD BE ALMOST ENTIRELY DUE T@ THE 
DELINQUENT PANEL. WITH@UT THIS CO@NDITIG@N THE COMPUTATION 
MIGHT CONTINUE DESPITE THE REQUESTED RELATIVE ERR@R BEING 
ACHIEVED. 

THE BUTCOME @F THE INTEGRATION IS INDICATED BY ICHECK. 


ICHECK=0 - C@NVERGENCE @BTAINED WITHOUT INVOKING SUB- 
DIVISION. THIS WOULD CORRESPOND T@ THE 
DIRECT USE @F QUAD. 

ICHECK=t - RESULT @BTAINED AFTER INV@KING SUBDIVISION. 

ICHECK22 - AS FOR ICHECK8! BUT AT SOME POINT THE 


RELAXED CONVERGENCE CRITERION WAS USED. 
THE RISK @F UNDERESTIMATING THE RELATIVE 
ERROR WILL BE INCREASED. IF NECESSARY» 
CONFIDENCE MAY BE RESTORED BY CHECKING 
EPSIL AND RELERR FOR A SERIOUS DISCKEPANCY. 
ICHECK NEGATIVE 
IF DURING THE SUBDIVISION PROCESS THE STACK 
OF DELINQUENT INTERVALS BECOMES FULL CIT IS 
PRESENTLY SET T@ H@LD AT MOST 100 NUMBERS) 
A RESULT IS OBTAINED BY CONTINUING THE 
INTEGRATI@N IGNGRING CONVERGENCE FAILURES 
WHICH CANNOT BE ACC@MM@DATED @N THE STACK. 
THIS OCCURRENCE IS FLAGGED BY RETURNING 
ICHECK WITH NEGATIVE SIGN. 
THE RELIABILITY @F THE ALG@RITHM WILL DECREASE FOR LARGE 
VALUES OF EPSIL. IT IS REC@MMENDED THAT EPSIL SH®BULD 
GENERALLY BE LESS THAN AB@UT 0.001. 
DIMENSIG@N RESULTC8)» STACKC 100) 
EXTERNAL F 
DATA ISMAX/100/ 
CALL QUADCAs Bs RESULT» 
QSUBA = RESULTCK>) 
RELERR = 0-0 
IF CQSUBA.NE.0-0) 
* RELERR = ABSC(CRESULTCK)-RESULTCK-1)) /QSUBA) 


Ky» EPSIL» NPTSs ICHECKs F) 
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C CHECK IF SUBDIVISION IS NEEDED 

IF CICHECK-EQ@-0) RETURN 
C SUBDIVIDE 

ESTIM = ABSCQSUBA*EPSIL) 

RELERR = 0-0 

QSUBA = 0-0 

Is=z=l 

Ic = 1 

SUB1 3 A 

SUB3 = B 

10 SUB2 # (SUB1+SUB3)*0.5 

CALL QUADCSUBLs SUB2s RESULTs Ks» EPSIL» NFs ICHECKs F) 

NPTS = NPTS + NF 

COMP = ABSCRESULT(K)-RESULTCK=-1)) 


IF CICHECK-EQ-0) GO TA 30 
IF CCOMP.LE-ESTIM) GO T@ 70 
IF CIS.GE-ISMAX) G@ TO 20 
C STACK SUBINTERVAL (SUB1sSUB2) FOR 
STACKCIS) SUBt 
Is = IS + 
STACKCIS) 
IS =Is + 
G2 T@ 40 
IC = -IABSCIC) 
QSUBA = QSUBA + RESULTCK) 
RELERR = RELERR + COMP 
40 CALL QUADC(SUB2s SUB3s 
NPTS = NPTS + NF 
COMP = ABSCRESULTCK)-RESULT(K-1)) 
IF CICHECK-£Q-0) GO TO 50 
IF (COMP.LE-ESTIM) GO TO 80 
C SUBDIVIDE INTERVAL (SUB2»sSUB3) 


FUTURE EXAMINATION 


SUB2 


— im it 


20 
30 


RESULTs Ks EPSILs NFs ICHECKs F) 


SUB1 = SUB2 
G@ TS 10 

50 QSUBA = QSUBA + RESULTCK) 
RELERR = RELERR + COMP 


IF CIS-EQ@-i) GO TO 60 
C SUBDIVIDE THE DELINQUENT INTERVAL. LAST STACKED 
Iis=I1Ss- 1 
SUB3 = STACKCIS) 
Is =1IS- t 
SUB1 = STACKCIS) 
G@ T2@ 10 
C SUBDIVISION RESULT 
60 ICHECK = IC 
RELERR = RELERR/ABS(QSUBA) 
RETURN 
CG RELAXED CONVERGENCE 
70 IC = ISIGNC2,1C) 
G@ T2 30 
80 IC = ISIGNC2, IC) 
G@ T@ SO 
END / 


0 
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Linear Systems with Almost 
Tridiagonal Matrix [F4] 


Milan Kubicek [Recd. 6 Dec. 1971, 8 May 1972, 12 Oct. 
1972, 12 Dec. 1972] 

Department of Chemical Engineering, Technical Uni- 
versity, Technicka’ 1905, Praha 6, Dejvice, Czecho- 
slovakia 


Key Words and Phrases: system of linear equations, almost 
tridiagonal matrix, sparse matrix 

CR Categories: 5 14 

Language: Fortran 


Description 
The program FAKUB is based on the method of modified 

matrices. In fact, FAKUB solves Tx = b where T = T + R, Tis 

tridiagonal (# Xn) and R is a matrix of low rank. Let us write 

R = RR," where R, , R, aren X m matrices. R, contains columns 

jis jo,-++,Jm of T — T, and Re is matrix of unit vectors 

Cj,» Cjny e+ y Cim- 

Subroutine FAKUB performs the following steps: 

Step 1. Determine n by m matrix V and vector y satisfying TV = R; 
and 7y = b, (The Thomas algorithm [1] is used to split T = LU 
and V and y are obtained by back solving m + 1 times. This 
algorithm is in principle the standard LU factorization of a 
tridiagonal matrix, see e.g. [2]. Note that we normalize L, 
while in [2] U is normalized.) 

Step 2. Form m by m matrix A = J + R,’ Vand vector w = Ry y. 

Step 3, Solve Az = w for z. 

Step 4. Calculate the solution x = y — Vz. 


The method described here will be particularly useful if m « n, 
however, it can be used advantageously also if m < n. 

Let us now define the matrix B, 1 X (m+1), in the following 
way: (1) the first column of the matrix B is the vector 5; (2) (k-++1)-st 
column of the matrix B is equal to the Ath column of the matrix 
Ri, i.e. to the jth column of the matrix 7 — T. This holds for 
k= 1,2,...,m. 

The description of the formal parameters of the subroutine 
FAKUB is given in the comments at the beginning. In accordance 
with the symbols used above we have 


Nwn, M~m-+1, 

SQ) ~ tien, t= 2,3,...,0 
DD) ~ ti, | hal iy rae 7 
AD~ tis, i=1,2,...,7-1 
BU, J) ~ 6:5, JPROM(K)~ jx, 


where T == {f;,;} and B = {6,,;}. 

Two parameters deserve to be discussed in detail. The param- 
eter EPS tests zero on the diagonal in the course of the Thomas 
algorithm. If | DU) | < EPS, then the value of ALFA is added to 
D(D and the RHS of B is modified so that the solution of the 
system remains the same; at the same time the statement in the 
form 


470-P 1- 0 


FAKUB INFORMATION ON ZERO ON LINE I 


is printed. During this modification the matrix B can be expanded 
in one column, which has to be considered when declaring MM. 
If during the modification the space assigned for array B is ex- 
ceeded, the statement 


FAKUB MANY REARRANGEMENTS, END OF FAKUB 


is printed, and after return the value of M is equal to —1. For 
practical problems this occurrence is a very rare event. The dimen- 
sion specifications A(20, 20), PS(20) can be changed if 20 is low; 
however, we must have M@ < 20, and M can always increase by one 
during the above mentioned modifications. If the dimension 
specification was low (see statement number 49) the statement 


FAKUB LOW DIMENSION, END OF FAKUB 


is printed, and after return 4 = —2. This can be corrected, e.g. 
by increasing the parameter ALFA. 

If the matrix 7 is singular (see the comment under statement 
label 5 in subroutine GAUSD, which has to be modified specifically 
with respect to the type of computer) the statement 


FAKUB SINGULAR MATRIX OF SYSTEM, END OF FAKUB 


is printed, and after return M = 0. 

After regular return (M > 0), the results are in the first column 
of the matrix B. 

If m = 0, the given algorithm is equivalent to the Thomas 
algorithm with the exception that it insures against zeros occurring 
on the diagonal. Subroutine GAUSD plays the role of a standard 
linear equation solver. Any other standard routine can be used, 
e.g. see [2]. 

The program was successfully run for calculations of distilla- 
tion columns (7 = 100, m = 3). It can also be applied in linear 
multipoint boundary value problems. 

Acknowledgment. The author would like to thank to Dr. Fred 
Gustavson of IBM Thomas J. Watson Research Center for his 
very valuable comment. 
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SUBR@UTINE FAKUBCNs Ss Ds Hs Bs Ms NN» MMs JPROM» ALFA» EPS) 
DIMENSION SCN)s DCN)» HCN)» BCNNs MM)» JPROMC20)5 AC20220). 
* WC20) 

S@LUTION OF SYSTEM @F LINEAR EQUATI@NS WITH MATRIX OF SPECIAL 

CALMOST TRIDIAG@NAL) TYPE 

NsNUMBER @F EQUATIONS 

S€2)5SC€3)s¢0e+=LOWER DIAG@NAL ELEMENTS 

DC 1)2DC2)s6ee=MAIN DIAG@NAL ELEMENTS 

HO1)sHC2)5¢-6e=5UPPER DIAGONAL ELEMENTS 

BC1s1)2BC251)52¢e=RIGHT HAND SIDES 

JPROMC1)» JIPROM(2) 5 e002 JPROMC(M-1)= INDICES @F UNKN@WNS F@R WHICH 

N@N-ZERB NONDIAGONAL COEFFICIENTS EXIST 

(BCI, J+1)s1T=1,N2CQLUMN GF COEFFICIENTS 

CWITH@UT DIAG@NAL ELEMENTS)» WHICH C@RRESP@NDS 

T@ UNKOWN WITH INDEX JPROMC J) 

M-1=NUMBER @F TRANSFERRED UNKN@WNS 

ALFA=N@N ZER@ PARAMETER USED F@R REARRANGEMENTS 

EPS=SCALE OF ZER@ DIAGONAL ELEMENT» DEPENDENT @N THE COMPUTER 

TYPE 

M-EQ-O AFTER RETURN: MATRIX WAS SINGULAR 

M.EQe-1 AFTER RETURN: MANY REARRANGEMENTS» SMALL VALUE @F MM 

Me-EQ--2 AFTER RETURN: LOW DIMENSION SPECIFICATION IN FAKUB 

WE WISH T@ SOLVE G*x=C WHERE G IS A N BY N MATRIX AND C IS A 

VECTOR. 

G = T + Re R = RI*R2ET. RI AND R2 ARE N BY ME MATRICES OF RANK 

Mie 

CR2T---R2 TRANSPOSE) THE METHOD OF MODIFIED MATRICES IS USED. 
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T IS A TRIDIAG@NAL MATRIX GIVEN BY INPUT VECT@RS Ss D AND H. 
B= (CsRID IS AN BY M MATRIX. RI IS A SET OF M1 COLUMNS OF G - 
Te : 
R@ 1S A SET OF M1 UNIT VECTORS SPECIFIED BY JPROM- 
FOR EFFICIENCY RANK Mt IS MUCH LESS THAN Ne 
KPR IS PRINTER DEVICE NUMBER 

DATA KPR/6/ 


99999 FORMATC//45H FAKUB SINGULAR MATRIX @F SYSTEM»END OF FAKUB//) 
99998 FORMATC//34H FAKUB INFORMATION ON ZER@ ON LINE, 15/7) 

99997 FORMATC//39H FAKUB MANY REARRANGEMENTS, END OF FAKUB//) 
99996 FORMAT(C//33H FAKUB L@W DIMENSI@N» END OF FAKUB//) 


Cc 


Cc 


c 
Cc 


aaaaa 


N}=N- | 
Ml =M- ft 
JUMP = 1 
FORM LeU AND L¥*(-1)*8, NOTE LeU = Te 
) oe | 
10 P = D«I> 
IF CABS(P)-LE.-EPS) G@ T@ 40 
20 HCI) = HCI1)/P 
Pl = S¢I+1) 
DB 30 J=1.M 
IF (BCIsJ)eEa-0-0) GO TB 30 
BCI» J) = BCI» J)/P 
BCI+#1l,J> = BCI+)4J5) - PI*BCI,J) 
30 CONTINUE 
DCI#1) = DCI+1) - Pl¥HCI) 
rer+t 
IF CI*LE-NI1) G@ T@ 10 
MATRICES L»U AND L**(-1)#B ARE DETERMINED HERE 
G@ Te 100 
40 WRITE (KPR»99998) I 
PIVOT DCI) NEARLY ZER@- ADJUST MATRICES T AND R1 SO THAT 
G REMAINS! EQUAL T@ T + Re NEW T HAS PIVOT DCI) NEAR TO ALFA. 
IF ¢CM1+«EQ@.0) GO T@ 60 
DO SO: J=teMt 
IF (JPROMCJ).EQ@-1) GO TB BO 
50 CONTINUE 
60 M=M+t t 
Mt= Mi + 1 
IF (M+GT-eMM) GO TO 200 
D@ 70: J=leN 
BC Jo.M) = 0.0 
70 CONTINUE 


BCIsM) = ~ALFA 
JPROMCMI) = I 
G8 Te 90 


BO BCIlsJ+tl> = BCIsJ+1)> ~ ALFA 
90 DCI) = OCI) + ALFA 
P = DCI) 
GO TB €204110)5 JUMP 
100 IF CABSCDC(N)).GT-EPS) GO T@ 110 


I=N 
JUMP i= 2 
6B Ta 40 


110 D@ £20 J=t.™ 
BON, J) = BCNsJ)/DCOND 
120 CONTINUE 
FORM UteC-])#Le8#(-1)eB = Te*C-1)4B. Te#(-1)*B = CY¥sV) 
D® 140 11=1,N1 
T=nN- li 
De 130 J=1.M 
BCI,J) = BClsJ>) - HCI *BCI+1, J) 
130 CONTINUE 
140 CONTINUE 
IF (M1-EQ-0) RETURN 
THE NEXT STATEMENT NECESSARY AS A AND W HAVE DIMENSION @F 20. 
IF (M1-GT-20) G@ TO 210 
FORM M1 BY M1 MATRIX A = LI + R2T*V AND Mt 
D@ 160 I=15M1 
11.5 JPROMCI) 
DQ; 150 J=1sM1 
ACIsJ) = BCILsJ+1)> 
150 CONTINUE 
WEI) = BCI1,1) 
ACIsI) = ACIsI) + 120 
160 CONTINUE 
SOLVE A*zZ = W FOR Z USING SUBR@UTINE GAUSD. 
CALL GAUSD(MIs As Ws M2, 20) 
IF (M2-EQ-0) GO TO 190 
F@RM S@LUTION VECTOR X = Y - V#¥Z. 
D@ 180 [=1s5N 
D@.170 J=2s™ 
BCIs31) = BCIs1) - BCIe J #WCU-1) 
170 CONTINUE 
180 CONTINUE 
RETURN 
190 WRITE (KPR»99999) 
M = 0 
RETURN 
200 WRITE (KPRs99997) 
M= -1 
RETURN 
210 WRITE (CKPRs 99996) 
by ace 
RETURN 
END 


VECTOR W = R2T#Y. 


SUBROUTINE GAUSDCNs As Bs Ms NN) 
DIMENSION ACNNs NN)» BCNNDs IRRC20)s XC20) 
SOLUTION OF SYSTEM OF LINEAR EQUATIONS 
N=NUMBER @F EQUATIGNS (NeLE-20) 
A=MATRIX’ OF SYSTEM 
B=RIGHT HAND SIDES 
MIF MeEQeO AFTER RETURNs THEN MATRIX A WAS SINGULAR 
“= 1 
ID = 1} 
D@ 10 I=lsN 
IRRCI) = 0 
10 CONTINUE 
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20 IR = 


D@ 60 I=1,N 
IF CIRRCI)) 602 30% 60 
30 D@ SO J=1s5N 
P = ABSCACIsJ)) 
IF (P-AMAX> 50s SO» 40 
40 IR =I 
Is=J 
AMAX = P 
so CONTINUE 
60 CONTINUE 
IF CAMAX.NE-0-0) GO T® 70 
C THIS CONDITION MUST BE SPECIFIED M@RE EXACTLY 
C WITH RESPECT T@ COMPUTER ACTUALLY USE) 
M= 0 
G@ Te 120 
70 IRRCIRD = IS 
D@ 90 I=1sN 
IF CI-EQ-IR +«OR- ACIZ1S)-EQe060) GB TB 9O 
P = ACIsIS)/ACIRs IS) 
D@ 80 J=15N 
IF CACIRs J) «NE0060) ACIO J) = ACIS J) - PRACIRs J) 
80 CONTINUE 
ACIs1S) = 0-0 
BCI) = BCI) - P*BCIR) 
90 CONTINUE 
ID= ID + 1 
IF CIDeLE-N) G8 TO 20 
De 100 I=15N 
IR = IRRCID 
XCIR) = BCID/ZACI,ZIR) 
100 CONTINUE 
D@ 110 I=1.N 
BCI) = xX¢ID 
110 CONTINUE 
120 RETURN 
END 


0 
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Description 
1. Introduction. The functions 


Re) 


Ey) -{ e'rndt, x > O, 2 an integer, 
1 


are referred to as exponential integrals. The special case 1» = 0 
gives Ey(v) = e*/x, and for 1 negative we have 


E, (x) = (—1)"(d/dx)lEp(x), 2 < 0, 


for which an algorithm was published previously [3]. Our concern 
here is with the case of positive integers 1. We present an algorithm 
which evaluates 


filx) = e Ey (x), 


to an accuracy of d significant decimal digits. 
2. Method of Calculation. The basic tool of computation is the 
well-known recurrence relation 


frailx) = (1 — xfi(x))/n. (2.1) 


We use it in two different ways, depending on whether 0 < x < 1 
orx > 1. 

On the first interval, we apply (2.1) for = 1,2,...,N— 1, 
assuming a real procedure f1 to supply the starting value /((x). 
The real procedure f1 furnished below obtains fi(x) accurately to d 
significant digits. It is based on the power series expansion 


fo) =e (Ser GPE y- moo), 


x>0, n=1,2,...,M 


(2.2) 


where y = .5772156649... is Euler’s constant. Since the terms in 
the infinite series of (2.2) are alternating in sign and strictly de- 
creasing in modulus (if 0 < x < 1), the partial sums of even order, 
So , Converge monotonically increasing to the limit value s,,, 
while those of odd order, 52x41 , converge monotonically decreasing 
to s,,. Consequently, if 5 = (Sex + Se%41)/2, we have | 5, — s,,| < 
tel 5, | aS SOON AS Soxy1 — Sox < eS. The last inequality, with 
« = 1074, is used as a termination criterion for the summation of 
the infinite series in (2.2). In order to prevent infinite loops in cases 
where d is specified unreasonably large for a particular computer, 
we use Rutishauser’s device [8, §36.3] of terminating the summation 
process also if the machine representations of sa, OF So.41, Cease 
to exhibit monotonic behavior. 
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The subtraction of y + /n(x) from the infinite series in (2.2) 
does not cause any appreciable loss of significance if x is restricted 
to the interval 0 < x < 1. This consideration was partly responsible 
for choosing x = 1 as the transition point. 

On the remaining interval, x > 1, we let 71 = (x), the integer 
closest to x, and compute f(x) by backward recurrence for 
1 <n <1, andby forward recurrence for #1 <n < N(ifN > nl), 
thereby maintaining optimal error propagation characteristics |2, 
Ex. 5.4]. The starting value /.,(x) for both recursions is obtained 
from Legendre’s continued fraction [7, p. 103] 


1 A 1 n+1 2 nt+2 3 . 
xt it xt If xt 14 xt 


Noting that the partial numerators and denominators are all posi- 
tive, it follows that the convergents of even and odd order approach 
the common limit value monotonically increasing and decreasing, 
respectively. Therefore, devices similar to those described above for 


eE,(x) = (2.3) 


fi(x) can be used to terminate the continued fraction evaluation. 


The convergents of even order are obtained as the successive con- 
vergents of the even contraction 


aq a2 a3 


Aa) eta, coe a 2.3 
eE n(x) ane Re vo ; (2.3e) 
where 

a=1, db =x4+n, 


a=(k—-1(iatk—2)\, _ 
by 2 Pi 2D fk eee 


while those of odd order are obtained as the successive convergents 
of the odd contraction 


7 ay a, as 
tE (x) = — —- — XL —*", 
e*E,, (x) = (: nee sig ae i 
where 


a=n,b=x+n+1, 


a=(k-—i(n+k—1)|k =2,3,4,.... 
b =x +n+ 2k -— 1 


(2.30) 


In either case, the successive convergents are evaluated directly by 
the third method described in [1, p. 29]. Overflow problems asso- 
ciated with the more common method based on the three-term 
recurrence relation for the numerators and denominators are thus 
avoided. 

The number of convergents required in (2.3e) and (2.30), to 
meet a particular accuracy requirement, was observed to be a non- 
increasing function of x on x > I, if we take n = (x). In contrast, 
the number of terms required in the infinite series of (2.2) increases 
with x. Some relevant information is collected in Table I. For values 
of x between 0 and I, the numbers listed represent the number of 
even (and odd) partial sums required in (2.2) to obtain fi(x) ac- 
curately to d significant digits. Similarly, for x > 1, we list the 
number of even (and odd) convergents of the Legendre continued 
fraction required to obtain e"£,(x) for n = (x) to the same ac- 
curacy. 

It will be noted that near the transition point x = 1 the con- 
tinued fraction evaluation is considerably more time-consuming 
than the series evaluation. The imbalance could easily be corrected 
by moving up the transition point. In so doing, however, the evalua- 
tion of fi(x) from (2.2) involves progressively more loss of sig- 
nificant accuracy. In our algorithm, we have decided to leave the 
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Table |. Number of Partial Sums in (2.2), and Convergents 
in (2.3), To Meet Specific Accuracy Requirements 


\d 
ri? 4 6&8 O24 6B DD 24 26 
o12 2 #3 3 3 4 46 4 5 S$ 5 6 6 
. 20) 2 3. 4 4 5 3 6 7 7 8 8 9 9 
42 3 4 5 6 6 7 8 8 9 9 10 " 
63 4 #5 S$ 6 7 8 8 9 0 0 Wi 22 
.80| 3 4 5 6 7 8 8 9 10 10 W 12 13 
1.00] 3 4 5 6 7 8 9 10 10 i $2 13 13 
1.0K4 11 20 31 45 62 81 103 128 155 185 «218 = =6251 
$.20| 4 9 17 27 39 53 70 88 109 132 157 185 214 
4.50) 4 9 15 23 34 45 59 714 92 100 060131 154 177 
2.00| 3 7 2 19 27 3% 46 58 7 86 {Ot 119 137 
$0012 5 7 WW 18 19 2% 9 38 42 49 57° 65 
10.00/2 4 6 8 0 3 6 9 23 27 31 35) 40 
20.00/22 3 4 6 8 1 12 14 6 1 22 24 27 
40.002 3 4 5 6 8 9 WW 13 14 16 18 20 
s0.00;1 2 #3 4 = $ 7 8 9 10 12 13 14 16 
transition point at x = 1, thus sacrificing efficiency in favor of 
accuracy. 


Alternatively, instead of the continued fraction (2.3) we could 
use a Taylor expansion about x=, when # is moderately large, and 
asymptotic. formulas, when x and # are large. This would result in 
a more efficient, but larger, program. It would also become neces- 
sary to store key values of £,(7) and thus to fix the precision d. 

No provisions are made to test for overflow or underflow con- 
ditions which may arise near the singularities x = 0 and x = 
of f(x). As for the first, overflow occurs only for extremely small 
values of x and is likely to be caught by the library subroutine for 
the logarithm. At the singularity at infinity underflow occurs only 
for extremely large values of x or 1, or both. 

3. Tests. Exponential integrals are tabulated by G.F. Miller [5], 
who gives (x -+ n)etE,(x) to nine significant digits in the range 
0 <x < 20 andO < x7 < .05, generally for 7 = 1(1)24. We 
tested our algorithm (with nmax = 24, d = 9) against these tables 
for selected x-values in the interval (0, 20], and for x! = .001, 
x! = .005(.005).05. No discrepancies were detected, other than 
occasional end figure errors of one unit. We also found ourselves in 
agreement with the initial portion (x < .6) of the 7-10S table in 
Kourganoff and Busbridge [4], but observed many end figure 
discrepancies (of up to 12 units) in the remaining portion of the 
table. A double check with Miller’s table indicates that these dis- 
crepancies. are due to small errors in the Kourganoff-Busbridge 
table. John W. Wrench Jr. has kindly supplied the author with 
25S values of E,(10), 1 = 1(1)25, which he computed in 40S 
arithmetic on a desk calculator. A double precision Fortran version 
of our algorithm (run. with nmax = 25, d = 25) reproduced these 
values correctly to all 25 significant digits. The same Fortran version 
of the algorithm was used with mmax = 1, d = 16, to compare 
against the 16S table of e*E,(x) given by Miller and Hurst [6]. 
For the test values x = .2(.05)1.0, x = 1.05, x = 1.5, x = 25, 
k = 1(1)6, no discrepancies were observed, except for x = .95, 
where the last digit was in error by one unit. All tests were performed 
on a CDC 6500 computer. 

4. Formal parameter list. 


x the argument in f(x); type real; 

nmax the maximum value N of #; type integer; 

d the desired number of significant decimal digits; type 
integer; 

tf an array of dimension [l:7max] holding the result f(x) 
in f[a]. 


Acknowledgment, The author is pleased to acknowledge valu- 
able suggestions of the referee, which resulted in a simpler and more 
flexible algorithm. 
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Algorithm 
procedure fsubn(x, nmax, d, f); 
value x, nmax, d; 
integer nmax, d; real x; array f; comment {1 :max]; 
comment This procedure evaluates fa(x) = e7E,(x) for x > 0, 
n = 1, 2,..., nmax, to an accuracy of d significant decimal 
digits. The results are stored in the array f. If x < 0, or nmax < 0, 
the procedure immediately sends control to a procedure recovery 
and exits from the procedure fsubn. A call is made to a real pro- 
cedure f1 which is to return fi(x) for 0 < x < 1, with an accuracy 
of d significant digits. A possible version of such a procedure is 
declared below; 
begin 
integer #2, 71, k, k1; 
real eps, ue, ve, we, wel, uo, vo, wo, wol, W, I, S; 
real procedure /1(x, d@); value x, d; integer d; real x; 
begin 
integer £, k1, k2; real eps, gamma, se, sel, so, sol, s, te, to; 
eps := 10 T (-—d); 
comment The constant gamma in the following statement 
should be supplied to at least d significant decimal digits. 
For the first 328 digits see [9]; 
gamma := .5772156649015328606065 12; 
se := 0; sel := —1.0;so:= to:= x;sol:= 2% x;8:= x/2; 


k1 :=1; 
for k := k1 while so-se > eps X s (A se > sel A so < sol do 


begin 
Sel := se; sol := so;k2:=2Xk; 
te := (k2—-1) X x X& to/(k2X%k2); se := se + to — te; 
to:=k2Xx X te/((k24+1) & (k24+1)); so := so — te + 


to; 
s := (se+so)/2;k1:= k1 +1 
end; 
fi := (s—gamma—In(x)) XK exp(x) 
end f1; 


if x < 0 V nmax < 0 then begin recovery; go to exit end; 
comment recovery is a procedure which the user has to supply 
and in which he may wish to print appropriate error messages; 
if x < 1 then 
begin 
A{l] := fl, a); 
for n := 1 step 1 until nmax — 1 do 
ffati] = A-xxXflal)/n; 
go to exit 
end; 
eps := 10 f (—a); 
nl := entier(x+.5); 
ue := 1.0; ve := we := 1/(x+n1); wel := 0; 
uo := 1.0; vo := —nl/(xX(x+nl1-+1)); wol := 1/x; wo := 
vo + wol; 
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w:= (we+wo)/2; 


kl :=1; 
for k := k1 while wo-we > eps X w A we > wel A wo < wol do 
begin 

wel := we; wol := wo; 


r:=eaml+ks:=r+xt+k; 
ue := 1/(L—kX(r—1) Xue/((s—2) Xs)); 


uo >= 1/(1—kXrXuo/(sXs—1)); 
ve := ve X (ue—1); vo := vo X (uo—1); 
we := we + ve; wo := wo + vo; 
w:= (we+wo)/2;k1:=kl1+1 
end; 


if nl < nmax then f[n1] := w; 
for 1 := ni — 1 step —1 until 1 do 


begin 

wi= (l-nXw)/x; 

if 2 << nmax then f[7] := w 
end; 


for 1 := nl step 1 until nmax—1 do 
fal) := —xx fli) /n; 


exit: end fsubn 
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Description 

1. Introduction 

The purpose of the procedures presented here is to determine 
the interpolating natural spline function S(x) of degree 2m — 1 for 
the set of data points (x, , yi), i = N1, NI + 1,...,N2 where it is 
assumed that xy, <Xwi41) <+°* < Xyq . The interpolating natural 
spline function S(x) with the knots xy, ,..., Xy2 has the proper- 
ties: (1) S(x) is a polynomial of degree 2m — 1 in each interval 


(xi, Xigi) i = NI,...,N2— 1. (il) S(x) and its derivatives DiS(x), 
j= 1,2,..., 2m — 2 are continuous in (xy, , Xyo). (If m = 1 the 
conditions: on the derivatives are not applicable.) (iii) DiS(xy,) = 
DiS(xy.) = 0,j =m,m+1,...,2m — 2if m > 1. (iv) S(x;) = 


y:. If N2 — N1 + 1 > m then there is a unique natural spline 
function which has the properties (i)-(iv). (See, e.g. Greville[{3, 4].) 
This spline function can be represented in the form 


S(x) = Ain + Ant? + Aial® + +++ + Airmail (1) 


with ¢ = x — x; forx; < « < xi41,i = MI,..., N2 — 1. Evi- 
dently A;o = y,. Three of the procedures calculate the other ele- 
ments A;; of the matrix of the coefficients of (1). 

The procedure NATSPLINE computes the coefficients of the 
natural spline in the general case described above. Because the com- 
putation requires the calculation of mth order divided differences 
of the data and these are subject to serious roundoff errors when m 
is large, it is recommended that this procedure not be used for large 
values of m, say, greater than seven. Moreover, the condition of the 
matrix which occurs in the system of equations which must be solved 
in the computation deteriorates rapidly with increasing m. 

Procedure NATSPLINEEGQ treats the case of equidistant knots 
x; . Ifthe knots are known to be equidistant, the use of this procedure 
results in considerable economy of computational effort. The time 
required for the calculation of the coefficients using NATSPLINEEQ 
is less than half that required if NATSPLINE is used. Note that in 
the case of equidistant knots it is not necessary to specify the values 
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of x; . The representation (1) is still used, but now t = (x — x:)/h 
where # = x;4, — x; , the constant spacing of the knots. 

Since the case of a cubic natural spline is of frequent occurrence, 
we give also a procedure, CUBNATSPLINE, which computes the 
coefficients in this special case. This procedure is very much faster 
than either of the other procedures when used with m = 2 to produce 
the same results. 

In some applications of cubic natural splines it is more.efficient 
to evaluate the spline approximation by means of the formula 


S(x) = yl — 1) + vigil (2) 
+ V(—2r + 37 — *)/6 + Wit — 1/6 

with tf = (x — x:)/hi for xi < x < Kia, Ai = Xin — Xi, V = 
h?S"(x:), W = h?S”(xis1), instead of using (1). The procedure 
CUBNATSPLINE2D calculates the second derivatives S”(x;) and 
the values of 4; which are the quantities needed to use (2). Since 
this procedure uses one less array than does CUBNATSPLINE, the 
saving of storage may be significant if the number of data points is 
arge. It is also slightly faster than CUBNATSPLINE. 

2. Method of Calculation 

(a) General case. The calculation of the coefficients is carried 
out in a numerically stable manner following a method described by 
Anselone and Laurent [1] specialized to the case of the interpolating 
natural spline as described above. The method is based on the use 
of minimum support B-splines [2, 4] to form a basis for the class of 
mth derivatives of the natural splines. For convenience of calcula- 
tion we use a normalizing factor different from that of Greville 
[4]. For a fixed m, our B-splines are defined by 


Milx) = M(x; Xk, Xky1,~. +, Xkgm) (3) 
where 
M(x;t) = ((-1)"/(m — IDG = xP (4) 


Here x," = x" if x > Oand 0 otherwise. M(x;xn , Xka1,.. +, Xkym) 
denotes the mth divided difference of M(x;t) with respect to t based 
on the arguments x, , Xk41,..-, Xkim - Mx(x) is of constant sign in 
(xk, Xk+m) and vanishes outside this interval. It is known that a 
natural spline function S(x) may be extended uniquely over the 
whole real Jine by imposing the continuity conditions (ii) at all 
points. Then outside (xn1, xn), S(x) is a polynomial of degree 
m — 1, and consequently D”S(x) vanishes outside (xn: , Xn2). It 
follows that D”S(x) has a unique representation of the form 


DS(x) = oy” de(2m — 1)! Mix). (5) 


The constants d; are found by solving the well-conditioned system of 
equations 


So Nia Hee, PGT SONI NOS (6) 
where 

Nee = NXt py Kis io Re Fe Ey Ss og Bee) (7) 
with 

Ns, ) = (s — Ne. (8) 


Here N;x are the elements of a positive definite band matrix with 
Nix = 0 if |i — k | > m. The solution of this system is obtained by 
Gaussian elimination without pivoting. 

In order to determine S(x), eq. (5) has to be integrated m times. 
We introduce two m-fold integrals of (2m — 1)! Mx(x): 


x 


E(x) = Qm — 1)! / pits | dx: Mi(x), (9) 
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and 


x x 
F(x) = Qm — nf dx... | dx M,(x). (10) 
+00 +00 
If we use the well-known form of the mth divided difference (see, 
e.g. Greville [4]) we can use (3) to obtain two alternative explicit 
formulas for M;(x). When we substitute these in eqs. (9) and (10), 
we obtain 


Ex(x) = Som, & - Xba /We'! (Xe) (11) 
and 

F(x) = ih (Xkei — x)" /wel (Xie) (12) 
where 

we'(x) = Dal[(x — xK)(X — Xe) 0+ (4X — Xkam)]. (13) 
Equation (11) shows that E;.(x) = 0, ifx < x,,and 

Ex(x) = (x — xx)?" /we' (xn), ifxnr <x < Xe. (14) 


Each time we pass a knot x;,; from left to right, there enters a term 
(x — Xn pi)2"—1/we' (xe41) which is added to the current polynomial. 
We can therefore write E;(x) in the form 


—1 is 
Ex(x) = yo! ene (X — Xvi)? in Xn SX < Xepey 


(15) 
From eq. (14) it is clear that 

fF = 0, 1,245.5 2m — 2, 

j = 2m—1. 


Ck,05 = 0 
= 1/wz' (xr) 


The other ex, ;,; are determined recursively. When e:,;-1,; have been 
calculated so that Ex(x) is determined by (15) in xx4:-1 < x < 
Xk,i, we use the complete Horner scheme to expand the poly- 
nomial in powers of x — xz,; and then add the appropriate term 
required to pass to the interval [x.4: , Xe4141). In the same way F x(x) 
may be written in the form 


F(x) = Doyo" ek, 1, G(X — Xkym~d4,  Xtam i < XS Xkgmaigi. (16) 


Again the ex,_:,; are determined recursively. It suffices to generate 
x, :,; and ex,_;,; for only a very limited set of values of k and ias we 
see below. 

By integrating eq. (5) m times, using (9) and (10), and noting 
that E;(x) = 0 for x < x;, and F;(x) = 0 for x > xj4m , we find 
that 


S(x) = Tx) + P(x) (17) 
where 
T(x) = recta oe GFX) + DOES* djEs(x) (18) 


with k arbitrary and P(x) a polynomial of degree m — 1 depending 
on k. 

We now let & assume the set of values best described by the 
Algol 60 for-clause 


for k:= N1 step m — 1 until N2 — m, N2 — m+ 1 do. 


For each such value of k we calculate 7(x) in the interval [xz , Xk4m—1). 
Then P(x) is uniquely determined by the interpolation conditions 


Yegt = T(xnyt) + Pen), $= 0,1,...,m—1. 


Newton’s divided difference formula is used in obtaining P(x). For 
each value of k it is necessary to calculate the values of e,,;,; only 
forp =k,k+1,...,k+m-— 2,i=0,1,...,4 +m— p—2, 
j=0,1,...,2m— 1, andforp =k—m+1,k—m-+2,..., 
k-1,i = -1, -2,...,k —m-—p,j =0,1,..., 2m — 1. 
Furthermore, p is restricted to lie between N1 and N2 — m. More 
details on the organization of the calculations are given in [5]. 

(b) Equidistant knots. The calculation of the coefficients in 
NATSPLINEEQ for the case of equidistant knots is carried out in 
the same manner as in NATSPLINE for the general case. However, 
there are a number of simplifications which result in considerable 
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Table I. Cubic Natural Spline. 
Five nonequidistant knots. Coefficients calculated by 
NATS PLINE 


x S(x) S’ (x) S”" (x)/2 S”'(x)/3: 
—3.000000 7.000000  —1.999998 0 0.9999998 
11.00000 9.999996 5.999997 0.9999998 

—1.000000  11.00000 10.00000 5.999999  —1.000000 

25.99998 18.99998 2.999999 —1.000000 

0 26.00000 18.99997 2.999995 —1.999996 

55.99995 —16.99994 —14,99997 —1.999996 

3.000000 56.00000 —16.99998 —14,99999 4.999996 

29.00003 —31.99995 0 4.999996 

4.000000  29.00000 


economy of computational effort. It is not necessary to specify the 
x; . Hence we can assume that x; = i. It is convenient to modify 
eq. (6) slightly. First of all the right-hand sides reduce to 
A”™y;/m! where A”y,; are ordinary mth differences and require no 
divisions in their calculation. In the second place it can be shown 
that Nj, is the 2mth ordinary difference of s—1"'/((—1)™(m!)?) 
based on the valuess —- t=i-—k—m,...,i—k +m. Were- 
scale M(x; t), Mi(x), Ex(x), F(x) by multiplying their representa- 
tions in eqs. (4), (11), and (12) by (—1)”m!. Thus d; is rescaled by 
dividing it by (—1)”m!. We denote the rescaled coefficients by d;*. 
If we let Ni be the 2mth difference of /*”' based on the values 
j=i-k-—m,...,i—k +m, then Nx = Nix(—1)"(m!)? and 
eq. (6) becomes 


eat Ni di*t = Amy;, i= Nl,...,N2—m. 


For large values of m, the calculation of Nix by the obvious differ- 
encing technique involves serious cancellation and may introduce 
errors in the computed values of Ni. It can be shown that these 
differences satisfy the recurrence relation 


(19) 


Ane) = (a tar + 1E7) — jantGt”). (20) 
We need to calculate these quantities only for 1 = 2m at 
jz=i-k-—mfori-k = —-m+4+1,...,0,1,...,m-— lie, 
forj = —2m + 1,..., —2, —1. In this range, the two weight fac- 


tors 2m + jand —/are both positive, one ranging from 1 to 2m — 1 
and the other from 2m — 1 to 1. Thus no cancellation can occur 
when formula (20) is used for calculating Nv. . 

A further simplification occurs because the coefficients of 
E;(x) and F;(x) are independent of &. It therefore suffices to com- 
pute the coefficients of Eo(x) and Fo(x). Moreover Fo(x) = 
(—1)"E,(m — x). Thus we have only to calculate the values of an 
array e;; fori = —m+1,..., —1,0,1,...,.m — 1 andj = 
0, 1,..., 2m — 1. This is a major saving over the calculations for the 
general case. The rest of the calculations are carried out as in the 
general case. 

(c) Cubic spline. Much computational labor is saved by treat- 
ing this as a special case instead of using the general program with 
m = 2. We Start with eq. (1) setting m = 2. By imposing the condi- 
tions (ii), (ili) and (iv) at the knots, we get relations between the 
coefficients, which yield a tridiagonal system of equations for 
Ain(= S”"(x;)/2, the coefficients of 7. This tridiagonal system is 
solved by Gaussian elimination. In the procedure CUBNATSPLINE.- 
2D the values of S”(x;:) and A; = xXi41 — x: are output. In the pro- 
cedure CUBNATSPLINE the values of 4, , Aig and A;j3 are output. 


3. Tests 

These procedures have been tested in Alcor Algol on the Tele- 
funken TR-4 at the Rechenzentrum of the Technischen Universitat 
Miinchen and in Algol W on the IBM 360/67 at the Stanford Com- 
putation Center. The latter tests included timing tests of the pro. 
cedures over a range of values of m up to 7 and number of knots N = 
N2 — N1 + 1 up to 100. The time was found to be approximately 
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proportional to the number N of knots and to the square of m. The 
time 7 in seconds for the execution of the procedure NATSPLINE 
was found to be approximately 


T = N/60 (0.117m? — 0.296m + 0.512). 

For NATSPLINEEQ the time was approximately 

T = N/60 (0.014m? + 0.023m + 0.029). 

For CUBNATSPLINE the time was approximately 

T = .045N/60 = .OOO7SN. 

For CUBNATSPLINE2D the time was approximately 
T = .03N/60 = .OOOSN. 


In order to check the accuracy of the coefficients calculated for 
the spline approximation S(x), the values of D*S(x)/k!, k = 
1, 2,..., 2m -— 2 were calculated at the right-hand endpoint of 
each subinterval [x; , x;,1) and compared with their values (the co- 
efficients in:eq. (1)) at the left-hand endpoint of the next subinter- 
val. It was found that the accuracy deteriorated somewhat for 
larger values of m, although for m = 7, with the data used, the largest 
relative differences were observed to be approximately 0.0018. 
Table I shows the results of a typical run using NATSPLINE for 
five nonequidistant knots with m = 2. The first line of each box 
gives the tabulated quantities at the given value of x, which is the 
left-hand endpoint of the subinterval, and the second line of the 
box gives the tabulated quantities at the right-hand endpoint of the 
same subinterval. The close agreement of these quantities D*S(x)/k!, 
k = 1, 2,..., 2m — 2 shows that the spline function and its de- 
rivatives satisfy the specified continuity conditions. This is a good 
indication of the correctness of the results. 


References 

1. Anselone, P.M., and Laurent, P.J. A general method for the 
construction of interpolating and smoothing spline functions. 
Numer. Math. 12 (1968), 66-82. 

2. Curry, H.B., and Schoenberg, I.J. On Polya frequency func- 
tions. IV. The fundamental spline functions and their limits. J. 
Analyse Math. 17 (1966), 71-107. 

3. Greville, T.N.E. Spline functions, interpolation and numerical 
quadrature. In Mathematical Methods for Digital Computers, Vol. 
I. A. Ralston and H.S. Wilf (Eds.) Wiley, New York, 1967. 

4. Greville, T.N.E. Introduction to spline functions. In Theory 
and Applications of Spline Functions. T.N.E. Greville (Ed.) Aca- 
demic Press, New York, 1969, pp. 1-35. (Pub. No. 22 Mathe- 
matics Research Center, U.S. Army, U. of Wisconsin.) 

5. Herriot, John G., and Reinsch, Christian H. Algol 60 pro- 
cedures for the calculation of interpolating natural spline func- 
tions. Tech. Rep. STAN-CS-71-200, Comput. Sci. Dep., Stanford 
U. 1971. 


Algorithm 
procedure NATSPLINE(N1, N2, m, x, A); 
value N1, N2,m; integer N1, N2, m; 
array x, A; 
comment NATSPLINE computes the coefficients of a natural spline 
S(x) of degree (2Xm-—1), interpolating the ordinates y[i] at 
points x[é], i = N1 through N72. For xx in [x[é], x[i+1]): S(xx) = 
Afi, O] + Afi, 1] X t+... + All, 2Xm—-1] Kt T (2Xm—-1) 
with tf = xx — x[i], 
Input: 
N1, N2 subscript of first and last data point 
m 2Xm-—1 is the degree of the natural spline, 
admissible values range from 1 to N2 — N1 + 1, 
recommended values are not greater than seven (say) 
x[N1:N2] contains the given abscissas x[{i] which must be 
strictly monotone increasing 


472-P 3- 0 


A[N1:N2, 0:2Xm-—1] contains the given ordinates as zero-th 
column, i.e. A[i, 0] represents y{i], 
Output: 
A[(N1:N2, 0:2 m-— 1] the coefficients of the natural spline as de- 
scribed above (the zero-th column is unchanged and no val- 
ues are assigned to the last row of A); 


ifm >0Am < N2 — Mi + 1 then 
begin 


integer i, j,k, /, /1, m1, m2, mm, n, mk, k\, jj, kk, J; 
real f, z, w; 
array C[0:2Xm], D[N1:N2], E{O:n—1, 1—m:m—1, 0:2Xm—1], 
P(0:m], Q[0:m, N1:N2); 
comment i-j-entry of band-matrix stored in A[i, j/—-i+1], right- 
hand stored in vector D; 
mi :=m—1;m2:=m—2; mm:=2Xm—1; n:=N2—m; 
for j := N1 step | until N2 do 
begin 
l:=j+m; iff > N2thenl := N2; 
for i := j step | until / do Q{i—/, j] :== (x[i] — x{/]) T mm 
end; 
for i:= NI + 1 step 1 until N2 do 
begin 
!:=i-—WN1; if/ > mthen/ := m, 
for j := Ostep 1 until / do P[j] := Q[/, i—J/); 
for kK := 1 step 1 until m do 
begin 
A:=i+tk-—N2; if/1 < 1then/1 := 1; 
for j := /step —1 until /1 do 
PU] := (PU—1—PLU)/(xlé—7+4) — xli—i) 


i 


end; 

for j := /1 step 1 until / do Q[j, i—/) := Ply] 
end; 
for / := N1 step 1 until n do 
begin 


for i := O step 1 until m do P[i] := Qi, /]; 
for k := 1 step 1 until m do 
begin 
A:=N1l—j+k; if/i < 1then/1 := 1; 
for i:= m step —1 until/1 do 
P(i) := (P{i]—Pli—1))/(«lit+/]—xli+/—£)) 


end; 

for i := /1 step 1 until m do Q{i, /) :== P{i] 
end; 
for j := 1 step 1 until m do 
begin 

l:=n—jt+1; 


for i := N1 step 1 until / do A[i, j} :== Q[m—/+1, i+/j-—1] 
end; 
for i := N1 step 1 until N2 do D[i] := Ali, 0]; 
for k := 1 step 1 until mdo 
begin 
1:= N2-k; 
for i := N1 step 1 until / do 
Di) := (Dli+1)— D[i))/lit+k] —x[i)) 
end; 
comment Gaussian elimination without pivoting, rational 
Cholesky; 
for i := N1 step 1 until 7 do 
begin 
l:=i+ml; if/>nthen/:= 4; 
for /:= i+ 1 step 1 until / do 
begin 
comment f:= /-i-entry/i-i-entry, symmetry; 
f:= Ali, j-—i+1)/Ali, 1); 
Dij):= Dif) -—f xX Df; 
for k := jstep | until /do 
Alj,k—j+1] := Alj,k-j+1]) — f X Ali, k-i+1] 
end j 
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end i; 
comment Back substitution; 
for i := nstep —1 until N1 do 


begin 
l:=n—i; if] > mthen/ := m1; 
f:= D{ij; 


for 7 := 1 step 1 until/ do f:= f— Afi, j+1] < D[i+)); 
D{i] := f/Ali, 1) 
end i; 
comment Now compute the coefficients of the natural spline; 
if m1 = 0 then 


begin 
for kK := N1 step 1 until 7.do 
Ak, 1] := —D[k]/(x[k +1] —x{k]) 
end 
else 
for k := N1 step m1 until 1, N2 — m1 do 
begin 


comment Now compute coefficients of the two sets of m-fold 
integrals of the minimum support splines scaled with 
(2><m— 1) factorial; 
1:= m2; iff] >n—kthen/:=n — k; 
for kk := O step 1 until / do 
begin 
mk := mi — kk; 
for j := O step | until mm do C[/] := 0; 
for i := 1 step 1 until mk do 
begin 
kl :=k+kk; 
wi= 1; 
for 7 := O step 1 until m do if 7 ~ i -- 1 then 
wi=w XX (x[kAl+i-1)—x[k1+]); 
C[mm] := Cimm] + 1/w; 
for j := O step 1 until mm do 
if i < mk then 
begin 
zi= x{ki+i] — x(kl+i-1]; 
for j := 1 step 1 until mm do 
for j1 := mm step —1 until j do 
Cijij—1] := Cll] X z+ Cljl—1] 
end 
end 
end; 
l:= ml; if] >k — Nithen/:=k — M1; 
for kk := 1 step 1 until / do 
begin 
mk := m — kk; 
for j := O step 1 until mm do C[/] := 0; 


for i := 1 step 1 until mk do 
begin 

kl := k — kk; 

wil; 


for j := O step 1 until m do if 7 ~ m — i+ 1 then 
w:i=w X (x[kl+m—i+1] — x[k1+/]); 
C[mm] := C[mm] — 1/w; 
z:= x[kl+m—i] — x(kit+m—i+]); 
for / := 1 step 1 until mm do 
for jl := mm step —1 until j do 
Cijl—1) := Cll] X z+ Cijl—J]; 
for j := O step 1 until mm do E[kk —1,—i,/] := CU] 
end 
end; 
for / := 0 step 1 until 2 do 
begin 
comment Coefficients of the spline 7(x) of degree (2 m-—1) 
in the interval [x[k-+/], x[k+/+1]) stored as (k+/)-th 
row of A, P(x) = y(x) — T(x) at the points x = x[k] 
through x[k-++m— 1] stored in P; 
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for j := O step 1 until mm do C{j] := 0; 
for i := / — ml step 1 until / do 
begin 
adie Jk Py; 
if i < 0 then begin; := j — m; jj := mi — jjend; 
if7 > N1 Aj <n then 
begin 
f:= Dil; 
for jl := 0 step 1 until mm do 
Cif} := CIA) +f x Ei 
end / 
end j; 
for j := 1 step 1 until mm do A(k+/,j] := CU/]; 
P(l] := A[k+/,0] — C[0] 
end /; 
f:=0; z:= x[k+m1] — x[k+m1—1]; 
for 7 := mm step —1 until 0 do f:= f X z + CL}; 
Pi{m1] := Alk+m1,0] — f; 
comment Compute P(x) from its ordinates at the points 
x = x[k] through x[k+m—1] using Newton’s divided 
difference scheme for interpolation; 
for i := 1 step 1 until m1 do 
for j := m1 step —1 until ido 
Pij) := (PL{)—PU—1))/@lk +/1—x1k +7—- 2); 
for / := O step 1 until m2 do 
begin 
comment Add coefficients of P(x) in interval 
[x[k-+/],x[k+/+1]) to those of T(x) stored in (k+/)th 
row of A; 
for j := O step 1 until m1 do C[/] := PL]; 
for i := m2 step —1 until 0 do 
for j := istep | until m2 do 
Cl] := Cl) + lk+4—x[k+i]) X Cli+4); 
for 7 := 1 step 1 until m1 do 
Ak +1,j] := Alk+/j\ + Cli] 
end / 
end k 


end NATSPLINE; 


procedure NATSPLINEEQ (N1,N2,m,A); 


value NI, N2,m; integer N1,N2,m; 
array A; 


comment NATSPLINEEQ computes the coefficients of a natural 


spline S(x) of degree (2m-—1), interpolating the ordinates y{/] 
at equidistant points x[i], i = N1 through N2. For xx in 
[x[é], x[i+1]): S(xx) = Ali,0] + Ali l] X t+.... 
+ A(i,2xm—1] XK tt (2Xm-—1) with 
t = (xx—x{[i})/(x[i+1]—-x[i]) from [0,1), 
Input: 
N1, N2 subscript of first and last data point 
m 2Xm-—1 is the degree of the natural spline, 
admissible values range from 1 to N2 — N1 +1, 
recommended values are not greater than seven (say) 
A[N1:N2,0:2<m— 1] contains the given ordinates as zero-th 
column, i.e. A[i,0] represents y{i], 
Output: 
A[N1:N2,0:2 Xm-— 1] the coefficients of the natural spline 
as described above, (the zero-th column is unchanged and 
no values are assigned to the last row of A); 


ifm >OAm < N2— M1 +1 then 
begin 


integer i, j, j1,k,/, m1, m2, mm,n;_ real f; 

array C[0:2Xm], D[N1:N2], E[lL—m:m—1,0:2*m-—1], P[0:m); 

comment i-j-entry of band-matrix stored in Ali,j—i+-1], 
right-hand stored in vector D; 


mi :=m—1;m2:=m—2; mm:=2Xm—1; 
n:= N2—™m; 
for i := 1 step 1 until mm do 
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begin 
Cli] := 1; 
for 7 := i — 1 step —1 until 2 do 
Cl) := G+1-/) X CH-1]) +7 X Chi] 
end i; 
for i := N1 step 1 until N2 do D[i] := A[i,0]; 
for j := 1 step 1 until m do 
begin 
f:= C[m+1—j]; /:= N2 —j; 
for i := N1 step 1 until 1 do A[i,j] := f; 
for i:= M1 step 1 until /do D[i/] := Dfi+1] — D{i] 
end /; 
comment Gaussian elimination without pivoting, rational 
Cholesky; 
for i := N1 step 1 until » do 
begin 
f:=i+mi; if] > nthen/ := 7; 
for 7 := i + 1 step 1 until / do 
begin 
comment /: =/-i-entry/i-i-entry, symmetry; 
f= Ali,j—i+t]/Aliy; 
Dijj:= Dijl- f xX Diil; 
for k := / step 1 until / do 
Alj,k—j+1] := Alj,k—j+1] — f X Ali,k-i+1] 
end / 
end /; 
comment Back substitution; 
for i := n step —1 until N1 do 


begin 
l:=n—i;if] > mthen/ := ml; 
f:= Dfil; 


for; := 1 step 1 until/ do f:= f — Ali,j+1] X D[i-/); 
Dii| := f/Ali,1 
end i; 
comment Now compute coefficients of the two m-fold integrals 
of the minimum support spline scaled with (2Xm—1) 
factorial; 
l:=1; 
for 7 := O step | until mm do C[j] := 0; 
for i := 1 step 1 until 1 do 
begin 
C[mm] := C[mm] + J; 
l:=1X (i-1—m)/i; 
for 7 := O step 1 until mm do E{i—1,j] := CL]; 
for j := | step 1 until mm do 
for k := mmstep —1 until j do C[k—1] := C[k—1] + CIk]; 
for 7 := Ostep 1 until mm do E[—i,j] := Cl] 
end i; 
comment Change sign; 
for 7 := m1 step —2 until 0, m + 1 step 2 until mm do 
for i := —ml] step 1 until —1 do Efi,j] := —£{i,/); 
comment Now compute coefficients of the natural spline; 
if m1 = O then 
begin 
for k := N1 step 1 until 1 do A[k,1] := D{k] 
end 
else 
for k := NI step m1 until 71, N2 — m1 do 
begin 
for / := O step 1 until 2 do 
begin 
comment Coefficients of the spline 7(x) of degree (2 m—1) 
in the interval [k-+/,k-+/+1) stored as (k+/)-th row of 
A, P(x) = y(x) — T(x) at the points x = & through 
k + m — 1 stored in P; 
for j := O step 1 until mm do C[/] := 0; 
for i := / — mt step 1 until / do 
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begin 
J:=k+l/—-—i;, ifi < Othen;:= / — m; 
if; > NI Aj < then 
begin 
f:= Diji; 
for /1 := O step 1 until mm do 
Ci] := CU +f & Elis 
end j 
end /; 
for j := 1 step 1 until 7m do A[k-+-/,/| := CL]; 
. Pil} := A[k+/,0] — C0] 
end /; 
f:= 0; 
for j := mmstep —1 until Odo f:= f+ ClJj; 
P(m1] := A[k+m1,0] — f, 
comment Compute P(x) from its ordinates at the points x = k 
through k + m — 1} using Newton’s divided difference 
scheme for interpolation; 
for i := | step | until m! do 
for 7 := m\ step —1 until (do P[/] := P[yj| — Pi j—1]; 
f:= 1; 
for j := 2 step 1 until m1 do 


begin 

f:=fX J; Pli\ = P\f\/f 
end /; 
for / := O step | until m2 do 
begin 


comment Add coefficients of P(x) in interval [A -+/,A4 +/+-1) 
to those of T(x) stored in (k+/)-th row of A; 
for j := O step 1 until ml do C[/] := P{J); 
for i := m2 step —{ until 0 do 
for j := istep | until m2 doC[/|:= C{/] + Cl/+1] x U-d); 
for j := 1 step | unfil ml do A[A+/,j| := A[A+//] + CL] 
end / 
end k 
end NATSPLINEEQ; 


ll 


procedure CUBNATSPLINE(N1,N2,x,y,B,C,D); 
value N1, N2; integer N1, N2; 
array x, y, B,C, D; 

comment CUBNATSPLINE computes the coefficients of a cubic 
natural spline S(x) interpolating the ordinates y{/] at points x[/], 
i = M1 through N2. For xx in [x[/],x[i-+1]): 
S(xx) = ((D[i}Xt+Cli)) Xt4+ Bli]) X 1+ ylilwiths = xx — x[i, 
Input: 

N1, N2 subscript of first and last data point 

x, y[N1:N2] arrays with x[/] asabscissa and y|[/] as ordinate of 
i-th data point. Theelements of the array x must be strictly 
monotone increasing, 

Output: 

B, C, D{N1:N2] arrays collecting the coefficients of the 
cubic natural spline S(xx). C[N2] = 0 while B[N2] and 
D{[N2] are left undefined; 

begin 
integer i, M1, M2; real R, S; 
M1:= N1l+1; M2:=N2—-1; S$S:=0; 
for i := N1 step 1 until 1/2 do 
begin 

D{i) := x{i+1] — x(a; 

R:= (yli+1]—yl[i))/DIA; 

Clij:= R-—S; S:=R 

end /; 

R:= S$ := C[N1j := C[N2] := 0; 
for i := M1 step | until 2 do 
begin 

Cli] := Cli] + R x Cli-1]; 

Bi] := (xli-t]—xfi41]) X 2-RXS; 

S:= Dirj; R:= S/Bli) 
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end /; 

for /:= M2 step —1 until M1 do 
Cli] := (Dl XCli+1]—CliJ)/Bli); 

for i := NI step | until M2 do 

begin 
Bi) -= (vlit+t]—ylf)/Dll — (2K Cl+Cli4+1]) X Did; 
Di] := (Cli+1]—Cli))/DIA]; 
Cli} := 3x Cli 

end i 

end CUBNATSPLINE; 


procedure CUBNATSPLINE2D(N1,N2,x,¥,D,A); 
value V1, N2; integer N1, N2; 
array x, y, D,h. 
comment Construction of a cubic natural spline S(x) interpolating 
the ordinates y|/| at points x|/], i = NI through N2. For xx in 
[xli,xlit-1]): 
S(xx) = yl] & (Ilr) + yi] X 14+ V 
K (-2&14+3X&txK1—1 Xt XK1)/6 + WX Xt XI—-1) 6 
with ¢ = (xx—x[i]) Ali), Alé| = x[/41] — x(i], : 
V = hii} X hli| X Dil, W = Ali} X Ali) — DIi+1). This form 
is especially suited for the evaluation of S(x) and its second 
derivative at points corresponding tos = 1/2, 1/4, 3,4, 1,8, 
ce 
Input: 
N1, N2 subscript of first and last data point 
x, y|N1:N2] arrays with x|/] as abscissa and |i] as ordinate 
of i-th data point. The elements of the array x must be 
strictly monotone increasing, 
Output: 
D(|N1I:N2]  Dii| is the second derivative of S(v) at x = x{i], 
i = N1 through N2 
A|NI:N2| Ali) = x[i+1] — xlé], i = NI through N2 — 1; 
begin 
integer /, M1, M2; real U, V, W; 
Mi := NE+1; M2:=N2—1; U:= y|NII; 
for i := NI step | until 1/2 do 
begin 
V := yli+1), Att) := x[f4+1] — x); 
Diitl):= V-—U)/Ali); U:=V 


end /; 

W :=Ah|NI]; D[NI] := := 0; 
for i := M| step | until 2 do 
begin 


comment U = /ili—1|/Pli—1], V = Afi-—1], W = Afi), Pli] 
stored in /|/], where P[/] denotes diagonal coefficient in the 
Gaussian elimination; 
Vi= W;, We= hil; Al):= V4W)xK2-UXY; 
D{i} := Dli+1] — Dli} —- UK Dii-1); U:= W/Ali| 
end /; 


D(N2] := 0; 
for i := M2 step —1 until 1/1 do 
begin 


comment Back substitution and restore /[/]; 
W := x(i+l1] — x[i; 
Dli} := (xD{[ij—-Wx DUi+1)/h[H); 
Ali}:= W 
end i 
end CUBNATSPLINE2D 
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Computation of Legendre 
Series Coefficients [C6] 


Robert Piessens [Recd. 13 Mar. 1972 and 5 Sept. 1972] 
Applied Mathematics Division, University of Leuven, 
Heverlee, Belgium 


This work was supported by the Nationaal Fonds voor Weten- 
schappelijk Onderzoek (Belgium) under Grant No. 10.174. 


Key Words and Phrases: Legendre series, Chebyshev series 
CR Categories: 5.13 
Language: Fortran 


Description 

LEGSER approximates the first N + 1 coefficients B, of the 
Legendre series expansion of a function f(x) having known Cheby- 
shev series coefficients A, . Several algorithms are available for the 
computation of coefficients A, of the truncated Chebyshev series 
expansion on [—1, 1] 


N 
ayes ee AnT,, (x), (1) 
where >,’ denotes a sum whose first term is halved. The commonly 
used algorithms are based on the orthogonal property of summa- 
tion of the Chebyshev polynomials [1]. The application of the 
analogous property of the Legendre polynomials for the calculation 
of the coefficients B, of the expansion 


N 
S(x) = Do BaPa(x) (2) 
Nand 
is less suitable for practical use since it requires the abscissas and 
weights of the Gauss-Legendre quadrature formulas [2]. 
We present here a simple method for the calculation of the 
coefficients B, , when the coefficients A, are given. Since 


+1 
B, = (n + 1/2) | Prlx) f(x) dx (3) 
—1 
we have 
N 
B, =~ (n + 1/2) pm Axln,r , (4) 
where 
+1 
gk [ Pp(x)Ti(x) de. (5) 
1 


The integrals J,,, can be calculated using the recurrence formula 
[(k — Dk — nw + INK + 2) 
((k + 3)(k + 2) — n(n + DIK 


Inky (6) 


Tn sk-re = 


where [nx = Oif kK <n, Inn = 2?"(n!)?/(2n + 1) Sif a > 0, Lo,0 = 2. 


Example. The Chebyshev series coefficients of the function 
f(x) = 1/(2 — x) are An = 2"(1 — V0.75)"/V0.75. 


ei TT ET AS 
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Table I. Coefficients of the Legendre Series Expansion of 


f(x) = 1/(2 — x) 


n Exact Bn Errors in computed 8n 
Absolute Relative 
errors errors 
0 0.549294E0 0.12E —4 0.22E —4. 
1 =0.295830E0 0.59E—5 0.20E —4 
2 0.105917E0 0.20E —5 0.19E —4 
3  0.340972E—1 0.56E —6 0.16E ~—4 
4 0.104495E-I1 0.17E —6 0.16E —4 
5 0.311269E —2 0.42E —7 0.13E—4 
10 0.601250E —5 0.41E—10 0.68E —§ 
15 0.101339E —7 0.29E —12 0.29E —4 
20 0.161332E—10 0.63E —12 0.39E —1 


In Table I, the exact Legendre series coefficients of this function 
are compared with the computed values (NV = 20). The computa- 
tions are carried out in single precision on an IBM 370 computer. 

In this example, the Chebyshev coefficients are known exactly. 
In most cases, they must be calculated using an algorithm as in [1]. 


References 
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Algorithm 


SUBKGUTINE LEGSERKCAs Bs N) 
THIS SUBROUTINE CALCULATES THE COEFFICLENTS CF THE 
LEGENDRE SERIES EXPANSION Cr A FUNCTION HAVING 
KNOVN CHERYSHEV SENIES EXPANSICN. 
INPUT PAKAMETEKS 
N DEGREE OF THE TxUNCATED CHEBYSHEV SENWIES 
4 VECTOR OF DIMENSION N+l1 WeKICH CONTAINS THE 
CHEBYSHEV CCEFFICIENTS 
GUTPUT PAKAMETER 
B VECTGR GF DIMENSION N+tl 
LEGENDRE COEFFICIENTS 
KEAL As AKs ALs Bs Bis Co UL 
INTEGER Ks Ls LLs Ns Ni 
DIMENSION ACN), BON) 
Nt =N + 1 
At = 0-0E0 
C CALCULATION OF THE FIKST LEGENDat) CCEFFICIENT 
BOlL) = O0-5E0#AC1) 
IF (N-1) 70s 305 10 
190 DD 20 K=39N192 
AK = AK + 2-0K0 
ROL = BOL) = ACKISCAK*¥AK-1 OED) 
20 CONTINUE 
30 C = 2.0E0/3-NE9D 
AL = 0-.0FD 
C STAKT MAIN LGGr 
DO 69 L=2s5N1 
C CALCULATION GF THE L-TH LEGENDskE COMPRFICIENT 
LL = Ll + 2 


wHICH CONTAINS THE 


Oaaaaanaaan 


AL = AL + 1¢0:O 

BR = C*ACL) 

Ik CLL-«GITeN1) GE TC 5O 
bz C 

AK = AL 


DC 40 K=eLLeNlod 
Uv = €CAK$1 eUFU) KAKMAL*CAL +1 -UEUI) KCAK+2 eURU) FD/ 
* COCAK+3 2OEU €CAKH2 ED) - ALECAL+E ODEO) ) *AK) 
BR = QR + ACK)*D 
AK = AK + 2.0EU 
40 CONTINUF 
90 C = 4e0EO#eC# CAL +E -OF DO) KCAL +1 eOLUI SC CALFAL+3-0b0) 
* *KCAL+AL+¢eUrU)) 
BCL) = CAL4+0.5EO) 5B 
60 CONTINUE 
TA wETURN 
FND 
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Bivariate Interpolation and 
Smooth Surface Fitting Based 
on Local Procedures [E2] 


Hiroshi Akima (Recd. 30 Mar. 1972 and 3 Nov. 1972) 
U.S. Department of Commerce, Office of Telecommu- 
nications, Institute for Telecommunication Sciences, 
Boulder, CO 80302 


Key Words and Phrases: bivariate interpolation, interpolation, 
partial derivative, polynomial, smooth surface fitting 

CR Categories: 5.13 

Language: Fortran 


Description 

Introduction, User information and Fortran listings are given 
on two Subroutines, 7PLBV and SFCFIT. Each subroutine imple- 
ments the method of smooth bivariate interpolation based on local 
procedures [3]. These subroutines are written in ANSI Standard 
Fortran [4]. 

Outline of the method. This method interpolates values of a 
single-valued smooth bivariate function z = z(x,y) and fits a 
smooth surface to a set of values of the function given at grid 
points in an x-y plane. These grid points may be unevenly spaced. 

The method Is an extension of the method of univariate inter- 
polation developed earlier by the author [1,2] and is likewise based 
on local procedures. It is designed to avoid excessive undulations 
between grid points. 

This method is based on a piecewise function composed of a 
set of bicubic polynomials in x and y; a bicubic polynomial in x and 
y is a polynomial that has terms x7y8, where a = 0, 1, 2, 3 and 
8 = 0, 1, 2, 3. Each polynomial is applicable to a rectangle in the 
x-y plane. In this method, three partial derivatives 9z/dx, 0z/dy, 
and 0?z/dxdy are determined at each data point locally by the 
coordinates of 13 data points, with the data point in question as 
the center, two data points on each side of it in the x and y direc- 
tions, and one data point in each diagonal direction. Each bicubic 
polynomial corresponding to a rectangle in the x-y plane is deter- 


Editor’s note: Algorithm 474 described here is available on magnetic 
tape from the Department of Computer Science, University of 
Colorado, Boulder, CO 80302. The cost for the tape is $16.00 (U.S. 
and Canada) or $18.00 (elsewhere). If the user sends a small tape 
(wr. less than 1 lb.) the algorithm will be copied on it and returned to 
him at a charge of $10.00 (U.S. only). All orders are to be prepaid 
with checks payable to ACM Algorithms. The algorithm is recorded 
as one file of BCD 80 character card images at 556 B.P.1., even 
parity, on seven track tape. We will supply algorithm at a density of 
800 B.P.I. if requested. Cards for algorithms are sequenced starting 
at 10 and incremented by 10. The sequence number is right justified in 
column 80. Although we will make every attempt to insure that the 
algorithm conforms to the description printed here, we cannot guaran- 
tee it, nor can we guarantee that the algorithm is correct.—L.D.F. 
and A.K.C. 
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Table I. An Example Set of Input Data 


Z(IX, IY) 

1Y = 

1 2 3 4 5 6 7 8 9 
IX XUX) |YUY) = 

0.0 5.0 10.0 15.0 20.0 25.0 30.0 35.0 40.0 
1 0.0 ‘58.2 61.5 47.9 62.3 34.6 45.5 38.2 41.2 41.7 
2 5.0 (37.2 40.0 27.0 41.3 14.1 24.5 17.3 20.2 20.8 
3 10.0 |22.4 22.5 14.6 22.55 47 7.2 #18 21 24 
4 15.0 [21.8 20.5 12.8 176 58 76 08 06 06 
5 20.0 |16.8 144 81 69 62 06 O11 -0.0 0.0 
6 25.0 |12.0 80 53 29 06 00 00 00 0.0 
7 30.0 7.4 48 1.4 #O1 00 00 00 00 0.0 
8 35.0 3.2 0.7 00 00 00 00 00 00 00 
9 40.0 0.0 00 00 00 00 00 00 00 0.0 
10 45.0 0.0 00 00 00 00 00 00 00 00 
11 50.0 0.0 00 00 00 00 00 00 00 00 


mined by the values of the function and its three partial derivatives 
at four corner points of the rectangle. 

When interpolation is made near or on the boundary of the 
defined range of z, the z values estimated at several grid points 
outside the range are used to determine the partial derivatives. In 
this method, this estimation is based on three data points in the 
x or y direction, the boundary point and two adjacent given data 
points. 

The resulting surface of this method is invariant under a linear- 
scale transformation of the coordinate system; different scalings of 
the coordinates result in equivalent surfaces. 

This method requires only straightforward procedures, not 
iterative solutions of equations with preassigned error tolerances, 
which are required by some methods. No problem concerning 
computational stability or convergence exists in application of 
this method. 

The ITPLBV subroutine. This subroutine interpolates, from 
values of the function given at input grid points in an x-y plane and 
for a given set of points in the plane, the values of a single-valued 
bivariate function z = z(x,y). 

The entrance to this subroutine is achieved by 


CALL ITPLBV (IU, LX, LY, X, Y,Z, N, U, V, W) 


where the input parameters are 

IU = logical unit number of standard output unit, 

LX = number of input grid points in the x coordinate (must be 
two or greater), 

LY = number of input grid points in the » coordinate (must be 
two or greater), 

X = array of dimension LX storing the x coordinates of input grid 
points (in ascending order), 

Y = array of dimension LY storing the y coordinates of input grid 
points (in ascending order), 

Z = doubly-dimensioned array of dimension (LX,LY) storing 
the values. of the function (z values) at input grid points, 

N = number of points at which interpolation of the z value is 
desired (must be one or greater), 

U = array of dimension WN storing the x coordinates of desired 
points, 

V = array of dimension WN storing the » coordinates of desired 
points, 

and the output parameter is 
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Table II. Output Data Obtained from the Input Data Given 
in Table I 


W(KX, KY) 
KY = 
1 2 3 4 5 6 7 8 9 
KX U(KX) |V(KY) = 
0.0 2.5 5.0 7.5 10.0 12.5 15.0 17.5 20.0 
i 0.0 58.20 61.70 61.50 55.01 47.90 54.82 62.30 48.13 34.60 
2 2.5 47.08 350.59 50.40 43.75 36.45 43.73 51.62 36.94 22.94 
3 5.0 37.20 = 40.31 40.00 = 33.81 27.00 33.86 41.30 27.41 14.10 
4 7.5 28.22) 30.35 29.90 24.80 19.22 25.03 31.18 19.15 7.49 
5 10.0 22.40 23.29 22.50 18.75 14.60 18.45. 22.50 13.47 4.70 
6 12.5 21.91 22.19 21.02 17.47 13.67 16.39 = 19.28 12.14 5.23 
7 15.0 21.80 21.82 20.50 16.74 12.80 15.07 17.60 11.66 5.80 
8 17.5 19.28 18.98 17.48 13.78 10.33 10.92 11.79 9.12 6.12 
9 20.0 16.80 16.05 14.40 10.96 8.10 7.40 6.90 6.57 6.20 
10 22.5 14.39 12.86 11.12 8.73 6.69 S.61 4.65 3.94 3.49 
11 25.0 12.00 9.79 8.00 6.58 5.30 4.10 2.90 1.71 0.60 
12 27.5 9.68 7.77 6.15 4.71 3.29 2.05 1.15 0.60 0.17 
13. 30.0 7.40 6.18 4.80 3.07 1.40 0.45 0.10 0.03 0.00 
14 (32.5 5,24 3.86 2.57 1.34 0.35 0.04 0.01 0.00 0.00 
1S 35.0 3.20 1.68 0.70 0.20 0.00 0.00 0.00 0.00 0.00 
16 37.5 1.09 0.41 0.08  -0.01 0.00 0.00 0.00 0.00 0.00 
17 40.0 0.00 0.00 0.00 0.00 0.00 0.00 90.00 0.00 0.00 
18 42.5 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 
19 45.0 0.00 0.00 0.00 0.00 0.00 0.00 90.00 0.00 0.00 
20 0=«(47.5 .0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 
21 =50.0 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 
KY = 
9 10 il 12 13 14 15 16 17 
V(KY) = 
20.0 22.5 25.0 27.5 30.0 32.5 35.0 37.5 0.0 
1 0.0 34.60 40.39 45.50 41.20 38.20 39.80 41.20 41.67 41.70 
2 2.5 22.94 29.19 34.69 30.29 27.23 28.95 30.46 30.99 31.08 
3 5.0 14.10 19.63 24.50 20.25 17.30 18.84 20.20 20.70 20.80 
4 7.5 7.49 «11.32 14.73 10.48 7.34 «68.35 9.26 9.58 9.68 
5 10.0 4.70 6.12 = 7.20 4.03 1.80 1.96 2.10 2.12 2.10 
6 12.5 5.23 6.11 6.60 3.41 1.17 0.93 0.75 0.68 0.62 
7 15.0 5.80 6.84 7.60 3.74 0.80 0.66 0.60 0.59 0.60 
8 17.5 6.12 4.79 3.61 1.72) 0.39 0.28 0.22 0.21 0.22 
9 20.0 6.20 3.37 0.60 0.25 0.10 0.04 0.00 —G.01 0.00 
10 22.5 3.49 1.77 0.16 0.06 0.02 90.0 0.00 -0.00 -0.00 
1 25.0 0.60 0.04 0.00 0.00 0.00 0.00 0.00 0.00 0.00 
12 (27.5 0.17 -—0.02 0.00 0.00 0.00 0.00 0.00 0.00 0.00 
13° 30.0 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 
14 32.5 0.00 0.00 0.00 0.00 0.00 90.00 0.00 0.00 0.00 
15 35.0 0.00 0.00 0.00 0.00 0.00 00 £0.00 0.00 0.00 
16 37.5 - 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 
17 40.0 0.00 0.00 0.00 0.00 0.00 90.00 0.00 0.00 0.00 
18 42.5 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 
19 45.0 : 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 
20 «(47.5 | 0.00 0.00 0.00 0.0 0.00 0.0 0.00 0.00 0.00 
21 = 530.0 0.00 0.00 0.00 0.00 0.00 90.00 906.00 0.00 0.00 


Fig. 1. Perspective representation of (a) the original data points 
given in Table I and of (b) the surface fitted by the SFCFIT sub- 
routine with LX = 11, LY = 9, MX = 5, MY = 5, NU = Sl, 
and NV = 41. 


(a) (b) 
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W = array of dimension N where the interpolated z values at 
desired points are to be displayed. 

This subroutine occupies 1577 locations on the CDC-3800 
computer. Computation time required for this subroutine on the 
same computer is approximately equal to: 1 + 3.0 * N msec for 
LX = LY = 10; 10 + 4.0 * N msec for LX = LY = 100. 

When the function to be interpolated represents a periodic 
function of x and/or y, the input data to this subroutine should 
consist of the data that cover a whole period and two additional 
grid lines on each side of them. 

The SFCFIT subroutine. This subroutine fits a smooth sur- 
face of a single-valued bivariate function z = z(x,y) to a set of 
input data points given at input grid points in an x-y plane. It 
generates a set of output grid points by equally dividing the x and y 
coordinates in each interval between a pair of input grid points, 
interpolates the z value for the x and y values of each output grid 
points, and generates a set of output points consisting of input 
data points and the interpolated points. 

The entrance to this subroutine is achieved by 


CALL SFCFIT (IU, LX, LY, X, Y, Z, MX, MY, NU, NV, U, 
V, W) 


where the input parameters are 

IU = logical unit number of standard output unit, 

LX = number of input grid points in the x coordinate (must be 
two or greater), 

LY = number of input grid points in the y coordinate (must be 
two or greater), 

X = array of dimension LX storing the x coordinates of input grid 
points (in ascending or descending order), 

Y = array of dimension LY storing the y coordinates of input grid 
points (in ascending or descending order), 

Z = doubly-dimensioned array of dimension (LX,LY) storing the 
values of the function at input grid points, 

MX = number of subintervals between each pair of input grid 
points in the x coordinate (must be two or greater), 

MY = number of subintervals between each pair of input grid 
points in the y coordinate (must be two or greater), 

NU = number of output grid points in the x coordinate = (LX—1)* 
MX + 1, 

NV = number of output grid points in the y coordinate = (L Y—1)* 
MY + 1, 

and the output parameters are 

U = array of dimension NU where the x coordinates of output 
points are to be displayed, 

V = array of dimension NV where the y coordinates of output 
points are to be displayed, 

W = doubly-dimensioned array of dimension (NU,NV) where 
the z coordinates of output points are to be displayed, 
This subroutine occupies 1333 locations on the CDC-3800 

computer. Computation time required for this subroutine on the 

same computer is approximately 


(1.5 + (0.15 + 0.1 * MX) * MY) * LX «LY msec. 


When the surface exhibits periodicity with respect to x and/or 
Y, the input data to this subroutine should consist of the data that 
cover a whole period and two additional grid lines on each side of 
them, and two intervals on each side be discarded from the set of 
output points. 

Test results. All tests were performed on a CDC-3800 com- 
puter. An example is shown in Tables I and II. The X, Y, and Z 
values shown in Table I were given to the SFCFIT subroutine as 
input data with LX = 11, LY = 9, MX = 2, MY = 2, NU = 21, 
and NV = 17, and the U, V, and W values shown in Table II were 
obtained. Also, the data in Table I, together with each combination 
of the U and V values in Table II, were given to the JTPLBV sub- 
routine with LY = 11, LY = 9, and N == 1, and the respective W 
value in Table II was obtained each time. Figure 1(a) depicts the 
original data points given in Table I, and Figure 1(b) the surface 
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fitted by the SFCFIT subroutine with LX = 11, LY =9, MX = 5, 
MY = 5, NU = S51, and NV = 41. This example demonstrates 
one of the properties of this method, that the resulting surface is 
free frorn excessive undulations. 
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SUBROUTINE ITPLBVCIU, LX, LY. Xs» Ys Zs Ne Us Ve Wd 
BIVARIATE INTERPOLATION 
THIS SUBROUTINE INTERPOLATES, FROM VALUES OF THE FUNCTION 
GIVEN AT INPUT GRID POINTS IN AN X-Y PLANE AND FOR A GIVEN 
SET OF POINTS IN THE PLANE, THE VALUES OF A SINGLE-VALUED 
BIVARIATE FUNCTION Z = Z¢(XsY)- 
THE METHOD IS BASED ON A PIECE-WISE FUNCTION COMPOSED OF 
A SET OF BICUBIC POLYNOMIALS IN X AND Ye EACH POLYNOMIAL 
1S APPLICABLE TO A RECTANGLE OF THE INPUT GRID IN THE X-Y 
PLANE. EACH POLYNOMIAL IS DETERMINED LOCALLY. 
THE INPUT PARAMETERS ARE 
IU = LOGICAL UNIT NUMBER OF STANDARD OUTPUT UNIT 
LX = NUMBER OF INPUT GRID POINTS IN THE X COORDINATE 
(MUST BE 2 OR GREATER) 

LY * NUMBER OF INPUT GRID POINTS IN THE Y COORDINATE 
(MUST BE 2 OR GREATER) 

x @ ARRAY OF DIMENSION LX STORING THE X COORDINATES 
OF INPUT GRID POINTS (IN ASCENDING ORDEP) 

Y = ARRAY OF DIMENSION LY STORING THE Y COORDINATES 
OF INPUT GRID POINTS CIN ASCENDING ORDER) 

z = DOUBLY-DIMENSIONED ARRAY OF DIMENSION (LX.LY) 
STORING THE VALUES OF THE FUNCTION (2 VALUES) 
AT INPUT GRID POINTS 

N = NUMBER OF POINTS AT WHICH INTERPOLATION OF THE 
Z VALUE IS DESIRED (MUST BE 1 OR GREATER) 

U = ARRAY OF DIMENSION N STORING THE X COORDINATES 
OF DESIRED POINTS 

Vv = ARRAY OF DIMENSION N STORING THE Y COORDINATES 
OF DESIRED POINTS 

THE OUTPUT PARAMETER IS 

W = ARRAY OF DIMENSION N WHERE THE INTERPOLATED Z 
VALUES AT DESIRED POINTS ARE TO BE DISPLAYED 
SOME VARIABLES INTERNALLY USED ARE 

ZA = DIVIDED DIFFERENCE OF 2 WITH RESPECT TO X 

2B = DIVIDED DIFFERENCE OF Z WITH RESPECT TO Y 

ZAB = SECOND ORDER DIVIDED DIFFERENCE OF Z WITH 

RESPECT TO X AND Y 

ZX = PARTIAL DERIVATIVE OF 2 WITH RESPECT TO X 

ZY = PARTIAL DERIVATIVE OF Z WITH RESPECT TO Y 

ZXY = SECOND ORDER PARTIAL DERIVATIVE OF Z WITH 

RESPECT TO X AND Y 
DECLARATION STATEMENTS 
DIMENSION XC(LX), YCLY). ZCLXs+LY), UCN), VEN). WEN) 
DIMENSION ZAC5,2). Z2BC2,5)4.ZABC3,3)5 Z2XC4,4)5 ZYC44), 
« ZXYC4,4) 

EQUIVALENCE (Z3A1,ZAC1)), (Z23A2,ZAC2))4 (Z3A3,ZAC3)), 
(Z3A4s4ZAC4))» CZ3ASAZAC5) 2s (Z4ALSZACO)), CZ4A2,Z2AC7)), 
(Z4A3,ZACB)), (Z4A4,ZA69)), CZ4A5sZAC1B8))2, (Z3B1,ZB(1)), 
(Z3B2,Z2B(3)), (Z23B3,Z2B¢5)), (23B4sZB(7)), (Z23B5,2B(9)), 
(Z4B1,Z2B(2)), (Z4B2,Z2B64)), (Z4B3-ZBC6)), (Z4B4,ZB(8))» 
(Z4BS,ZB(18)), (ZA2B2,ZABC1))s (ZAIB2,ZABC2)), 
(ZA4B2,ZAB(3))4. (ZA2B3,ZABC4)), (ZAIBIsZABCS)), 
(ZA4B3,ZAB(6)), (ZA2B4,ZABC7)), (ZAIBA,ZABC B82). 
(ZA4B4,ZAB(9)), (Z2X33,72KC6)), CZX4322X(7)), 
CZX34,ZXC1B) 4 CZK44,ZXC11)), CZY33,ZYC6)),» 

CZY43,ZY C705 CZY344ZYC1B8)d, CZY44,ZYC119), 
CZXY33,ZXYC6))5 CZXVYAILZXVC7) I, CZXYIB4AAZXYC1B)), 
(ZXY44,ZXYC11)), €PBO,Z33), (PB1,ZY33), ¢(P18,ZX33), 
CPLI,ZXY33) 

EQUIVALENCE (LX@,ZX(1))5 CLXM1,ZXC4)), (LXM2,2XC13))> 
CLXP1,ZX€16)2, CLY@s,ZYC1))5 CLYML2Z2YC4)), CLYM2,ZYC13)), 
CLYPLSZYC16))5 CIXsZXYC1I) 9 CLYSZXYC4)), CIXPV,ZXYC13) >. 
CIYPV,ZXYC16))5 CIMNsJX)o CIMXsJY» CUXM2,TX1), 
CJYM2,SY1), CUK,DX)s CVKSDY), CAILZA5S,B1,B5,ZX(2),A, 08), 
€A2,ZX€5),B,Q01), (A4,2ZX(8),C,02), (B2,Z2Y(2),D,03), 
CB4,ZYC14),E), (X2,2X0322AISO, (X44ZX09))2 CXS,Z2KC12)), 
CY2,ZXC14)), CY4,ZY63),B3SQ0), CY5sZXC15),P02), 
(Z223,ZYC5)4P83), (Z2Z24,ZY(8), P12), (232,2Y(9), P13), 
CZ34,2YC12), P20). (Z3S,ZYC15),P21)4 (242,Z2XYC2), P22), 
(Z43,ZXYC5),P23), (Z44,ZXYC3),P3G)s (Z45,7XY(8) P31)» 
€Z53,ZXY(09),P32)5 CZ54,2XYC12),P33)5 CW2,WY2,W4A), 
CW3,WY3,WL,W5)5 CWXA,ZXYC14)), CWXS,ZXYC15)) 
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C PRELIMINARY PROCESSING 
C SETTING OF SOME INPUT PARAMETERS TO LOCAL VARIABLES 
1ue = LU 
LX@ = LX 
LXM1 2 LX® - 1 
LXM2 = LXMI - 1 
LXP1 = LX@ + 1 
LY@ = LY 
LYM! = LY® - ! 
LYM2 = LYMI - 1 
LYPIL = LY@ + 1 
N@ = N 
C ERROR CHECK 
IF CLXM2-LT.@) GO TO 716 
IF (LYM2-LT.@) GO TO 728 
IF (N@eLTe1) GO TO 738 
DO 10 IXs2,LxX@ 
IF (XCIX-1)-XCIX)) 18, 748, 758 
18 CONTINUE 
DO 20 1Y#2,LY8 
IF CYCIY-1)-YCLY)) 28, 778, 788 
28 CONTINUE 
C INITIAL SETTING OF PREVIOUS VALUES OF IX AND IY 
IXPV = @ 
1YPV = @ 
C MAIN DO-LOOP 
DO 788 K=1,N8 
UK = UCK) 
VK = VCK) 
ROUTINES TO LOCATE THE DESIRED POINT 
TO FIND OUT THE IX VALUE FOR WHICH 
CUCK) «GEsXCIX=-1)) eANDe CUCK) ¢LTeXCIX)) 
IF (LXM2.EQ-@) GO TO 88 
IF CUK.GEsX(LX8)) GO TO 78 
IF CUKeLT»X(1)) GO TO 60 
IMN = 2 
IMX = LX@ 
38 IX 2 CIMN+IMX)72 
IF CUKeGE-X(IX)) GO TO 48 
IMX = IX 
GO TO 58 
48 IMN = IX + 1 
S58 IF CIMX.GT.IMN) GO TO 38 
IX = IMX 
GO TO 98 
68 xX 21 
GO TO 98 
78 IX = LXP1 
GO TO 98 
7) Ix = 2 
C TO FIND OUT THE IY VALUE FOR WHICH 
C CUCK) -GEeYCLY=1)) AND. CVCK) -LT«YCIY)) 
98 IF (LYM2-EQ-@) GO TO 156 
IF (VK.eGE-YCLY@)) GO TO 148 
IF (VKeLT.YC(1)) GO TO 136 
IMN = 2 
IMX = LY@ 
18 TY = CIMN+IMX)/72 
IF (VUK.GE.YCIY)) GO TO 118 
IMX = IY 
GO TO 12@ 
116 IMN = 1Y + 1 
120 IF CIMX.GT.IMN) GO TO 180 
1Y = IMX 
GO TO 168 
130 ly sit 
GO TO 168 
146 IY = LYPI 
GO TO 168 
150 y2e 


aaa 


C TO CHECK IF THE DESIRED POINT IS IN THE SAME RECTANGLE 
C AS THE PREVIOUS POINT. IF YES. SKIP TO THE COMPUTATION 
C OF THE POLYNOMIAL 
168 IF (IXsEQ.IXPV -AND. IY-EQ.I1YPV) GO TO 698 
IXPV = IX 
IYPV = LY 
C ROUTINES TO PICK UP NECESSARY X,. Ys AND Z VALUES, TO 
C COMPUTE THE ZA, ZB, AND ZAB VALUES, AND TO ESTIMATE THEM 
C WHEN NECESSARY 


JX = IX 

IF (JX-EQ.1) JX = 2 

IF (JXeEQeLXPI1) JX = LXB 
JY = IY 

IF (JY-EQ.1) JY = 2 

IF (JY-EQ.LYPI) JY = LY@ 
JXM2 = JX = 2 


JXML = JX - LX 
JYM2 3 JY - 2 
JYML = JY - LY@ 


C IN THE CORE AREA, 1.Ee. IN THE RECTANGLE THAT CONTAINS 
C THE DESIRED POINT 
X3 = XCJUX-1) 
X4 = XCJX) 
AS = 1.80/¢€X4-X3) 
Y3 = YCJY-1) 
= YCJY) 
B3 = 1.8/¢Y4-Y3) 
= Z(JUK- ls SY 1) 
Z43 = ZCUX,SY-1) 
= Z2¢J5X-1,J0Y) 
Z44 = Z(JUX,SY) 
(Z43-Z233)*A3 
(244-234) *A3 
Z3B3 = (234-233)*B3 
Z4B3 = (244-243)*B3 
ZA3B3 = (Z4B3-23B3) *A3 
C IN THE X DIRECTION 
IF (LXM2-EQ.@8) GO TO 238 
IF (JXM2.EQ-8@) GO TO 178 
X2 = XCJX-2) 
AQ = 1.0/(X3-X2) 
223 = Z2¢UK-2,JSY-1) 
224 = 2CJX-2,dY) 


0 


COLLECTED ALGORITHMS (cont.) 


178 


180 


190 


228 


238 


Z3A2 = €233-223)*A2 

Z4A2 = (234-224) ¥A2 

IF (JXML-EQ-8) GO TO 188 
XS = XCJUXe1) 

A4 = 1.0/(K5-X4) 

Z83 = ZCUX+lsJSY-1) 

Z54 = ZCUX+ls, JY) 

ZBA4S = (253-243) "Ad 

ZHAG = (254-744) 4A4 

IF (JXM2-NE-8) GO TO 198 
Z3Ae = Z3A3 + Z2Z3A3 - 23A4 
Z4A2 = ZT4A3 + Z4A3 - Z4A4 
GO TO 190 

Z3A4 2 2343 + Z3A3 - Z3A2 
Z4A4 * Z4A9 + Z4A3 - Z4A2 
ZA2BS = (Z24A2-Z3A2) *BI 
ZA4BI = (Z4A4-Z3A4) 453 

IF CJX-LE«3) GO TO 200 

Al = 1460/¢(X2-XCUX-3)) 


ZSAL = €223=265X-3, 5-1) #Al 
Z4AL = (CZ24-Z2¢6UX-3,SY)) #AL 
GO TO 210 


Z3Al © Z3A2 + Z3A2 - 23A3 
Z4AlL = Z4A2 + Z4A2 - Z4A3 
IF (JX+GE+LXMI1) GO TO 228 
AS = 16O/¢X(SX+#2) -XS) 

Z3AS * (ZCSX+2,SY- 19-253) 4A 
ZAAS = CZCSX#2, SY) -Z54) #AS 
GO TO 248 

Z3A5S = Z3A4 + Z3A4 - Z3AI9 
Z4AS = 24A4 + Z4A4K ~ Z4A3 
GO TO 248 

Z3A2 = Z3A3 

Z4A2 = 24A3 

GO TO 188 


C IN THE Y DIRECTION 


248 


258 


318 


IF (LYM2.EQ.8) GO TO 318 
IF CJYM2-EQ.8) GO TO 258 
y2 = YCJY-2) 

B2 = 1.0/¢¥3-Y2) 

232 = Z2¢JK-1,J5Y-2) 

Z42 = ZCIXs SY-2) 

Z23B2 = (233-232) "B2 

Z4B2 = (243-242) *B2 

IF CJYML~EQ-8) G TO 268 
YS = YCJY+l) 

B4 = 1.0/¢(Y5=-Y4) 

235 = ZCJX-1,JYe1) 

Z45 = ZCUK,JSY+1) 

ZIB4A = (235-234) *B4 

Z4Ba = (245-244)"B4 

IF (JYM2.NE.8) GO TO 278 
Z3B2 = Z3B3 + Z23B3 - Z3B4 
Z4B2 = Z4B3 + Z4B3 - Z4B4 
GO TO 278 

Z3B4 = 23B3 + Z3B3 - Z23B2 
Z4B4 2 Z4B3 + Z4B3 - Z4B2 
ZAIB2 = (Z24B2-Z3B2)*A3 
ZAIB4 = (24B4-Z3B4) *A3 

IF (JY-LE-3) GO TO 288 

Bl = 1.8/¢Y2-YCJUY-3)) 
Z3B1 = (Z32-ZCUX~-1,50Y-3)) "BI 
Z4B1 = (Z42-Z2¢05X,SY-39)) *Bl 
GO TO 298 

Z3B1 = Z3B2 + Z3B2 - 23B3 
Z4Bl © Z4B2 + Z4B2 - Z4B3 
IF (JY¥.GE.LYMI) GO TO 368 
BS = 1.8/¢6YCUY+2)-Y5) 
Z3BS = (Z¢CUKX-1,SY+2)-235) "BS 
Z4B5 = (2¢(UXsSY+2)-245) #BS 
GO TO 328 

Z3BS 2 Z3B4 + Z3B4 ~- Z23B3 
Z4B5 = Z4Ba + Z4B4 - Z4B3 
GO TO 32¢ 

Z3B2 = 23B3 

Z4B2 = Z4B3 

GO TO 268 


C IN THE DIAGONAL DIRECTIONS 


328 


33@ 


348 


356 


368 


370 


368 


398 


48 


IF (LXM2.EQ.@) GO TO 4@@ 

IF (LYM2.EQ.@2 GO TO 41@ 

IF (JXML.-EQ.@) GO TO 356 

IF (JYM2.EQ.@) GO TO 338 

ZAMB2 = ((Z253-Z2¢UX+1,JY¥-2) ) #B2-24B2) "AG 
IF (JYML-+EQ.@) GO TO 34@ 

ZAGBA = ((ZCIX+ 1, JY 41) 254) #B4-Z4BA) MAA 
IF (JYM2«NE.@) GO TO 38¢ 

ZA4B2 = ZA4B3 + ZA4BQ - ZAABG 

GO TO 388 

ZA4B4 = ZA4B3 + ZA4B3 - ZA4B2 

GO TO 388 

IF (JYMZ2.EQ-@) GO TO 368 

ZAQB2 = (Z9BQ-(Z23-Z(UX=-2,SY~2) ) #B2) #A2 
IF (JYML.EQ.8) GO TO 370 

ZAQBA = (Z3B4-(ZCIX-2,5Y41) -Z24) *B4) #A2 
IF (JYM2.NE.@) GO TO 390 

ZA2B2 = ZA2B3 + ZA2B3 - ZAzB4 

GO TO 390 : 

ZA2B4 = ZA2B3 + ZA2B3 - ZA2B2 

GO TO 398 

IF (JXM2.NE.@) GO TO 358 

ZA2B2 = ZA3B2 + ZA3B2 - ZA4B2 

ZA2B4 = ZA3B4 + ZAIB4 - ZAAB4 

GO TO 429 

IF (JXML.NE-»@) GO TO 428 

ZA4B2 = ZA3B2 + ZA3BZ - ZA2B2 

ZA4B4 © ZA3B4 + ZA3B4 - ZA2B4 

GO TO 426 

ZA2B2 = ZAIB2 

ZA4B2 = ZA3B2 

ZA2B4 = ZA3BA 

ZA4B4 = ZAIB4 

GO TO 420 
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418 ZA2Be = ZA2B3 
ZAGB4A = ZA2B3 
ZA4B2 = ZA4B3 
ZA4B4 = ZA4B3 


C NUMERICAL DIFFERENTIATION dele TO DETERMINE PARTIAL 


c 
c 


DERIVATIVES ZX. ZY. AND ZXY AS WEIGHTED MEANS OF DIVIDED 
DIFFERENCES ZA, ZB, AND ZAB, RESPECTIVELY 
420 DO 486 JY=2,3 
DO 476 JX22,3 
W2 = ABSCZACIX+2,JSY-1) -ZACUX+1,SY-1)) 
W3 = ABS(ZACIX,JSY- 1) -ZACIX- 1-19) 
SW = W2 ¢ W3 
IF (SWeEQ.@.@) GO TO 438 
WX2 = Wessw 
WX3 = W3/SW 
GO TO 44@ 
438 WX2 = 0.5 
WX3 = 8.5 
446 ZXCIX2SY) = WX2eZACIKsSY-1) + WX3"ZACUX+1,SY~1) 
W2 = ABSCZBCJX-1,JY¥42)-ZBCIX- lo JY+1)) 
W3 3 ABSCZBCJUX- 1, SY) -ZBCIX- 1s dY¥-1)) 
SW = W2 + W3 
IF (SWeEQ.2@.8) GO TO 458 
wre = We/sw 
WY3 = W3/SV 
GO TO 468 
45@ wY2 = 6.5 
WY3 = @.5 
4608 ZYCIXASY) = WY2"ZBCUX-1,0Y) + WY3*ZBCUX-1,5Y*1) 
ZXYCUKsJSY) # 
* WY2* (WX2eZABCIX= 1s JY- 1) +WXI*ZABCUX,TY- 19) + 
* WY3 4 C WX2eZABCIX-1, SY) +WXO*ZAB UUs SYD) 
478 CONTINUE 
489 CONTINUE 
WHEN CUCK) «LT eX€1))¢OR+ CUCK) « GTe XC LX) D 
IF CIXsEQ-LXP1) GO TO 538 
IF CIX-NE-1) GO TO 598 
We = A4*(3. O#AI+A4) 
Wl © 2.0*AI*CA3-A4) + W2 
DO 580 JY=2,3 
ZXCLLSY) 2 CWLEZACLSY~ 1) +W2eZAC2,oY- 1/7 CWl+W2) 
ZYCLASY) = ZYC2,SY) + ZYC2.SY) » ZYC3,JSY) 
ZXYCLsSY) = ZXYC2,SY) + ZXYC2,T) - ZXYCIISY?) 
DO 498 JX1s2,3 
JX = 5 - JX! 
ZXCUXsSY) & ZXCUX-1,SY) 
ZYCIXSY) = ZYCIKX-1,SY) 
ZXYCUKsSY) = ZXYCIX-1,0Y) 
498 CONTINUE 
582 CONTINUE 
X3 = X3 - 1.8/7A4 
Z33 3 233 - Z3A2/A4 
DO 51@ JY31,5 
ZBC2,NY) = ZBC1AJY) 
518 CONTINUE 
DO S528 JY=2,4 
ZBCIL,JY) © ZBCILsJY) - ZABCIsSY-19/7A4 
528 CONTINUE 
AZ = A4& 
JX 2 1 
GO TO 578 
538 W4 2 A2Q*(3.0*#A3+A2) 
W5 = 2-O"AI*CAI-A2) + WA 
DO 558 JY#2,3 
ZXC4,J°Y) = CW4e ZACK, JY- 1) +W5eZAC Se SY= 1)09/ CWA4 WS) 
ZYC4,UY) = ZYCIJY) + ZYCILY) - ZYC2,SY) 
ZXYC4,SV) = ZXYC3,SY) + ZXYC3,SY) - ZXYC2,SY) 
DO 548 JX22,3 
ZXCUX,SY) = ZKCUX+lsJSY) 
ZYCIKeJY) = ZYCIX+1 SY) 
ZXYCUXsSY) @ ZXYCIX+1 SY) 
548 CONTINUE 
558 CONTINUE 
X33 X4 
233 = 243 
DO 568 JY#1,5 
ZBCILZSY) = ZBC2.dY) 
568 CONTINUE 
A3 = A2 
JX = 3 
$78 ZAC3,1) = ZACIX+l, 1) 
DO 588 JY#=1,3 
ZABC2,JY) #* ZABCUXsJSY) 
588 CONTINUE 
WHEN (VCK) eLTeYC1)) OR CUCK) GT YCLY) ) 
598 IF CLY-EQ-LYP1) GO TO 638 
IF CLY.NE-!) GO TO 688 
we = B4*(3.0*B3+B4) 
Wl = 2.0*B3*(B3-B4) + We 
DO 628 JX=2,3 
LF (JX-eEQ.3 e AND. IX-EQ-LXPI) GO TO 600 
IF CJK«EQe2 sANDe IX-EQ-1) GO TO 6008 
ZYCIXe 1) = CWLRZBCUX= 1a 1) #W2HZBCUK= 1,297 CW 42) 
ZX(SX2 1) = ZXCIX22) + ZXCIK22) = ZKXCIX1I) 
ZXYCIX21) = ZXYCUK22) + ZXYCIK,2) - ZXYCIKs3) 
606 DO 618 JY!=2,3 
JY = 5 - JYl 
ZYCIXeTY) = ZYCUKsSY- 1) 
ZXCIKedY) @ ZXCUXSY" 1) 
ZKXYCIKe TY) = ZXYCIKsdY- 1) 


618 CONTINUE 
622 CONTINUE 
Y3 = Y3 - 1-0/8B4 


233 = 233 - Z3B2/B4 
Z3A3 2 Z3A3 - ZA3B2/B4 
Z3B3 = Z3B2 
ZA3B3 = ZAI3B2 
B3 = B4 
GO TO 678 
638 W4 = B2e(3.0*B3+B2) 
WS = 2.0"B3*(B3-Be) + W4 
DO 668 JX=2,3 


0 


COLLECTED ALGORITHMS (cont.) 


Cc 


IF (JXeEQe3 «AND. IX-eEQ-eLXP1) GO TO 648 
IF (JX-EQ.2 -AND. IX.EQ-!) GO TO 6408 
ZYCUKXs 4) = CW4*ZBCIX- 1,4) 4+VU5eZBCIX- 1,5) 27 CWM4W5) 
ZXCIKXsA) = ZXCIX3) + ZKCIXS3 - ZXCSK, 2) 
ZXYCIXs4) = ZXYCIX23) # ZXYCIXL3) - ZXYCIXKs 2) 
640 DO 658 JY=2,3 

ZYCIXsSY) & ZYCIX,SY 41) 

ZXCIXeSY) = ZXCIXsSY+1) 

ZXYCIKsSY) = ZXYCIXsSY+1) 


658 CONTINUE 
668 CONTINUE 
Y3o3 Y4 


Z33 = 233 + 23B3/B3 
23A3 = Z3A3 + ZA3B3/B3 


23B3 Z3B4 
ZA3B3 = ZA3B4 
B3 = Be 


678 IF CIXeNEel «AND. IX«NE«LXP1) GO TO 688 
JX = IX/LXPL + 2 
JX1 = 5 = JX 
JY 2 IY/LYP!] + 2 
JY1 2 5 - JY 
ZXCIXsSY) © ZXCIKIZSY) & ZXCUXSSY1) = ZXCIXL YI) 
ZYCIXsSY) = ZYCUXI,SY) + ZYCUXsdY1) - ZYCUX1IZSYID 


ZXYCIXs SY) = ZXYCIKLLSY) + ZXYCIXSYI) = ZXYCUK1LSY1) 


DETERMINATION OF THE COEFFICIENTS OF THE POLYNOMIAL 
686 @X3B3 = (ZX34-2X33)*B3 
ZX4B3 = (2ZX44-Z2X43) «BS 
ZYSA3 = (ZY43~Z2Y33)*A3 
ZY4A3 = (ZY44-Z2Y34) *A3 

A = ZA3B3 - ZX3B3 - ZY3A3 + ZxXY33 

B = ZX4B3 - ZX3B3 - ZXY43 + Z2xXY33 

= ZY4A3 - ZY3A3 - ZXY34 + ZXY33 

D = ZXY44 - ZXY43 - ZXY34 + ZXY33 


B3SQ B3*B3 
PB2 = (2.0*(Z3B3~ZY33)+Z3B3-ZY34) *B3 
PO3 = €-2.0*Z23B3+ZY34+ZY33) *B350 
P12 = (2. O*(ZX3B3-ZXY33)+ZX3B3-ZXY34) *B3 
PIS = (+2. O*ZX3B3+ZXY34+ZXY33)*B3S0 
P2B = (2 O*¥(Z3A3-Z2ZX33)+Z3A3-Z2X43) *AB 
P2l = (2. Ox CZY3A3-ZXY33)+ZY3A3-Z2XY43) *A3 
P22 = (3. Oe CA+E)+D) ¥A3*BI 
P23 5 (-3.0*E-B-D)*A3*B3SQ 
P3O = €-2.0*Z3A3+ZX43+ZX33)*A3S0 
P31 = (€-2.0*ZY3A3+ZXY43+ZXY33)*A3S0 
F32 = (-3.0*E-C-D)*B3*A3SQ 
P33 = (D+E+E) *A3SO*xB3SO 

COMPUTATION OF THE POLYNOMIAL 

698 DY = VK - Y3 
Q@ = POO + DY*CPOI1+DY*(PO2+DY*PB3) > 
Ql = PI1® + DY*¢(P11+DY*(P12+DY*P13>) 
Q@2 = P28 + DY*¥CP2Z1+DY*( P22+DY*"P23) >) 
Q@3 = P3@ + DY*(P31+DY*(P32+DY*P33)>) 
DX = UK - X3 


WOK) = Q@ + DX*CQ1+DX*(Q2+DX*Q3)) 
702 CONTINUE 


C NORMAL EXIT 


c 


c 


RETURN 
ERROR EXIT 
718 WRITE (1U@,99999) 
GO TO 882 
720 WRITE €1U@,99998) 
GO TO 88@ 
730 WRITE (1U@,99997) 
GO TO 888 
74@ WRITE (1U@,99996) 
GO TO 768 
750 WRITE (1U@,99995) 
766 WRITE (1U8,99994) IX, XCIX) 
GO TO 88a 
778 WRITE (€1U8,99993) 
GO TO 798 
786 WRITE (1U@,99992) 
790 WRITE (1U8@,99991) IY, YCIY) 
800 WRITE (1U6,99998) LX@, LY@, N@ 
RETURN 
FORMAT STATEMENTS 


99999 FORMATCIX/23H -*#x* LX = 1! OR LESS-/) 
99998 FORMATCIX/23H *x* LY = 1 OR LESS.-/) 


99997 FORMATCIX/22H kee 
99996 FORMATCIX/27H ** 
99995 FORMAT(IX/33H *** 


N = @ OR LESS./) 
IDENTICAL X VALUES./) 
X VALUES OUT OF SEQUENCE./) 


99994 FORMAT( 7H IX =, 16, 1@X. THXCIX) =, El2-3) 


99993 FORMATCIX/27H #** 
99992 FORMATCIX/33H **x 


IDENTICAL Y VALUES./) 
Y VALUES OUT OF SEQUENCE./) 


99991 FORMAT 7H TY =, 16, 1@X, THYCIY) 3, El2.3) 


99998 FORMAT( 7H 


aagaananaaaaaagaaaaa 


LX =, 16, 10X, 4HLY =, 16, 10Xs 3HN 3, I77 
* 36H ERROR DETECTED IN ROUTINE ITPLBV) 


END 


SUBROUTINE SFCFITCIU, LXs LYs Xs» Ys Zs MX» MY» NUs NVe 


* Vs W 

SMOOTH SURFACE FITTING 

THIS SUBROUTINE FITS A SMOOTH SURFACE OF A SINGLE-VALUED 

BIVARIATE FUNCTION Z = Z¢XsY) TO A SET OF INPUT DATA 

POINTS GIVEN AT INPUT GRID POINTS IN AN X-Y PLANE. IT 

GENERATES A SET OF OUTPUT GRID POINTS BY EQUALLY DIVIDING 

THE X AND Y COORDINATES IN EACH INTERVAL BETWEEN A PAIR 

OF INPUT GRID POINTS, INTERPOLATES THE Z VALUE FOR THE 

X AND Y VALUES OF EACH OUTPUT GRID POINT. AND GENERATES 

A SET OF OUTPUT POINTS CONSISTING OF INPUT DATA POINTS 

AND THE INTERPOLATED POINTS. 

THE METHOD IS BASED ON A PIECE-WISE FUNCTION COMPOSED OF 

A SET OF BICUBIC POLYNOMIALS IN X AND Ye EACH POLYNOMIAL 

IS APPLICABLE TO A RECTANGLE OF THE INPUT GRID IN THE X-Y 

PLANE. EACH POLYNOMIAL IS DETERMINED LOCALLY. 

THE INFUT PARAMETERS ARE 

TU) = LOGICAL UNIT NUMBER OF STANDARD OUTPUT UNIT 

LX = NUMBER OF INPUT GRID POINTS IN THE X COORDINATE 
(MUST BE 2 OR GREATER) 
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LY = NUMBER OF INPUT GRID POINTS IN THE Y COORDINATE 
(MUST BE 2 OR GREATER) 

x = ARRAY OF DIMENSION LX STORING THE X COORDINATES 
OF INPUT GRID POINTS CIN ASCENDING OR DESCENDING 
ORDER) 

i = ARRAY OF DIMENSION LY STORING THE Y COORDINATES 
OF INPUT GRID POINTS CIN ASCENDING OR DESCENDING 
ORDER) 

z = DOUBLY-DIMENSIONED ARRAY OF DIMENSION (LXsLY) 
STORING THE VALUES OF THE FUNCTION AT INPUT 
GRID POINTS 

MX #* NUMBER OF SUBINTERVALS BETWEEN EACH PAIR OF 
INPUT GRID POINTS IN THE X COORDINATE 
(MUST BE 2 OR GREATER) 

MY = NUMBER OF SUBINTERVALS BETWEEN EACH PAIR OF 

INPUT GRID POINTS IN THE Y COORDINATE 

(MUST BE 2 OR GREATER) 

NUMBER OF OUTPUT GRID POINTS IN THE X COORDINATE 

CLX-<1)aMX+1 

NUMBER OF OUTPUT GRID POINTS IN THE Y COORDINATE 

= (LY-1)*MY+1 

THE OUTPUT PARAMETERS ARE 

U = ARRAY OF DIMENSION NU WHERE THE X COORDINATES OF 
OUTPUT POINTS ARE TO BE DISPLAYED 

Vv = ARRAY OF DIMENSION NV WHERE THE Y COORDINATES OF 

Ww 


NV 


OUTPUT POINTS ARE TO BE DISPLAYED 

= DOUBLY-DIMENSIONED ARRAY OF DIMENSION (NU.NV) 
WHERE THE Z COORDINATES OF OUTPUT POINTS ARE TO 
BE DISPLAYED 

C SOME VARIABLES INTERNALLY USED ARE 

C ZA = DIVIDED DIFFERENCE OF Z WITH RESPECT TO X 

C ZB = DIVIDED DIFFERENCE OF Z WITH RESPECT TO Y 

C ZAB = SECOND ORDER DIVIDED DIFFERENCE OF Z WITH 

c RESPECT TO X AND Y 

C ZX = PARTIAL DERIVATIVE OF Z WITH RESPECT TO X 

C ZY == PARTIAL DERIVATIVE OF Z WITH RESPECT TO Y 

C ZXY = SECOND ORDER PARTIAL DERIVATIVE OF Z WITH 

Cc RESPECT TO X AND Y 

C DECLARATION STATEMENTS 

DIMENSION X(LX)s YCLY), ZCLX,LY), UCNUD, VONV). WC 

DIMENSION ZAC4,2)4 ZB(5)2 ZABC2s3), ZXC2), ZYC2), 


QaNgNMAMaMaMNANaAANANaANaANaAaANANANAAAANA 


NULNV) 
ZKYC2) 


EQUIVALENCE (Z3A2sZAC1))s (Z3A9,ZAC2)), (Z3A4,ZA03)), 


CZ3A5s ZACK), (Z4A2,ZAC5)). (Z4ASB,ZAC6))5 CZ4A4,Z2 
(Z4A55,ZAC8)), (Z4BI1,ZBC1)), (Z4B2,ZB(C2)), (Z24B3,2 
(Z4B4,ZB(C4)), (Z4B5,ZBC5)), (ZA3B2,ZABC1)), 
(ZA4B2,ZABC2)). (ZA3B3.Z2AB(3))s (ZA4B3,ZABC4)), 
(ZA3B4sZAB(S))s (ZA4B4,ZABC6)). (ZX434Z2XC19)5 
(ZX44,ZXC2)), CZY43,7ZYO1)), CZY44,ZYC 20), 


eee HR EH 


€P10,Z2X33), ¢€P11,ZXY33) 

EQUIVALENCE CIXMI1,J3X)s5 CIXML,JY). (DU, DV, DX, DY), 
* CFMX,RMXsFMYsRMY,SWLE)s (W2,WY2,A,00), (W3,VWY3.,B, 
* (WX2,0,02)5 CWX3,D,Q3), (23A2,P02), (Z4A2,P03), 
* (Z4B1,P12), (Z4B2,P13), (Z4B4,P20), (Z4B5, P21), 
* (ZA3B2,P22)5 CZA3B4, P23) 

C PRELIMINARY PROCESSING 

C SETTING OF SOME INPUT PARAMETERS TO LOCAL VARIABLES 


1u@ = IU 

LX@ = LX 

LXMI1 = LX@ - 1 

LXM2 = LXMI - 3} 
LY@ = LY 

LYMI = LY@ - 1} 

LYM2 = LYM! - 1 
MX@ = MX 

MXP1 = MX@ + 1 

MXM! = MX® - 1 

MY@ = MY 


MYP1 = MY@ + 1 
MYM] = MY@ - 1 
NUS = NU 
NV@ = NV 
C ERROR CHECK 
IF (LXM2.LT.@) GO TO 480 
IF CLYM2eLT.8) GO TO 416 
IF (MXMI-LE.@) GO TO 426 
IF (MYMI1.LE-@) GO TO 438 
IF (NUG@+NE«LXMI*MXO+1) GO TO 448 
IF (NVO.NE-LYMI*MY@+1) GO TO 450 
IX = 2 
IF (X€1)-XC2)) 186, 468, 38 
1@ DO 20 1X=3,LX8 . 
IF (XCIX-1)-XCIX)) 26, 468, 478 
28 CONTINUE 
GO TO 58 
38 DO 48 IX=3,LX@ 
IF (XCIX-1)-XCIX)) 4785 466, 48 
48 CONTINUE 
5@ 1Y = 2 
IF (Y¥C1)-¥C2)) 66, 498, 88 
68 DO 78 1Y=3,LY8 
IF CYCIY-1)-YCIY)) 78, 498, SOB 
7@ CONTINUE 
GO TO 168 
88 DO 98 I1Y=3,LY@ 
IF CYCIY=19-YCIY)) 580, 498s 96 
98 CONTINUE 
C COMPUTATION OF THE U ARRAY 
100 FMX = MX@ 
RMX = 1-6/FMX 
KU = 1 
X4 = XC1) 
UC1) = X4 
DO 128 IX22s,Lx@ 
X3 = X4 
X4 = XCIX) 
DU = (X4-X3)*RMX 
DO 118 JX=1,MXM! 
KU = KU + 1 
UCKU) = UCKU-1) + DU 
118 CONTINUE 
KU = KU + 1 
UCKU) = X4 
128 CONTINUE 


AC7T)).5 
BC3)), 


CZXY43,ZXYCO1)), CZXY44,Z2XYC2)), CPOB,2339), CPO1,ZY33), 


Q1), 


0 


COLLECTED ALGORITHMS (cont.) 


C COMPUTATION OF THE V ARRAY 


138 


148 


FMY = MYO 
RMY = 1.60/FMY 
KV = | 
Y4 = YO1) 
Vol) 2 ¥4 
DO 148 l1Y=2,LY@ 
Y3 = Y4 
Y4 = YCTY) 
DV = (Y4-¥3) «RMY 
DO 138 JY=#1,MYM1 
KV = KV + 1 
VCKV) = VCKV-1) + DV 
CONTINUE 
KV = KV + I 
VCKV) = Y4 
CONTINUE 


C MAIN DO-LOOPS 


C ROUTINES TO PICK UP NECESSARY Xz 
C COMPUTE THE ZA, ZB, AND ZAB VALUES, 


JYMX 3 MY@ 
KV8 = @ 
DO 398 1Y=2,LYO 
Iym2 = LY - 2 
IYmM3 = IYM2 - 1 
IYML = IY - LY@ 
IYML1 = IYML + 1 
IX6 = 8 
IF CLYMLeEQ.8) JYMX = MYPI 
JXMX = MX@ 
KUG = @ 
DO 380 IX#1,LX® 
IXMl # IX - 1 
IXML = IX - LX®@ 
IF CIXML.~EQ.@) JXMX = MXP? 


C WHEN NECESSARY 
C PRELIMINARY WHEN IX-EQ-1 


IF CIXMI.NE.@) GO TO 158 
Y3 =z YCIY-1) 

Y4 2 YCTY) 

B3 = 168/¢Y4-Y3) 

A3IS0 = B3*B3 


IF CIYM2.GT.@) B2 = 1.0/¢€Y3-YC1Y-2)) 
IF CIYM3.GT.@) Bl = 1.0/¢€YC1Y¥=2)-YCL1Y-3)) 
IF CLIYML-LT.@) B4 = 1.B/CYCIY+1)-Ya4) 
IF CIYMLI.LT.8) BS = Le O/CYCIY+2)-YCIY+1>) 


GO TO 188 


C TO SAVE THE OLD VALUES 


i5@ 


16¢@ 


178 


C TO COMPUTE THE ZA, ZB, AND ZAB VALUES AND 


Z3A2 = Z3A3 
Z4A2 = Z4A3 
X35 X4 

233 = 243 
Z3B3 = Z4B3 
AJ = Ad 

AISG = AI*AZ 
Z3A3 = Z3A4 
Z4A3 = Z4A4 
ZAIB2 = ZAAB2 
ZA3B3 = ZA4B3 
ZA3B4 = ZA4BA 
X4 = XS 

Z43 = 253 
Z4Bi = Z5BtI 
Z4B2 = Z5B2 
Z4B3 = Z5B3 
Z4B4 2 Z5B4 
Z4B5 2 Z5B5 
A4 = A5 

Z3A4 = Z3A5 
Z4A4 = Z4A5 
ZASB2 = ZASB2 
ZA4B3 = ZASB3 
ZA4B4 = ZASB4 
X5 = X6 

Z§3 ® 263 

ZS4 = 264 
ZSB1 = Z6BI 
ZSB2 = Z6B2 
Z5B3 = Z6B3 
ZSB4 = Z6B4 
Z5BS = Z6BS 


C TO ESTIMATE THE ZB VALUES 
C WHEN CIY.LE.3).OR-C1Y+GEeLY=1) 


188 


4198 


IX6 = 1X6 + 1 

IF (1X6.GTeLX@) GO TO 268 
X6 = XCIX6) 

263 = ZCIX6,1Y-1) 

264 = ZC1X6,1Y) 

Z6B3 = (264-263)*B3 

IF (LYM2sEQ-@) GO TO 206 
IF CIYM2-EQ.8) GO TO 198 
262 = ZCUX6,1Y-2) 

Z6B2 = (263-262)*B2 

IF CLIYML-NE.9) GO TO 198 
Z6B4 = Z26B3 + Z6B3 - Z6B2 
GO TO 218 

Z65 = ZCIX6,1Y+!) 

Z6B4 = (265-264) *B4 

IF CIYM2.NE.@) GO TO 216 
Z6B2 = 26B3 + Z6B3 - Z6B4 
GO TO 218 

Z6B2 = Z6B3 

Z6B4 3 Z26B3 

IF CIYM3-LE.-@) GO TO 228 
Z6BL = (Z262=-ZC1X6s,1Y-3))*B! 
GO TO 236 

Z6Bi = Z6B2 + Z6B2 - Z6B3 
IF CIYML1.GE.@) GO TO 248 
ZOBS = (ZCIK6s,1Y+2)-Z265)*BS 
GO TO 25@ 

Z6BS = Z6B4 + Z6B4 - Z6B3 
IF CIX6.EQ-1) GO TO 178 


Ys AND Z VALUES, 


AND TO ESTIMATE THEM 


AS = 1.8/¢(X6-X5) 

Z3AS = (263-253) *A5 
Z4AS = (264-254) ¥#A5 
ZASB2 = (Z6B2-Z25B2)*A5 
ZASB3 = (Z6B3-Z5B3)*A5 
ZASB4 = (Z6B4-Z5B4) *AS 
IF CIX6-EQ.2) GO TO 168 
GO TO 28a 


C TO ESTIMATE THE ZA AND ZAB VALUES 
C WHEN (1X-GE+LX-1) «AND» (LX-GT. 2) 


262 


aa 


278 


IF ¢€LXM2.EQ.8) GO TO 2708 

Z3AS = Z3A4 + Z3A4 - Z3A3 
Z4A5 = Z4A4 + Z4A4 - Z4A3 

IF CIXML«EQ-8) GO TO 298 
ZASB2 = ZA432 + ZA4B2 - ZA3B2 
ZASB3 = ZA4B3 + ZA4B3 - ZA3B3 
ZASB4 = ZA4BA + ZA4B4 - ZASBA 
GO TO 298 


TO ESTIMATE THE ZA AND ZAB VALUES 
WHEN (1X-+GE-«LX-1) + AND. (LX+EQ.2) 


Z3AS = Z3A4 

Z4AS 3 Z4A4 

IF CIXML-EQ+@) GO TO 298 
ZASB2 = ZA4B2 

ZASB3 = ZA4B3 

ZASB4 = ZA4B4 


C TO ESTIMATE THE ZA AND ZAB VALUES 
C WHEN IX.EQ.1 


280 


aaaa 


298 


IF CIXMIeNE-@) GO TO 298 

Z3A3 = Z3A4 + 2Z3A4 - Z3A5 
Z3A2 = Z3A3 + Z3A3 - Z3A4 
Z4A3 = Z4A4 + Z4A4 - Z4AS 
Z4A2 = Z4A3 + Z4AZ = Z4A4 
ZA3Be = ZA4B2 + ZA4B2 - ZASB2 
ZA3B3 = ZA4B3 + ZA4B3 - ZA5B3 
ZA3B4 = ZA4B4 + ZA4BA - ZASB4 
GO TO 3280 


ZX33 = ZX43 
ZX34 = ZX44 
ZY33 = ZY43 
ZY34 = ZY44 
ZXY33 = ZXY43 
ZXY34 = ZXY44 


C NEW COMPUTATION 


388 


31 


328 


358 


DO 35@ JY=1,2 
W2 = ABS(ZACHs SY) “ZAC 3s SY) 
W3 = ABSC(ZAC2sdY) -ZACILLSY)) 
SW = W2 + W3 
IF (SWeEQ.8-8) GO TO 318 
WX2 = We/sSsw 
WX3 = W3/SW 
GO TO 320 
WX2 = 8.5 
WX3 = 8.5 


ZXCSY) = WX2HZAC2,SY) + WXI*ZAC3,SY) 


W2 = ABS(ZBCJSY+3)-ZBCJY+2)) 
W3 = ABSC(ZB(JY+1)+ZBCJY)) 
SW = Wwe + W3 

IF (SWeEQ-8-8) GO TO 339 
WY2 = W2/sw 

WY3 = W3/SW 

GO TO 348 

WY2 = 6.5 

WY3 = 8.5 


ZYCJY) = WY2*ZBCJY41) + WY3*ZBCUY+2) 
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NUMERICAL DIFFERENTIATION an TO DETERMINE PARTIAL 
DERIVATIVES ZX, ZY. AND ZXY AS WEIGHTED MEANS OF DIVIDED 
DIFFERENCES ZA, ZBs AND ZABs RESPECTIVELY 

TO SAVE THE OLD VALUES WHEN IX.NE.1 


ZXYCSY) = WY2eCWXOHZABC 1, SY +WX3*ZABCO,SY)) + 


WY 3% ( WX2KZABC 1, JY +1) 4WX3*ZABC2,JY+1)) 


CONTINUE 
IF CIXM1.-EQ.@) GO TO 388 


ZX3B3 = (€ZX34-Z2X33) *B3 
ZX4B3 = (ZX44-2X43) *B3 
ZY3A3 = (ZY43-ZY33) ¥AI 
ZY4A3 =® CZY44"ZY34)*AI 


A ® ZA3B3 - ZX3B3 - ZY3A3 + 
B = ZX4B3 - ZX3B3 - ZXY43 + ZXY33 
C = ZY4A3 - ZY3A3 = ZXY34 + 
D = ZXY44 - ZXY43 = ZXY34 + 


Es=A+A-B-C 
PR2 = (€2.0*(Z3B3-Z2Y33)+Z3B3-ZYI4) «B3 
PO3 = (-2.OeZ9B3+ZY344ZY33) *B3S0 
P12 = (2. 0*(ZX3B3-ZXY33)+ZX3B3- ZXY34)*BS 
P13 = (€=2.O0*ZX3B3+ZXY34+ZXY33) «BIS 
P28 = (2.8*(Z3A3-2X33)+Z3A3-ZX43) *A3 
P21 © (2. Oe CZYSAS-“ZXY33) +ZY IAI“ ZKY43) €AB 
P22 = (3. 6*CAtE)+D) *A3*B3 
P23 = (€-3+40eE-B-D)*A3*B3SQ 
PID = (-26OeZ3A34+ZX434Z2X33) *A3SO 
P3L = (=2.O*ZY3A3+ZXY434+ZXY33)4#AI9SO 
P32 = (-+3.0eE-C-D)*B3*AISGO 
P33 = (D+E+E) *A3SQ*B3S0 
C COMPUTATION OF THE POLYNOMIAL 
DO 378 JY=1lsJYMX 
KV = KV@ + JY 
DY = VCKV) - Y3 
QB = POO + DY*(POL+DY*(PO2+D/4PG3)) 
Ql = PIG@ + DY*CPLI+DY*(PI2+DYeP1i3)>) 
Q2 = P20 + DY*CP21+DY*( P22+D/*P23) ) 
Q3 = P3G + DY*¥CP31+DY*¢P324+DV*PI3)> 
DO 368 JX*1,JXMX 


ZXY33 


ZXY33 
ZXY33 


KU = KU@ + JX 


DX = UCKU) - X3 
WCKU,KV) = O08 + DX*(Q1+DX*(Q2+DX*"0Q3)) 


360 CONTINUE 
378 | CONTINUE 
KU® = KU +.MX@ 
388 CONTINUE 
KV8 = KV@ + MYG 


398 CONTINUE 


DETERMINATION OF THE COEFFICIENTS OF THE POLYNOMIAL 


0 
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C NORMAL EXIT 5@@ WRITE (1U8,99989) 
RETURN 518 WRITE (10U6,99988) IY. YCIY) 
C ERROR EXIT 52@ WRITE (1U@,99987) LX@s MX@. NU@, LYS, MY@, NV@ 
400 WRITE (1U8,99999) RETURN 
GO TO 526 C FORMAT STATEMENTS 
418 WRITE (10@,99998) 99999 FORMATCIX/23H ***% LX = 1 OR LESS./) 
GO TO 528 99998 FORMATCIX/23H +e LY = 1 OR LESS./) 
420 WRITE (1U8,99997) 99997 FORMATCIX/23H #4 MX = 1 OR LESS./) 
GO To S20 99996 FORMATCIX/23H 4 && MY = 1 OR LESS./) 
438 WRITE (1U8,99996) 99995 FORMATCIX/26H ee IMPROPER NU VALUE-/) 
GO TO 526 99994 FORMATCIX/26H *x% IMPROPER NV VALUE. /) 
44@ WRITE (1U8,99995) 99993 FORMATCIX/27H *x* IDENTICAL X VALUES«/) 
GO TO 5208 ; 99992 FORMATCIX/33H *e* X VALUES OUT OF SEQUENCE./) 
450 WRITE (1U@s99994) : 99991 FORMATC 7H IX =, 16, 10X, 7HXCIX) =, £123) 
GO TO S520 99998 FORMATCIX/27KH «ae IDENTICAL Y VALUES«/) 
460 WRITE (1U@,99993) 99989 FORMATCIX/33H xem Y VALUES OUT OF SEQUENCE./) 
GO TO 48@ 99988 FORMAT( 7H TY =, 16, !10X, THYCIY) =. E1263) 
478 WRITE (1U@,99992) 99987 FORMAT( 7H LX 3, 16, 1@X, 4HMX =, I6, I18X, 4HNU #, I16/ 
480 WRITE (1U8,99991) IX,» XCIX) * 7TH LY =, 16, 1@X, 4HMY =, 16, 10X, 4HNV =, 16/6H ERROR, 
GO TO 526 * 30H DETECTED IN ROUTINE SFCFIT) 
498 WRITE (100.9999) : END 
GO TO 518 
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Bivariate Interpolation and Smooth Surface Fitting Based on _ Local 
Procedures [E2] 


[H. Akima, Comm. ACM 17, 1 (Jan. 1974), 26-31] 


M.R. Anderson [Recd 14 February 1978 and 5 April 1978] 
Department of Physics, University of Michigan, Physics-Astronomy Building, 
Ann Arbor, MI 48109 


Subroutine SFCFIT contains a violation of the Fortran Standard [1] similar to 
that observed [2] in a previous contribution by the same author [3]. Section 
7.1.2.8 states that the initial value of a DO statement must be less than or equal 
to the value represented by the terminal parameter. When LX or LY are input as 
2, DO statements labeled 10, 30, 60, and 80 violate this rule. Error conditions of 


IDENTICAL X VALUES, X VALUES OUT OF SEQUENCE, 
IDENTICAL Y VALUES, Y VALUES OUT OF SEQUENCE 


may improperly result from comparisons of array variables, subscripts for which 
are incorrectly generated, within these DO loops. 
Subroutine SFCFIT may be corrected to avoid the above violation by changing 
the initial parameters in DO statements labeled 10, 30, 60, and 80 from 3 to 2. 
As altered, these carefully written subroutines have been used extensively and 
successfully. 


REFERENCES 
1. ANSI Standard Fortran, X3.9-1966. Amer. Nat. Stand. Inst., New York, 1966. 
2. ANDERSON, M.R. Remark on Algorithm 433. ACM Trans. Math. Software 2, 2 (June 1976), 208. 


3. AKIMA, H. Algorithm 433. Interpolation and smooth curve fitting based on local procedures. 
Comm. ACM 15, 10 (Oct. 1972), 914-918. 
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Algorithm 475 


Visible Surface Plotting 
Program [J6] 


Thomas Wright [Recd. 18 Apr. 1972, 13 Oct. 1972] 
Computing Facility, National Center for Atmospheric 
Research, Boulder, CO 80302 


National Center for Atmospheric Research is sponsored by the 
National Science Foundation 


Key Words and Phrases: hidden line problem, computer 
graphics, contour surface 

CR Categories: 3.65, 4.41, 8.2 

Language: Fortran 


[This program is not in ANSI Fortran. Nonstandard features-are 
noted in the text. A demonstration driver is included to illustrate 
use of the subroutines. I/O unit 9 is used by this driver.—LDF-.] 


Description 

This package of three routines produces a perspective picture 
of an arbitrary object or group of objects with the hidden parts not 
drawn. The objects are assumed to be stored in the format described 
below, a format which was chosen to facilitate the display of func- 
tions of three variables (Figure 1) or output from three-dimensional 
computer simulations (Figure 2). The basic method is to contour 
cuts through the array, starting with a cut nearest the observer. The 
algorithm leaves out the hidden parts of the contours by suppressing 
lines enclosed within lines produced while processing preceding cuts. 
The technique is described in detail in {2}. 

The object is defined in a three-dimensional array by setting 
words to one where the object is, and to zero where it is not. That 
is, the position in the array corresponds to a position in three-space, 
and the value of the array tells whether any object is present at that 
position or not. Because a large array is needed to define objects 
with good resolution, only a part of the array is passed to the 
package with each call. 

There are three subroutines in the package. INIT3D is called 
at the beginning of a picture. This call can be skipped sometimes 
if certain criteria are met and certain precautions are taken. Sce the 
comment lines for details. SETORG (which has an entry point 
PERSPC) does three-space to two-space perspective transforma- 
tions. It is called by JN/T3D and need not be called by the user. 
The mathematical method for the three-space to two-space trans- 
formation is due to Kubert, Szabo, and Giulieri [1]. DANDR 
(draw and remember) is called successively to process different 
parts of the three-dimensional array. For example, in Figure 3, the 
nearer plane would be processed in the first call to DANDR, while 
the further plane would be processed in a subsequent call. A sample 
program is provided with the algorithm to illustrate this point. 

Although this package was developed using NCAR’s CDC 
machines with locally written systems and compilers, implementa- 
tion on different machines or systems should not be too difficult 
regardless of the plotter. The algorithm has been tested on the 
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Fig. 1. Four contour surfaces of the wave function of a 3-P 
electron in a one electron atom: 50 X 50 X 50 object cube, 100 X 
100 screen model. 
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of surface and 
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P = 3.0E-05 


Har 


P = 7.08-05 


Fig. 2. Output from a three-dimensional cloud model: 100 X 
100 X 60 object cube, 200 X 200 screen model. 


Fig. 3. Processing different parts of a three-dimensional array. 
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Minnesota Fortran compiler (MNF), and when the following items References 
are taken care of, should be portable. 1. Kubert, B., Szabo, J., and Giulieri, S. The perspective 
There is a PROGRAM card in the demonstration program representation of functions of two variables. J. ACM 15, 2 (Apr. 


There is an ENTRY statement in SETORG. ENTRY statements ea stern hidden-li er rene 

are nonstandard, but are generally portable. It could be eliminated, ne fe Ae 07a SG. Comsat, Sinidlanon 
but the package would run longer. There are two machine-de- Conf., pp. 261-267 : : 

pendent variables used and described in DANDR. There is one ee ; 


system routine, LINE, called once and described in DANDR, 


which must be implemented or simulated to use this package. In Algorithm 
three statements (which are marked) in DANDR, .OR. and .AND. PREGKAM ACMTEST 
are used for’ masking operations. with integer variables. Some Ce ee ae il din ieee aes gash ee at 
compilers may not produce the desired code, so references to Pane Gitte oe 
machine language functions may have to be substituted. There SC1) = 0. 
is a nonstandard but common form of the DATA statement in Sa 
DANDR. Functions which are assumed available are SQRT, eee Sele coer 
ACOS, and SIN. EYECL) = 250+ 
Figures 4 and 5 are referred to in the listing as the first pic- Se eset to 
ture and the second picture. 4 Vane eaten y es 30s 80+ &8Us STl» 3» 160s IS2, 9 9) 


C CKEATE AND PLOT TEST OBJECT 
DO 50 I=1,60 


Fig. 4. The first picture produced by the test program. A = CI-S0)««2 
DO 40 J=1,80 
C = €J-25)**2 
D = IABSCJ-63) + IABSCI-25) 
be 30 K=1,5,80 
C FLO@K 
IF (Kekuel) GU 16 10 
C BALL 
IF CSQRTCA+C+CFLGATCK)-25¢)**2)-LE25-) GO TO 10 
C POINT 
IF €D«GT-FLOATCB8O0-K )*-1K75) GO TEC 20 
10 1OBUCJ.K) = I 
GE TO 30 
20 1OBJCUsK) = O 
30 CONTINUE 


40 CONTINUE 
CALL DANDK(B80, 80» STIs 3» 160s 160s IS2s 9» ds» LQBU, 
* 80) 
50 CONTINUE 
C ADVANCE T@ THE NEAT FRAME™ 
CALL FRAME 
A SECOND PICTURE WILL NOW BE CALLED USING THE SAME SIZE 
AKKAYS AND EYE POSITION. TIHIS MEANS THE CALL 10 INI13D, 
THE BIGGEST TIME CONSUMEK, CAN BE SKIPPED IF THE FOLLOWING 
FOUn LINES AKE INCLUDED. 
KEWIND 9 
bo 70 [=153 
bO 60 J215160 
IsecI,J)» = 0 
60 CENTINUE 
70 CONTINUE 
C THIS PICTUKE WILL BE THE T=4 CONTCUK SUKFACE OF 
C T=1L/SSORTCUKUF VEV te EW +O 6 SHV) EKQAZSORTCURUFV EV) © 
DO 120 I1=1,80 
U = C40-.5-FLOATCIDI/79« 
UU = U*kU 
OG 110 J=i,90 
Vo = CFLOEATCI)-4005)/779. 
VV = VV 
A = 14/750KTCUU+VV) 
bDG 100 K=stss0 
C THE FOLLOWING CAKD ADDS AXES» 
IF C1k&UeEQe1 eOre T#KeEUl1 «OK e J¥KekUet)? GO TH 40 
Wo = CFLOATCK)-406¢5)/79 6 
LF Cle /SURKTCUUFVV th kl +0 SHV) RESRKACLE 2 Ge) CO 10 90 


anaaa 


80 TOBUCJsK) = 1 
GQ TO 100 

90 I@BUCJsK) = O 

100 CONTINUE 


1190 CONTINUE 
CALL DANDK(80s 80s STl» 3s 160» 160+ [52s 9% Ss» IBBUs 
* 80) 
120 CONTINUE 
C FLUSH PL8T BUFFER 
CALL FRAME 
ST@P 
END 


SUBROUTINE INIT3DCEYEs NU» NVs Nhs ST1s LAs N¥s [525 LUs 
* S) 
DIMENSI@N EYEC3)s STICNVeNWs2)5 TS2CLXoNY)» SC 4) 
BY TH@MAS WRIGHT 
COMPUTING FACILITY 
THE NATIONAL CENTER FORK ATMOSPHERIC KESEAKCH 
BOULDEKs COL@KADG 80302 
NCAR IS SPONSORED BY THE NATIGNAL SCIENCE FOUNDATION. 
THE METHOD IS DESCNIBED IN DETAIL IN - A ONE-PASS HIDDEN- 
LINE REMOVER FOK COMPUTER DRAWN THKEE-SPACE OBJECTS+ PROC 
1972 SUMMEK C@MPUTEK SIMULATION CONFERKENCEs 261-2675 1972. 
THIS VERSION IS FORK USE ON CDC 6000 BK 7000 COMPUTEKS. 
THIS PACKAGE OF KCUTINES PLOTS 3-DIMENSIONAL OBJECTS WITH 
HIDDEN PARTS NOT SHOWN. GBJECTS AKE STOKED IN AN AKKAY> 
WITH THE POSITION IN 1HE ARRAY COnKKESPENDING TG A LOCATION 
IN 3-SPACE AND THE VALUE OF THE AKKAY ELEMENT TELLING IF 
ANY OBJECT IS PKESENT AT THE LOCATION. 
INIT3D IS AN INITIALIZATION ROUTINE FOR THIS PACKAGE. IT 
IS CALLED» THEN A SEQUENCE OF CALLS ARE MADE Te DANDrK TO 
PR@DUCE A PICTURE. 
EYE AN ARRAY 3 LONG CONTAINING THE Us Vs AND & COORDI- 
NATES OF THE EYE POSITION. OBJECTS AKE CONSIDERED 
TO BE IN A BOX WITH 2 EXTREME CORNERS AT Clelsl) AND 
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CNUsNVsNW). THE EYE P@SITI@N MUST HAVE POSITIVE 
COORDINATES AWAY FROM THE CO@KDINATE PLANES U=0.s 
V=0s AND WO. VHILE GAINING EXPEKIENCE WITH THE 
PACKAGEs USE EYECL)=5*NUs EYEC2)=4eNVe EYEC 3) =34Nbe 
NU U BIWECTION LENGTH @F THE BOX CONTAINING THE @BJECTS 
NV V DIRECTION LENGTH OF THE BOA CONTAINING THE OBJECTS 
NW Ww DEKECTI@N LENGTH @F THE BOX CONTAINING THE OBJECTS 
STl A SCRATCH ARRAY AT LEAST NV¥*NW*2 WORDS -LONG. 

LX FIRST DIMENSION QF A SCKATCH AkKAYs I1S25 USED BY THE 
PACKAGE FORK REMEMBERING WHERE IT SHOULD NOT DikAwe 
LX=1+NX/NBPWe SEE DANDK COMMENTS FOK NX AND NBPhe 


NY SECOND DIMENSION @F I1S2. SEE DANDK COMMENTS. 
A SCKATCH AKRAY AT LEAST LA*NY WORUVS LONG. 

TU UNIT NUMBER @F SCRATCH FILE FOK THE PACKAGE. STI 
WILL BE WRITTEN NU TIMES ON THIS FILE. 

s AN ARRAY 4 LONG WHICH CONTAINS THE COORDINATES QF 


THE AKEA WHERE THE PICTURE IS T@ BE DRAWN. THAT [S» 
ALL PLOTTING COORDINATES GENERATED WILL 8E BOUNDED 
AS FO@LLO@WS~-- X COORDINATES WILL BE BETWEEN $SC(1) AND 
52), Y CO@BKDINATES WILL BE BETWEEN $¢3) AND SC 4)e 
TO PREVENT DISTORTIONs HAVE S(2)-SC1)2504)-SC3)e 

IF SEVERAL PICTURES ARE T@ BE DRAWN WITH THE SAME SIZE 
ARRAYS AND FYE PO@SITI@N AND THE USEK wEWINDS LU AND FILLS 
1S2 WITH ZERXGESs INIT3D NEED NOT BE CALLED FOK OTHEK THAN 
THE FIRST PICTURE. 

SET UP TRANSFORMATION KOUTINE FOK THIS LINE OF SIGHT. 


AAANANARANAANRAAANMRANAAAANAANAANAAANA 
uw 
ie) 


uo = NU 
Vo= NV 
w= NW 


CALL SET@KGCU*.ds Vee5s WHeSs EYEC IL)» EYEC2)5 EYEC3)) 
C FIND EXTREMES IN [RANSFGRMED SPACE. 
CALL PERSPCCles Les We Ds YT» Dd 
CALL PEKSPCCUs, Vs les Ds YBs D) 
CALL PERSPCCUs Les Les Xi» De Dd) 
CALL PERSPCCles Va bes XR» Ds D) 
C ADJUST EXTREMES TO PREVENT DISTORTION WHEN GOING FROM 
C TRANSF@RMED SPACE TG PLOTTER SPACE. 
DIF = (XR-XL-YTHYB)*eS 
IF CDIF) 105 30s 20 
10 XL = XL + DIF 
Xx = XR - DIF 
G@ TG 30 
20 YB = YB - DIF 
YT = YT + DIF 
30 REWIND [U 
C FIND THE PLOTTEK COORDINATES OF THE 3-SPACE LATTICE POINTS 


Cl = «9#CSC2]I-SCLIISCAR=XL)Y 
Ca = «O5*CSC2)-SC1)) + SCI) 
C3 = «9¥*CSC4)-SC3II/0YT-YB) 
C4 = 605*0504)-563)) * $63) 
D@ 60 I=1sNU 

U=NU + to - I 

D@ SO JFlsaNv 

ve J 


DO 40 K=lsNW 
CALL PERSPCCUs Vs FL@ATCK)» Xs Ys Dd 
STICJsKe1) = CL*CX<XL)D + C2 
ST1CJsKs2) = C3*CY-YB) + CA 
40 CONTINUE 
50 CONTINUE 
C WRITE THEM ON UNIT IU. 
wRITE CLU) STI 
60 CONTINUE 
REWIND fru 
C ZER® OUT ARRAY WHEKE VISIBILITY 1S REMEMBERED. 


0@ 80 J=1>NY 
D@ 70 I=1s5LX 
Is2cl»J) = 0 
70 CONTINUE 
B80 CONTINUE 
RETURN 
END 


SUBROUTINE SETORGOXs Ysa Ze XTs YTs ZT) 

THIS ROUTINE IMPLEMENTS THE 3-SPACE T@ 2-SPACE TRANSFOR- 

MATION BY KUBER»s SZAB@ AND GIULIENIs, THE PENXSPECTIVE 

KEPRESENTATION OF FUNCTIONS @F TWO VARIABLES. J+ ACM 15, 

2s 193-20451968. 

SETORG AKGUMENTS 

oe a4 ARE THE 3-SPACE CO@KDINATES OF THE INTERSECTION 
OF THE LINE @F SIGHT AND THE IMAGE PLANE- THIS 
POINT CAN BE THOUGHT OF AS THE P@OINI LOOKED AT. 

XTsYTsZT ARE THE 3-SPACE COORDINATES OF THE EYE POSITION. 

CG PERSPC AKGUMENTS 

C Ks Yo" ARE THE 3-SPACE CO@@KDINATES OF A POINT TO BE 

C TRANSFORMED. 


AaAaAaANaNAaAa 
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C XTsYT THE RESULTS OF THE 3-SPACE T@ 2-SPACE TRANSFOR- 
C MATION. 
C ZT NOT USED. 


C STORE THE PANXAMETENS @F THE SET@RG CALL FO@K USE WHEN 
C PERKSPC IS CALLED. 


AX = K 
AY = Y 
AZ = 2Z 
EX = XT 
EY = YT 
EZ = ZT 


C AS MUCH COMPUTATION AS POSSIBLE 15S DONE DURING EXECUTION 
C OF SET@RG SINCE PERSPC IS CALLED THOUSANDS OF TIMES FOK 


C EACH CALL TO SETORG. 


DX = AX - EX 
DY = AY - EY 
DZ = AZ - EZ 


D = SORTCDX*DX+DY*DY+DZ*DZ) 
C@SAL = DA/D 
COSRE = DY/D 
COSGA = DZ/v 
AL = AC@SCCOSAL? 
BE ACOS(COSBE) 
GA ACOS(COSGA>) 
SINGA = SINCGA) 
C THE 3-SPACE P@INT LOOKED AT IS TRANSFORMED INIO@ €0s0) OF 


Hon fi 


aagaan 
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THE 2-SPACE. THE 3-SPACE Z AXIS IS TRANSFORMED INTO THE 
2-SPACE Y AXIS-e IF THE LINE @F SIGHT I5 CLOSE TO PARALLEL 
T@ THE 3-SPACE Z AXIS» THE 3-SPACE Y AAIS IS CHOSEN CIN- 
STEAD OF THE 3-SPACE Z AXIS) T® BE TRANSFO@RMED INTC THE 
2-SPACE Y AXIS. 
IF CSINGA-LT-.0-0001) G@ TO 10 
Rk = 1e/SINGA 
ASSIGN 20 TA JUMP 
RETURN 
10 SINBE = SINCBE) 
K = 1e/SINBE 
ASSIGN 30 TO JUMP 
RETURN 
Seto ENTRY PER SPC CIO OCI IO IK fob a OK 
ENTRY PEKSPC 
Q = D/C CX-EX) *COSAL+CY-EY) *COSBE+(2Z-EZ) *COSGA) 
GO TO JUMP, (20530) 
20 KT = CCEX+O%(X-EX)-AX) #COSBE~CEY +O CY EY) -AY) *COSALD €K 
YT = CEZ+O*CZ-EZ)-AZ) €R 
RETURN 
30 XT = CCEZ+Q%(Z-EZ)- AZ) *COSAL = CEX+Q#CX- EX) -AX) *COSGA) 4K 
YT = CEY+Q*CY-EY)-AY) &Kx 
RETURN 
END 


SUBROUTINE DANDRCNVs Nes STL, LXs NXs NY¥s IS2s5 Lbs ds 
* I@BJUSs MV) 

DIMENSI@N STICNV»NWs2)2 I[S2CLXsNY9> 5SC4)s LOBUSCMVsNW) 
THIS ROUTINE IS CALLED NU TIMESs EACH CALL PROCESSING THE 
PAKT @F THE PICTURE AT U=NU+1-I WHERE I 1S THE NUMBER OF 
THE CALL T@ DANDRe THAT ISs THE PART OF THE PICTUKE AT 
U=NU IS PROCESSED DURING THE FIRST CALL» THE PART OF THE 
PICTURE AT USNU-1 IS PK@CESSED DURING THE SECOND CALLs AND 
SQ ON UNTIL THE PART OF THE PICTURE AT U=1 IS PROCESSED 
DURING THE LAST CALL. 


NV SEE INI T3D COMMENTS- 

NW SEE INI T3D COMMENTS. 

ST SEE INIT3D C@MMENTS. 

LX THE NUMBER OF WORDS NEEDED T@ HOLD NX BITS- ALSO, 
THE FIRST DIMENSION OF IS2. 

NX NUMBER OF CELLS IN THE X DIKECTION OF A MODEL OF THE 


IMAGE PLANE+ A SILHQ@QUETTE OF THE PAKTS OF THE PIC- 
TURE PROCESSED 50 FAR IS STOKED IN THIS MODEL. LINES 
TO BE DRAWN AKE TESTED FO@K VISIBILITY. BY EXAMINING 
THE SILHOUETTE. LINES IN THE SILHQUETTE ARE HIDDEN. 
LINES OUT OF THE SILHQUETTE ARE VISIBLE+« THE S@LU- 
TION IS 4PPROXIMATE BECAUSE THE SILHQUETTE IS NOT 
FORMED EXACTLY- SEE IS¢ COMMENT BELOW. 

NY NUMBER OF CELLS IN THE Y DIKECTI@N OF THE MODEL OF 
THE IMAGE PLANE-+ ALSO THE SECOND DIMENSION OF IS2. 

1s2 AN ARRAY T@ HOLD THE IMAGE PLANE M@DEL- IT IS 
DIMENSI@NED LX BY NY»s THE MODEL IS NX BY NY AND 
PACKED DENSELY. IF HIDDEN LINES ARE DRAWN» DECKEASE 
NX AND NY CAND LX IF POSSIBLE). IF VISIBLE LINES 
AKE LEFT OUT @F THE PICTUKEs INCKEASE NX AND NY CAND 
LX IF NEED BE). AS A GUIDEs SOME EXAMPLES SHOWING 
SUCCESSFUL CH@ICES ARE LISTED : 

GIVEN NU NV Nh RESULTING NX NY FROM TESTING 


100 100 60 200 200 

60 60 60 110 110 

40 40 40 75 #75 
IU SEE INIET3D COMMENTS. 


I1@BUS A NV BY Nw AkRKAY ChHITH ACTUAL FIRST DIMENSION MV IN 
THE CALLING PKOGKAM) DESCRIBING THE GBJECT- IF THIS 
IS CALL NUMBER I TO DANDk» THE PAKT OF THE PICTURE 
AT UFNU+i-I [LS TO. BE PKOCESSED. I6BJS DEFINES THE 
@BJECTS TO BE DkAWN IN THE FOLL@WING MANNER -- 
I@BJSSCJsK=) IF ANY OBJECT CONTAINS THE POINT 
CNUFI-EoJdeK) AND IQOBUSCJsK)20 G@THERXWISE. 

MV ACTUAL FIRST DIMENSION OF I@BJUS IN THE CALLING 
PROGHAM. 


Cee KKKKKKKEK MACHINE DEPENDANT CONSTANTS 4642 40k HOR a AOR tk & 


Cc 
c 
c 


aa 


an 


NBPW NUMBER OF BITS PER WORD 
MASK AN ARRAY NBPW LONG. MASKCI)=2eaCI-1)5 Lels2s0ee,NBPW 
CDC 6000 OR 7000 VERSION 
DIMENSION MASKC 60) 
DATA NBPW/607 
DATA MASK/1Bs 28> 4Bs 108s 20Bs 408, 10082200B» 400Bs 1 000B, 
2000B» 40008, 100008» 20000B8, 400008» 1000008» 200000B, 
A00000B>» 1000000Bs 20000008; 4000000B, 10000000B, 
200000008» 40000000Bs 1000000008. 200000000B,» 400000000B,s 
10000000008. 20000000008,» 4000000000B, 10000000000B, 
200000000008» 40000000000B> 100000000000B, 
200000000000B>» 4000000000008, 1000000000000B, 
2000000000000B» 4000000000000B; 100000000000008> 
20000000000000B.» 40000000000000B» 100000000000000B,5 
200000000000000B> 400000000000000B;» 10000000000000008, 
2000000000000000B, 40000000000000008, 
100000000000000008B, 20000000000000000B, 
40000000000000000B, 100000000000000000B,. 
200000000000000000Bs 400000000000000000B> 
1000000000000000000As 2000000000000000000B. 
40000000Q0000000000B, 10000000000000000000B, 
20000000000000000000B, 40000000000000000000B/ 
ASSIGN 120 TO IRET 
RX AND RY ARE USED TQ MAP PLOTTER COBRDINATES INT® THE 
IMAGE PLANE MODEL. 
RX = CFLOATCNX)-1-2970SC2)9-S01)) 
RY = CFLOATCNY)-1-2970S04)-503)) 
READ THE RELATIVE PL@TTER C@@RDINATES OF THE LATTICE 
POINTS FROM UNIT IU. 
READ (IU) STI 
DX» DY AND DZ ARE USED TO FIND REQUIRED CO@RDINATES OF 
NON-LATTICE POINTS~ 
NvD2 3 NVZ2 
NWD2 = NW72 
DX = CSTICNVsNWD2s19-STIC 1» NWD2s 12) *e SSC FLOATCNV)-1-) 
DY = CSTECLSNWD2s2)-ST1CNVsNWD22 2) )*eS/CFLOATCNV) <1) 
DZ = CSTICNVD2sNwWs2)-STICNVD2s 152) #2 S/CPFLOATCNW)-1-) 
SLOPE 18 USED TO DEFORM THE IMAGE PLANE MODEL S@ THAT 
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LINES @F CONSTANT Y OF THE IMAGE’ MODEL HAVE THE SAME 
SLOPE AS LINES OF CONSTANT U AND W IN THE PICTURE. THIS 
IMPR@VES THE PICTURE. 
SL@PE = DY/DX Ne 
THE FOLLOWING L@@PS THROUGH STATEMENT 130 GENERATE THE «5 
CONTGUR LINES IN 2-SPACE FOK THE ARRAY IOBUS CWHICH CON- 
TAINS ONLY ZERGES AND ONES)» TESTS THE LINES FOK VISIBIL- 
ITY, AND CALLS A KOUTINE TG PLOT THE VISIBLE LINES. 
D@ 130 1=2sNV 
JUMP = IQBJSCI-151)"8 + IOBUSCI,1)*4 + 1 
D@ 120 J=2sNW 
X = STICIsJ91) 
Y = ST's Js2) 
DECIDE WHICH @F THE 16 POSSIBILITIES THIS IS. 


JUMP = CJUMP-1)74 + IOBUSCI-155)*8 + IOBUSCI, J) «4 + 1 


GB TO €120%205 40+ 50+ 70» 80s 302 190» 100s 10+ 80> 70» 50s 40s 
* 205120)» JUMP 


GOING TO 10 MEANS JUMP=10 WHICH MEANS @NLY THE LOWER-KIGHT 
AND UPPER-LEFT ELEMENTS @F THIS CELL ARE SET TG 1- 
TWO LINES SHOULD BE DRAWNs A DIAGONAL CONNECTING THE 
MIDDLE @F THE BOTTOM TO THE MIDDLE OF THE RIGHT SIDE OF 
THE CELL CLOWER=RIGHT LINE)» AND A DIAGONAL CONNECTING THE 
MIDDLE @F THE LEFT SIDE TO THE MIDDLE OF THE TOP CUPPER- 
LEFT LINE) OF THE CELL. 
10 ASSIGN 90 T@ InET 
LOWER-RIGHT LINE 
20 X13 X 
Yis Y - D2 
X2 = K + DX 
Y2 = Y = bY 
GQ Te 1t0 
LOWER@-LEFT AND UPPER-RIGHT 
30 ASSIGN 60 T@ IRET 
LOWER*LEFT 
40 Xl = XK 
Yt2 Y ~ DZ 
X2 = X = DX 
Ye = Y + DY 
G@ Te 110 
HORIZONTAL 
50 Xl = X + Dx 
Yl = y - by 
X2 = X - DX 
y2 = Y¥ + DY 
G@ Te 110 
UPPEK-LEFT 
60 ASSIGN 120 TO IRET 
70 Xl = X + DX 
Ylo= Y¥ - DY 
X2 = K 
Y2 = Y + DZ 
G9 To 110 
VERTICAL. 
80 Xl = X 
Ylo= Y - b2 
X2 = XK 
Y2 = Y + DZ 
GO TO 110 
90 ASSIGN 120 T@ IKET 
UPPEK-LEFT 
100 Xl = X = DX 
Y! = Y + DY 
x2 =X 
Y2 = Y + DZ 
TEST VISIBILITY @F THIS LINE SEGMENT. 
110 IX = CX1-SC1L))*RX 


TY=MODCITFIAXACCY1-SC3)) *KY-SLOPE*FLOATC IA) +NYeNYD 41 

IBIT = MODCIXsNBPW) + 1 

IX = IX/NBPW + 1 
so RK CAND. USED AS A MASKING OPERATCK #440 RRR OK 

IV=1IS2CIXs LY) -ANDeMASKCIBIT) 

IF EITHER END OF THE LINE IS AT A MARKED SPOT @N THE IMAGE 
PLANE M@DEL» THE LINE IS HIDDEN 

IF CIVeNE-O) GO TO IRETs (602902120) 

IX = €X2-SCL) > *Rx 

LYSMODCIFIXC CY2-$€3) KY -SLOPEFLOATCIA) }tNYONY) #1 

IBIT = MO@DCIXsNBPW) + 1 

IX = IX/NBPW + | 
woo CAND). USED Ad A MASKING OPERATOR ORR OR Kt 

IV2LS2C1Xs TY) eAND¢MASKCIBIT) 

IF CIVeNE-O) GO TO IRETs (605905120) 
tt et to totoiofok UNDEFINED EXTERNAL KEFERENCE +4 448 HOR 
SUBROUTINE LINECXt+Y12X22Y2) IS ASSUMED TQ DRAW A LINE 
FROM CX1sY1) TO Cx2esY2) 

CALL LINECXLs Ybs X2s Y2) 

GO TC ITRET» (605905120) 

120 CONTINUE 
130 CONTINUE 
CODE THR@UGH STATEMENT 150 CREATES AN APPROXIMATION OF 
THE SILHQUETTE OF THE PART OF THE PICTUKE JUST DRAWN BY 
MARKING THE IMAGE PLANE MODEL WHERE THE OBJECT @CCURS. 
D@ 150 I=tsNv 
De 140 J=ts Nw 

IF CI@BUSCI4J)»-EQ.0) GO T@ 140 

IX = CSTIECIs de bI-SC1L)#RX + O45 

TwWK = SLOPE*XFLOATCIX) - 0-5 

TYSMODCIFIXCCSTACL ss Js 29-SC3) RKY = TWK DENYS NYD #1 

IBIT = MODCIXsNBPW) + 1 

IX = IX/NBPW + 1 
sO oto 6 OR e USED AS A MASKING @PEKATOR sO ORR re 

TS2CIXs LYIALS2CI Xs LY) OR eMASKCIBIT) 

140 CONTINUE 
150 CONTINUE 
RETURN 
END 
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Remark on Algorithm 475 [J6] 


Visible Surface Plotting Program [Thomas Wright, 
Comm. ACM 17 (Mar. 1974), 152-155] 

Lawrence W. Frederick [Recd 31 May 1974] 

Emory University Computing Center, Uppergate 
House, Emory University, Atlanta, GA 30322 


In the initialization phase a significant savings in time may be 
obtained (as a function of the box dimensions, NU, NV, NW) by 
integrating subroutine SETORG into subroutine INIT3D. The time 
consuming part of INIT3D is the 3-space to 2-space transformation 
done via the call to the PERSPC entry of SETORG. This transfor- 
mation is performed in a regular fashion by triply nested DO loops 
ranging over the box dimensions. By algebraically separating the 
transformation, expressions not depending on inner loop indices 
may be floated to outer loops. This arrangement eliminates a large 
number of redundant operations and the nonstandard ENTRY 
statement, 


Remark on Algorithm 475 [J6| 
Visible Surface Plotting Program [Thomas Wright, 
Comm. ACM 17 (Mar. 1974), 152-155] 


R.G. Mashburn [Recd 9 Dec. 1974] Computer 
Sciences Division at Oak Ridge National Laboratory 
Union Carbide Corporation, Nuclear Division* Oak 
Ridge, TN 37830 


* Prime contractor for the U.S. Energy Research and Develop- 
ment Administration. 


The Visible Surface Plotting Program, Algorithm 475, has been 
modified to run on IBM 360 hardware using the Fortran IV (level 
H) compiler. Using a modifid version of the demonstration pro- 
gram supplied with the algorithm, the two sample plots were suc- 
cessfully produced. The following documents the changes that were 
required to convert the programs from CDC 6000 or 7000 programs 
to IBM 360 programs. In addition to the changes listed below it was, 
of course, necessary to include a FRAME subroutine, a LINE sub- 
routine, and other calls to plotting subroutines which support 
locally available plotting equipment. However, since plotting equip- 
ment and its software support vary from one installation to another, 
only those changes pertinent to the IBM 360 are listed here. 


Demonstration program: 
1. Remove the PROGRAM statement. 
2. Change the first DIMENSION statement from: 


DIMENSION EYE(3), S(4), ST1(80, 80, 2), IS2(3, 160) 
to: 
DIMENSION EYE(3), S(4), ST1(80, 80, 2), IS2(5, 160) 


Note. The comments in the program indicate the first extent 
LX of the array 1S2 is calculated as follows: 


LX = 1 + NX/NBPW 


COLLECTED ALGORITHMS (cont.) 


This is true so long as NX is not an integral multiple of NBPW. 
However, in this case NX is 160 and NBPW (the number of 
bits per word) is 32 for the IBM 360. Thus NX is an integral 
multiple of NBPW, and LX is calculated simply as NX/NBPW 
In general use 


LX = 1+ (NX—1)/NBPW. 
3. Change the call to the /N/T73D subroutine to: 
CALL INIT3D (EYE, 80, 80, 80, ST1, 5, 160, 1S2, 9, S) 


4. Change the two calls to DANDR (one after statement 40, the 
other after statement 110) to: 


CALL DANDR (80, 80, ST1, 5, 160, 160, 1S2, 9, S, IOBJ 
+80) 


5. Change the DO statement following the REWIND 9 state- 
ment from: 


DO 701 = 1,3. to: 


INIT3D subroutine: 

SETORG subroutine: 

1. Because no standard exists for referencing arc cosine, the 
three statements containing references to the arc cosine sub- 
routine were changed from: 


DO 701 = 1,5 


No changes required. 


AL = ACOS(COSAL) to: AL = ARCOS(COSAL) 
BE = ACOS(COSBE) BE = ARCOS(COSBE) 
GA = ACOS(COSBA) GA = ARCOS(COSGA) 


2. Because no standard exists for ENTRY statements and their 
syntax differs among compilers, it was necessary to change the 
ENTRY statement from: 


ENTRY PERSPEC to: 
ENTRY PERSPC(X, Y, Z, XT, YT, ZT) 
DANDR subroutine: 
1. The DIMENSION statement should be changed from: 


DIMENSION MASK (60) to: DIMENSION MASK (32) 


2. The two DATA statements following the DIMENSION 
statement should be changed from: 


DATA NBPW/60/ 
DATA MASK /1B, 2B, 4B, 10B, 20B, 40B, 100B, 200B, 400B, 
1000B, 
* 2000B, 4000B, 10000B, 20000B, etc., 
to: 


DATA NBPW/32/ 

DATA MASK/Z1, Z2, Z4, Z8, Z10, Z20, Z40, Z80, Z100, 
* 7200, Z400, Z800, Z1000, Z2000, Z4000, Z8000, Z10000, 
* Z20000, Z40000, Z80000, Z100000, Z200000, Z400000, 
* 7800000, Z1000000, Z2000000, 24000000, 28000000 
* 710000000, Z20000000, Z40000000, Z80000000 / 


3. The two uses of the .AND. masking operation and the one 
use of the .OR. masking operation were changed to cail 
assembly language function subprograms JAND and JOR 
(programs written locally for the ORNL computing center 
Fortran library) which return an JNTEGER*4 value which is 
the logical AND and logical OR respectively of the two argu- 
ments given them. 

Change the two .AND. statements from: 


IV = IS2(IX, lY). AND.MASK (IBIT) _ to: 
IV = IAND(IS2(IX, IY), MASK (IBIT)) 


{| 


Change the .OR. statement from: 
IS2(X, 1Y) = IS2(IX, TY). OR.MASK (IBIT) to: 
IS20X%, TY) = IOR(IS2(1X, TY), MASK (IBIT)) 


Note. In the original program listing of subroutine DANDR, 
the comment card immediately preceding statement 60 reads: 


C UPPER-LEFT but should say: C UPPER-RIGHT. 
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Remark on Algorithm 475[J6] 


Visible Surface Plotting Program [Thomas Wright, 
Comm. ACM 17 (Mar. 1974), 152-155] 

C.J. Doran [Recd 22 Oct. 1974], Physics Department, 
University of Nottingham, England 


Algorithm 475 has been successfully implemented on a D.G. 
Nova 1220 minicomputer and an I.C.L. 1906A, making substitu- 
tions for the nonstandard features of the original algorithm. 

ENTRY statements are permitted in 1900 Fortran but not by 
Data General. SETORG and PERSPC were therefore written as 
separate subroutines linked by a labelled common area declared as: 


COMMON/CSETORG/JUMP, EX, EY, EZ, AX, AY, AZ, D, R, 
COSBE, COSAL, COSGA 


JUMP being declared as a LOGICAL variable. The assigned 
GO TO statement in PERSPC then becomes 


IF (JUMP) GO TO 30 


with JUMP = . FALSE. replacing the first ASSIGN statement 
in SETORG, and JUMP = . TRUE. replacing the second. 

The DATA statement in DANDR may easily be standardized 
by writing decimal literals, but most compilers will not accept an 
integer 2¥“?”, NBPW should then be redefined as one less than 
the number of bits per word. 

Logical operations between integers may be performed by 
portable Fortran functions JAND and JOR as: 


FUNCTION IAND(I, J) 
LOGICAL BI, BJ 
EQUIVALENCE (BI, II), (BJ, JJ) 
I= I 


JJ =J 

BI = BI . AND. BJ 
IAND = II 
RETURN 

END 


with equivalent coding for JOR. The first two masking operations 
then become: 
IV = IAND(IS2(IX, LY), MASK(IBIT)) 


and the third becomes: 
IS2(1X, 1Y) = IORUS2(1X, lY), MAS K(UIBIT)) 
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Visible Surface Plotting Program [J6] 
[T. Wright, Comm. ACM 17,3 (March 1974), 152-157] 


Gordon E. Bromage [Recd 6 May 1975 and 11 July 1975] 
University of Bradford, West Yorkshire, U.K. 


Author’s present address: S.R.C. Astrophysics Research Division, Culham Laboratory, Abingdon, 
Oxon., U.K. 


This package was modified to remove all the nonstandard features mentioned in 
the algorithm description, together with one that was not pointed out, namely, 
two calls in ACMTEST to the system-dependent graph-plotting routine FRAME. 

The bit-manipulation (masking) operators .AND. and .OR. and the nonstandard 
DATA statement (all in DANDR) were dealt with in the following way. The 
masking operators were replaced by segments IAND and [OR written in an as- 
sembly language. Since the array MASIx is only used in these bit manipulations, 
the data statement assigning values to the elements of MASK was removed from 
DANDR and a corresponding statement inserted into the assembly-language 
segments, so that only the bit number (IBIT) was referenced from DANDR. 
Thus, in DANDR, the statement 


IV = I82(1X,1TY) .AND. MASK(IBIT) 

was replaced twice by the line 

CALL IAND(I82(IX,IY) IBIT,IV) 

and the line 

IS2(IX,TY) = IS2(1X,1Y) .OR. MASK(IBIT) 


was replaced once by 
CALL IOR(IS2(1X, TY) IBIT,IS2(1X,TY) ). 


The package was then tested on an ict 1904A machine (George 3 system), 
which uses a word length of 24 bits. 

For the system-dependent graph-plotting routines, Caleomp routines were used 
in place of LINE and FRAME. In fact, to allow for duplication and editing of 
graphs without having to rerun the package, the plotting routines were separated 
from the main program. Thus the coordinates (X1,Y1,X2,Y2) of the lines to be 
plotted were written onto files in DANDR using the statement 


WRITE (IUX) X1,Y1,X2,Y2 in place of CALL LINE (X1,Y1,X2,Y2) 


(where IUX is the I/O unit number assigned to a particular file), and the plotting 
was performed by a separate program. 

It should be emphasized that the number of scratch files needed for assignment 
of I/O unit IU in INITS8D is also system dependent. For example, on the 1904A 
more than one file was needed for picture resolutions higher than that correspond- 
ing to a 30 X 30 X 30 object cube mesh; for 60 X 60 X 60 mesh, four files were 
needed, each one storing the information relating to 15 of the 60 image planes. 

With the above changes implemented, the package ran successfully on the Ici 
1904A for the processing of concave pictures (optimization objective-function 
surfaces) as well as for pictures of bounded objects and for a wide variety of eye 
positions. Successful processing was often obtained even when one of the eve- 
position coordinates was negative (cf. comment lines relating to the array EYE in 
INIT3D). On this machine, less than 30K 24-bit words were needed at run time 
for a resolution corresponding to a 60 X 60 X 60 mesh; while 12K words were 
sufficient for a 30 X 30 X 30 mesh resolution. The run time for the first test picture 
at the higher resolution was approximately 10 minutes. 
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Visible Surface Plotting Program [J6] 
[T. Wright, Comm. ACM 17, 3(March 1974), 152-155 } 


Lucian D. Duta [Recd 5 Aug. 1975] 
Academy of Economic Studies, Str. Dorobanti 15-17, Bucharest, Romania 


Algorithm 475 has been modified for running on an IBM 370 computer and on a 
FELIX C-256 computer, using the Fortran IV compilers. The two sample plots 
were successfully produced on a BENSON 222 plotter. 

The changes in the program are those described by Mashburn [1 ]. In addition to 
these changes, we suggest the following. 


SETORG Subroutine 


1. Because the parameter ZT is not used in the PERSPC entry, change the entry 
statement to 


ENTRY PERSPC(X, Y,Z,XT, YT) 
2. Remove the statements 


AL = ACOS(COSAL) 
BE = ACOS(COSBE) 
GA = ACOS(COSGA) 


3. Change the statement 
SINGA = SIN(GA) 
to 
SINGA = SQRT(1. — COSGA+COSGA) 
4. Change statement 10 from 
10 SINBE = SIN(BE) 
to 
10 SINBE = SQRT(1. — COSBEsCOSBE) 
INIT3D Subroutine 
1. Modify all statements which call to PERSPC entry by removing the last 
argument: 


CALL PERSPC(1.,1.,W,D, YT) 
CALL PERSPC(U,V,1.,D, YB) 

CALL PERSPC(U,1.,1.,XL,D) 
CALL PERSPC(1.,V,1.,XR,D) 


2. Include an ENTRY statement after statement 60: 
ENTRY INIS2 


A call to the INIS2 entry will produce the filling of the array [S2 with zeros and 
the rewinding of the JU unit. Because the call to the JNJS2 entry is made only 
after the call to the INIT3D subroutine, the JNJS2 entry need not have 
parameters. 


3. Change the comment cards from 


C IF SEVERAL PICTURES ARE TO BE DRAWN WITH THE SAME SIZE 

C ARRAYS AND EYE POSITION AND THE USER REWINDS IU AND FILLS 
C IS2 WITH ZEROES, INIT3D NEED NOT BE CALLED FOR OTHER THAN 
C THE FIRST PICTURE. 


to 


C IF SEVERAL PICTURES ARE TO BE DRAWN WITH THE SAME SIZE 
C ARRAYS AND EYE POSITION, INIT3D NEED NOT BE CALLED FOR 
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C OTHER THAN THE FIRST PICTURE. IN THIS CASE, BEFORE EACH 
C SUBSEQUENT PICTURE THE INIS2 ENTRY MUST BE CALLED FOR 
C REWINDING IU AND FILLING THE ARRAY IS2 WITH ZEROES. 


Demonstration Program 


1. Change the following statements: 


C FOUR LINES ARE INCLUDED. 
REWIND 9 
DO 70 I = 1,3 
DO 60 J = 1,160 
IS2(I,J) = 0 
60 CONTINUE 
70 CONTINUE 


to 


C LINE IS INCLUDED. 
CALL INIS2 


REFERENCES 
[1] Masupurn, R.G. Remark on Algorithm 475. Comm. ACM 18, 5(May 1975), 276-277. 
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Visible Surface Plotting Program [J6] 
[T. Wright, Comm. ACM 17, 3 (March 1974), 152-155] 


A.C.M. van Swieten [Recd 28 July 1976 and 12 Sept. 1978] 
VSSG, P.O. Box 3032, Leyden, The Netherlands 


and 


J.Th.M. de Hosson . 
Department of Applied Physics, Rijksuniversiteit Groningen, Universiteitscom- 
plex Paddepoel, Nijenborgh 18, 9747 AG Groningen, The Netherlands 


This research was supported by the Netherlands Organization for the Advancement of Pure Research 
(Z.W.0., The Hague). 


This remark describes an extension of the visible surface plotting program, ACM 
Algorithm 475. This program turns out to result in a long plotting time when one 
is using CALCOMP plot routines. The long plotting time is mainly caused by 
numerous idle pen movements which are inherent to the structure of the algo- 
rithm. Essentially the algorithm does the following: the three-dimensional surface 
is cut in slices. The slices are separated and then searched in order to produce a 
perspective image of that slice and to remove the hidden lines; therefore, the 
algorithm generates a large number of small segments in the search direction. In 
general, however, the search direction does not coincide with the contour direc- 
tion. When one is using CALCOMP subroutines there are a lot of idle pen 
movements due to the fact that the segments are not in an appropriate order. In 
Figure 1(a) it is shown that numerous idle pen movements are necessary to plot 
a disklike form. In the improved version only one idle pen movement is made 
(see Figure 1(b)). 

The extension consists of two subroutines: SDLINE and PLTOUT. In the 
original subroutine DANDR we have to add five statements: Insert 


COMMON/TOM1/NSEQ, SS; SS = 0.04, NSEQ = 0 
before the statement 


SLOPE = DX/DY 
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(a) (b) 


Fig. 1. (a) The pen movements generated by the original version of the plotting 
program. The idle pen movements are dashed lines. (b) Output of the improved 
plotting program showing one idle pen movement (dashed line) 


which initializes 
SDLINE; CALL SDLINE(X1, Y1, X2, Y2) 


instead of 
LINE(X1, Y1, X2, Y2) 


which builds up the sequences and 
CALL PLTOUT 


after the statement 
130 CONTINUE 


in DANDR which plots the sequences. The subroutine SDLINE(X1, Y1, X2, Y2) 
temporarily stores the segments in order to construct the sequences. This is done 
by comparing the last point of each sequence with the endpoints of a segment. 
The criterion for the continuation of a sequence is that one of the endpoints of 
the segment lies within a square with edges of 2SS around the last point of a 
sequence. The value of SS depends on the plotter precision and it is taken to be 
equal to 0.04. If there is no continuation point of any sequence a new sequence is 
started through the segment. 

In the present version the length of the sequences is equal to 80 and the number 
is equal to 20. If a sequence has been filled up completely a new sequence is 
created. If one needs more than 20 sequences intermediate plotting takes place 
by calling PLTOUT. 

The subroutine PLTOUT plots the sequences taking into account the minimum 
distance between starting points and ends of sequences. This is done by ordering 
the sequences in an appropriate way and by indicating whether they should be 
processed in normal or reversed order. 

Finally we give some test results of the revised program compared with the old 
version. The core size, execution time, and CALCOMP plotting time are compared 
in the case of the second example (Figure 5) in Algorithm 475. Although this type 
of surface is not the one that results in the greatest reduction, the saving of 
plotting time is significant (see Table I). In Table I the time spent in DANDR 
but not the time spent in PLTOUT is listed. The space of INIT3D + P + 
DANDR (old version) and of INIT3D + P + DANDR + SDLINE + PLTOUT 
+ TOM are also given in Table I. 


Table I 
CALCOMP plotting 
CYBER 74-16 Space Time time (minutes) 
Old version 12775 5.012 31 


Revised version 10544, 5.621 8 
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REVISED ALGORITHM 


C 
C 
C 
C 
C 
C 
C 
C 
C 
C 
C 
C 
Cc 


PROGRAM CONES (INPUT, OUTPUT, PLOT, TAPE6=OUTPUT, TAPE5=INPUT, 
LTAPE99=PLOT, TAPE9) 


DEMONSTRATION PROGRAM 

BY THOMAS WRIGHT IN: 

ALGORITHM 475, VISIBLE PLOTTING PROGRAM (J6), 
COMMUNICATION OF THE ACM, MARCH 1974,VOL.17,NUMBER 3,P 152. 


ARAKKKAKKKK MACHINE DEPENDANT FUNCTIONS *X*#XAXXXAAKKARKKEKKKEKEKE 


FIRST CARD IS THE PROGRAM CARD FOR CDC 6@¢@@ AND CDC 700@ SERIES. 
CALCOMP PACKAGE WHICH CONTAINS THE SUBROUTINES NAMPLT,ENDPLT, 
NAMPLT = TO INITIALIZE THE SYSTEM. 

ENDPLT = TO TERMINATE PLOTTING ON A FILE. 


DIMENSION EYE(3),S(4),ST1(8@, 8@, 2) ,1S2(3, 16) 
DIMENSION LOBJ (80, 80) 
CALL NAMPLT 

USE WHOLE FRAME 
S(1)=¢. 

§(2)=28. 

$(3)=¢. 

S(4)=28. 

SET EYE POSITION 
EYE(1)=2@@. 
EYE(2)=400. 

EYE (3)=300. 

NX=8¢ 

NY=8¢ 

NZ=8¢@ 

NCELLS=2 
MX=NCELLS*NY 
LX=1+MX/ 60 

MY=MX 


THIS PICTURE WILL BE THE T=4 CONTOURSURFACE OF 
T=1/SQRT (U*U+V*V+W*W)+(. 5-V) **2/SQRT (U*U+V¥V). 


C THE 


aaaan 


aaagaaaan 


89 
90 


30 
49 


5@ 


THIS IS THE SECOND PICTURE (FIG.5) PRODUCED BY THE TEST PROGRAM 
OF THOMAS WRIGHT. 

CALL INIT3D(EYE,NX,NY,NZ,ST1,LX,MY,1IS2,9,S) 

DO 5@ I=1,NX 

U=(4@.5-FLOAT(I))/79. 

UU=U*U 

DO 46 J=1,NY 

V=(FLOAT(J)-40.5)/79. 

VV=V4V 

A=1./SQRT (UU+VV) 

DO 3@ K=1,NZ 

FOLLOWING CARD ADDS AXES. 

IF (1*J.EQ.1 .OR. I*K.EQ.1 .OR. J*K.EQ.1) GO TO 8¢ 
W= (FLOAT (K)-49.5)/79. 

IF (1./SQRT (UU+VV+WAW) + (.5-V)**2*A.LE.4.) GO TO 90 
LOBJ(J,K)=1 

GO TO 30@ 

LOBJ(J,K)=@ 

CONTINUE 

CONTINUE 

CALL DANDR(NY,NZ,ST1,LX,MX,MY,1S2,9,S,IOBJ,NY) 
CONTINUE 

CALL ENDPLT 

STOP 

END 

SUBROUTINE INIT3D(EYE,NU,NV,NW,ST1,LX,NY,1S2,1U,S) 
BY THOMAS WRIGHT 


THIS ROUTINE IMPLEMENTS THE 3-SPACE TO 2-SPACE TRANSFORMATION BY 
KUBER,SZABO AND GIULIERI, THE PERSPECTIVE REPRESENTATION OF 
FUNCTIONS OF TWO VARIABLES. J. ACM 15,2, 193-264,1968. 


DIMENSION EYE(3) ,ST1(NV,NW, 2),I1S2(LX,NY),S (4) © 

THE METHOD IS DESCRIBED IN DETAIL IN - ONE-PASS HIDDEN- 

LINE REMOVER FOR COMPUTER DRAWN THREE-SPACE OBJECTS. PROC 

1972 SUMMER COMPUTER SIMULATION CONFERENCE ,261-267,1972. 

THIS VERSION IS FOR USE ON CDC 6@¢@ OR 7¢¢@ COMPUTERS. 

THIS PACKAGE OF ROUTINES PLOTS 3-DIMENSIONAL OBJECTS WITH 

HIDDEN PARTS NOT SHOWN. 

INIT3D IS AN INITIALIZATION ROUTINE FOR THIS PACKAGE. IT IS CALLED 
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AAQAANQAANAQAANAARAAANARAAAARANANNAaAanNANANANDANAANANDA 


»THEN A SEQUENCE OF CALLS ARE MADE TO DANDR TO PRODUCE A PICTURE. 


EYE AN ARRAY 3 LONG CONTAINING THE U,V,W COORDINATES OF THE EYE 
POSITION. OBJECTS ARE CONSIDERED TO BE IN A BOX WITH 2 EXTREME 
CORNERS AT (1,1,1) AND (NU,NV,NW). THE EYE POSITION MUST HAVE POSI 
TIVE COORDINATES AWAY FROM THE COORDINATE PLANE U=$, V=0,W=0. 
WHILE GAINING EXPERIENCE WITH THE PACKAGE, USE EYE(1)=5*NU,EYE(2)= 
4*NV, EYE(3)=3*NW. 
U DIRECTION LENGTH OF THE BOX CONTAINING THE OBJECTS 
V DIRECTION LENGTH OF THE BOX CONTAINING THE OBJECTS 
NW W DIRECTION LENGTH OF THE BOX CONTAINING THE OBJECTS 
ST1 A SCRATCH ARRAY AT LEAST NV*NW*2 WORDS LONG. ; 
LX FIRST DIMENSION OF A SCRATCH ARRAY, IS2, USED BY THE PACKAGE 
FOR REMEMBERING WHERE IT SHOULD NOT DRAW. 
LX=14+NX/NBPW. 
NY SECOND DIMENSION OF IS2. 
IS2 A SCRATCH ARRAY AT LEAST LX*NY WORDS LONG. 
IU UNIT NUMBER OF SCRATCH FILE FOR THE PACKAGE. ST1 
WILL BE WRITTEN NU TIMES ON THIS FILE. 
S AN ARRAY 4 LONG WHICH CONTAINS THE COORDINATES OF THE 
AREA WHERE THE PICTURE IS TO BE DRAWN, 
THAT IS, ALL PLOTTING COORDINATES GENERATED WILL BE BOUNDED AS 
FOLLOWS-- X COORDINATES WILL BE BETWEEN S(1) AND S(2), 
Y COORDINATE WILL BE BETWEEN S(3) AND S(4). 
TO PREVENT DISTORTION, HAVE S(2)-S(1)=S (4)-S (3) 
IF SEVERAL PICTURES ARE TO BE DRAWN WITH THE SAME SIZE 
ARRAYS AND EYE POSITION AND THE USER REWINDS IU AND FILLS 182 
WITH ZEROES, INIT3D NEED NOT TE BE CALLED FOR OTHER THAN THE 
FIRST PICTURE. 


SET UP TRANSFORMATION ROUTINE FOR THIS LINE OF SIGHT. 
U=NU 
V=NV 
W=NW 
AX=U*@.5 
AY=V*0.5 
AZ=W*Q.5 
EX=EYE (1) 
EY=EYE (2) 
EZ=EYE (3) 
DX=AX—-EX 
DY=AY-EY 
DZ=AZ-EZ 
D=SQRT (DX*DX+DY *DY+DZ*DZ) 
CA=DX/D 
CB=DY/D 
CG=DZ/D 


CRERRAKKKEKKK MACHINE DEPENDANT FUNCTION *&%4*XKAK ACOS kkKKKAKAKKAAKKE 


C 


C 
C 
C 


aaa 


aa 


AL=ACOS (CA) 
BE=ACOS (CB) 
GA=ACOS (CG) 
THE MACINE DEPENDANT FUNCTION ACOS CAN BE REPLACED BY ARCCOS 
AL=ARCCOS (CA) 
BE=ARCCOS (CB) 
GA=ARCCOS (CG) 
SINGA=SIN (GA) 
THE 3-SPACE POINT LOOKED AT IS TRANSFORMED INTO (@,@) OF 
THE 2-SPACE. THE 3-SPACE Z-AXIS IS TRANSFORMED INTO THE 
2-SPACE Y AXIS. 
IF(SINGA.LT.@.09@1)GO TO 11 
R=1./SINGA 
FIND EXTREMES IN TRANSFORMED SPACE. 
CALL P(1.,1.,W, DUMMY, YT,AX,AY,AZ,EX,EY,EZ,CA,CB,CG,D,R) 
CALL P(U,V,1.,DUMMY,YB,AX,AY,AZ,EX,EY,EZ,CA,CB,CG,D,R) 
CALL P(U,1.,1.,XL,DUMMY,AX,AY,AZ,EX,EY,EZ,CA,CB,CG,D,R) 
CALL P(1.,V,1.,XR, DUMMY ,AX,AY,AZ,EX,EY,EZ,CA,CB,CG,D,R) 
ADJUST EXTREMES TO PREVENT DISTORTION WHEN GOING FORM 
TRANSFORMED SPACE TO PLOTTER SPACE. 
DIF=(XR-XL~YT+YB)*.5 
LF (DIF) 10, 30,20 
1@ XL=XL+DIF 
XR=XR-DIF 
GO TO 3¢ 
2@ YB=YB-DIF 
YT=YT+DIF 
3@ REWIND IU 
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C FIND THE PLOTTER COORDINATES OF THE 3-SPACE LATTICE POINTS. 


C1=.9%*(S(2)-S (1) )/ (XR-XL) 

C2=.05* (S(2)-S (1) )+8 (1) 
C3=.9*(S(4)-S (3))/ (YT-YB) 

C4=.05* (S(4)-S (3) )+8 (3) 

DO 6@ I=1,NU 

U=NU+1-1 

DO 5¢@ J=1,NV 

v=J 

DO 49 K=1,NW 

W=K 

Q=D/ ( (U-EX) *CA+ (V-EY) *CB+ (W-EZ) *CG) 
X =((EX+Q* (U-EX)-AX) *CB- (EY+Q* (V-EY )-AY) *CA) *R 
Y =(EZ+Q* (W-EZ)-AZ)*R 
ST1(J,K,1)=C1* (X-XL)+C2 
ST1(J,K, 2)=C3* (Y-YB)+C4 


4 CONTINUE 
5@ CONTINUE 


C WRITE THEM ON UNIT IU. 


WRITE(1IU)ST1 


60 CONTINUE 


REWIND IU 


C ZERO OUT ARRAY WHERE VISIBILITY IS REMEMBERED. 


C 
C 
Cc 


gaangaNnangannanaaaqanannnannanannnnnanannann 


DO 8@ J=1,NY 
DO 7@ I=1,LX 
1S2(L,J)=¢ 


7@ CONTINUE 
8 CONTINUE 


RETURN 


11 CONTINUE 


STOP 
END 


SUBROUTINE P(X,¥,Z,XT,YT,AX,AY,AZ,EX,EY,EZ,CA,CB,CG,D,R) 


X,Y,Z ARE THE 3-SPACE COORDINATES OF A POINT TO BE TRANSFORMED. 
XT,YT THE RESULTS OF THE 3-SPACE TO 2-SPACE TRANSFORMATION. 


Q=D/ ( (X-EX) *CA+ (Y-EY ) *CB+ (Z-EZ) *CG) 

XT= eo ee ee 
YT=(EZ+Q* (Z-EZ)-AZ) *R 

RETURN 

END 

SUBROUTINE DANDR(NV,NW,ST1,LX,NX,NY,IS2,1U,S,IOBJS,MV) 


THE PURPOSE OF THE SUBROUTINE AND THE INPUT AS WELL AS THE 
OUTPUT PARAMETERS ARE THE SAME AS PUBLISHED BEFORE BY WRIGHT. 
THEY ARE SUMMARIZED AND REPETED IN BEHALF OF THE USERS OF THIS 
SUBROUTINE DANDR 


THIS SUBROUTINE IS CALLED NU TIMES, EACH CALL PROCESSING THE 
PART OF THE PICTURE AT U=NU-I+1 WHERE I IS THE NUMBER OF THE CALL 
TO DANDR. THE PART OF THE PICTURE AT U=NU IS PROCESSED DURING 
THE FIRST CALL, THE PART OF THE PICTURE AT U=NU-I+1 DURING 
THE SECOND CALL, AND SO ON UNTIL THE PART OF THE PICTURE AT U=1 
IS PROCESSED DURING THE LAST CALL. 

PARAMETERS IN THE CALL 


NV V DIRECTION LENGTH OF THE BOX CONTAINING THE OBJECT. 

NW W DIRECTION LENGTH OF THE BOX CONTAINING THE OBJECT. 

ST1 A SCRATCH ARRAY AT LEAST NV*NW*2 WORDS LONG. 

LX THE NUMBER OF WORDS NEEDED TO HOLD NX BITS. 

NX NUMBER OF CELLS IN THE X DIRECTION OF A MODEL OF THE 
IMAGE PLANE. 

NY NUMBER OF CELLS IN THE Y DIRECTION OF THE MODEL OF THE 
IMAGE PLANE. 

IS2 AN ARRAY TO HOLD THE IMAGE PLANE MODEL. 

Iv UNIT NUMBER OF SCRATCH FILE FOR THE PACKAGE. 


ST1 WILL BE WRITTEN NU TIMES ON THIS FILE. 
IOBJS A NV BY NW ARRAY DESCRIBING THE OBJECT. 
IF THIS IS CALL NUMBER I TO DANDR, THE PART OF THE PICTURE 
AT U=NU-I+1 IS TO BE PROCESSED. IOBJS DEFINES THE OBJECTS 
IOBJS(J,K)=1 IF ANY OBJECT CONTAINS THE POINT (NU-I+1,J,K) 
AND IOBJS(J,K)=@ OTHERWISE. 
MV THE ACTUAL FIRST DIMENION OF IOBJS IN THE CALLING PROGRAM. 
S) AN ARRAY WHICH CONTAINS THE COORDINATES OF THE AREA WHERE 
THE PICTURE IS TO BE DRAWN. 
THE PROGRAM IS TESTED USING A CDC76@@ (CYBER 76-16) COMPUTER 
INSTALLATION, AND CDC 66¢@ INSTALLATION AS WELL. 


1500 
151¢ 
1526 
153 
1540 
1550 
156¢ 
1570 
158 
159¢ 
1660 
1616 
1620 
1630 
1646 
1650 
1660 
167¢ 
168¢ 
169¢@ 
17060 
17106 
172 
1730 
1746 
1750 
176¢ 
1770 
1780 
1796 
18060 
181¢ 
1820 
183¢ 
1849 
1850 
1860 
1870 
1880 
189¢ 
1900 
1910 
1926 
1930 
1946 
1950 
1966 
1976 
1980 
1996 
2000 
2010 
2026 
2030 
2040 
2050 
2060 
2070 
2080 
2090 
2106 
2110 
2120 
2130 
2140 
2156 
2160 
2170 
2180 
2190 
2200 
2210 
2220 
2230 
2240 
2250 
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CkeRK 


1¢ 
20 


36 
49 


5@ 


60 
7¢ 


80 


Cc 

Cc INLINE FUNCTION WHICH ARE ASSUMED TO BE AVAILABLE, ARE= 

Cc ABS, FLOAT, IFIX, MOD. 

CRHEREKEREKKEKE MACHINE DEPENDANT CONSTANTS XXXRXAKKKKKERKEKEEKKERK 
Cc NBPW NUMBER OF BITS PER WORD 

Cc CDC SERIES (PRESENT CASE) NBPW=60. 

Cc IBM SERIES, NBPW=32. 

C UNIVAC 110@@ SERIES, NBPW=36. 

Cc MASK AN ARRAY NBPW LONG. MASK (I)=2** (I-1) ,I=1,2,3,..,NBPW. 

c 


DIMENSION ST1(NV,NW, 2) ,IS2(LX,NY) ,S(4) , LOBJS (MV,NW) 
DIMENSION MASK (6@) 
INTEGER AND,OR 
COMMON/TOM1/NSEQ,SS 
REKKKKKKKEKKK NBPW 
DATA NBPW/6/ 
DATA MASK/1B, 2B, 4B, 1@B, 26B, 4@B, 100B, 200B, 400B, 

*10QOB , 2600B, 4000B , 190008, 

X2GDOOB, 4OOOOB , 1OPGOOB , 2DGHOOB,, 4OOHOOB , 1GOOOOOB , 2GOOOOOB , 490000OB, 
®1GDOOGDOOB , 2OODDOODB , 4OOOOHOOB , 100GOHGOOB , 20G00GSHOOB , 400000000B, 
*1DDDDOOGODB , 2DGPADODOOB , 4GGOHOOHOOB , 190HGGHHOOOB , 200000000008, 
XLGDDDDDOOOOB , 1OHHDHOOADOB , 240GGHGOHGOOB , 4600000GGOOOB , 
¥1PDODOSDDDOOOB , 2G09OHDHDHOOOHOB , 44G4GHHOHOHHOGB , 100000000000008, 
*2DGODDOODOOGOSB , 49OHOHODGODGOGOB , 164000GHNGOOGOOOB , 260000000000000B, 
*4DDDODDDDDDHOOOB , 19600OHOHOOOOHHODB , 26060000000000008, 
*4GDDOOGEDDDDDOOOB , 1000000GOOOOGHGOOB , 26009000000000000B, 
XADDDPDPGDODHDDOOOB , 16090040OOOOOGHOOOB , 260000000000000000B, 
*¥4GDODODDDDDDDHDGODB , 10400000GHOGOOHOOOOGB , 2600000000006000000B, 
*4GO09ODDDDDDDHDDADSB , 160900009040SGGGOOGB , 260000000000000000008, 
*499606060000006000000B / 

ASSIGN 12@ TO IRET 

INITIALIZATION 

NSEQ=@ 

SS=@.04 

RX= (FLOAT (NX)-1.)/(S(2)-S(1)) 

RY= (FLOAT (NY)-1.)/(S (4) -S (3) ) 

READ (IU)ST1 

NVD2=NV/ 2 

NWD2=NW/ 2 

DX=(ST1(NV,NWD2,1)-ST1(1,NWD2,1))*.5/(FLOAT(NV)-1.) 
DY=(ST1(1,NWD2, 2)-ST1(NV,NWD2, 2) )*.5/ (FLOAT (NV)-1.) 
DZ=(ST1(NVD2 ,NW,2)-ST1(NVD2,1, 2) )*.5/(FLOAT(NW)-1.) 

SLOPE=DY /DX 

DO 13¢ I=2,NV 

JUMP=LOBJS (I-1,1)*8+IOBJS (1,1) *4+1 

DO 12¢ J=2,NW 

X=ST1(1I,J,1) 

Y=ST1(1,J,2) 

JUMP=(JUMP-1) /4+1OBJS (I-1, J) *8+IOBJS (1,J) *4+1 

GO TO(120,20,40,5, 70, 80, 36,100, 100,10, 80, 70,50, 40, 20,120) , JUMP 
ASSIGN 9@ TO IRET 

X1=X 

Y1=Y-DZ 

X2=X+DX 

Y2=Y-DY 

GO TO 11¢ 

ASSIGN 6@ TO IRET 

X1=X 

Y1=Y-DZ 

X2=X-DX 

Y2=Y+DY 

GO TO 11 

X1=X+DX 

Y1=Y-DY 

X2=X-DX 

Y2=Y+DY 

GO TO 11¢ 

ASSIGN 120 TO IRET 

X1=X+DX 

Y1=Y-DY 

X2=X 

Y2=Y+DZ 

GO TO 11¢ 

X1l=X 

Y1=Y-DZ 


RREKEKREREREKERERERREEREERERERERERERERREKRERERERERERE 


2260 
2270 
2280 
2290 
2300 
2316 
2320 
2330 
23406 
2350 
2360 
2370 
2380 
2390 
2460 
2410 
2426 
2430 
2440 
2450 
2460 
2470 
2480 
2490 
2500 
2510 
2520 
2530 
254¢ 
2550 
2560 
2570 
2580 
2590 
2600 
2610 
2620 
2630 
2640 
2650 
2660 
2670 
2680 
2690 
2700 
2710 
2720 
2730 
2740 
2750 
2760 
2770 
2780 
2790 
2800 
2816 
2820 
2830 
2840 
2850 
2860 
2870 
2880 
2890 
29060 
2916 
2920 
2930 
2946 
2950 
2960 
2970 
2986 
2999 
3000 
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Y2=Y+DZ 3620 

GO TO 11¢ 30630 

96 ASSIGN 1206 TO IRET 3040 
16¢ X1=X-DxX 3050 
Y1=Y+DY 3060 
X2=X 3670 
Y2=Y+DZ 3080 

11@ IX=(X1-S (1) )*RX 3090 
IY=MOD (IFIX((Y1-S (3) ) *RY-SLOPE*FLOAT (1X) )+NY ,NY)+1 3100 
IBIT=MOD(1X,NBPW)+1 3110 
LX=1X/NBPW+1 3120 
T1=IS2 (1x, IY) 3130 
12=MASK (IBIT) 3140 
IV=AND (11,12) 3150 
IF(IV.NE.@)GO TO LRET, (60,9, 120) 3160 
IX=(X2-S(1)) *RX 3176 
IY=MOD (LFIX((Y2-S (3) )*RY-SLOPE*FLOAT (LX) )+NY ,NY)+1 3180 
IBIT=MOD (IX, NBPW)+1 3190 
LX=1X/NBPW+1 3200 
I1=1S2 (1X, LY) 3210 
12=MASK (IBIT) 3226 
IV=AND (11,12) 3230 
IF(IV.NE.@)GO TO IRET, (60,90, 12¢) 3246 
CALL SDLINE(X1,Y1,X2,Y2) 325¢ 

GO TO IRET, (60,904,120) 3260 

12@ CONTINUE 3276 
13@ CONTINUE 3280 
CALL PLTOUT ' 3296 

Cc SUBROUTINE PLTOUT PLOTS THE SEQUENCES TAKING INTO ACCOUNT 3306 
Cc THE MINIMUM DISTANCE BETWEEN BEGINNING AND ENDPOINTS OF THE 3316 
Cc SEQUENCES. 3326 
DO 15¢ I=1,NV 333¢ 

DO 14¢ J=1,NW 3346 
IF(LOBJS(1,J).£EQ.6)GO TO 14¢ 335¢ 
IX=(ST1(1,J,1)-S (1) )*RX+0.5 3360 
TWK=SLOPE*FLOAT (1X)-@.5 337¢ 
IY=MOD (IFIX((ST1(1,J,2)—-S (3) ) *RY-TWK)-4NY ,NY)+1 338¢ 
IBIT=MOD(IX,NBPW)+1 3390 
IX=1X/NBPW+1 3400 
I1=1S2(1x, IY) 3410 
12=MASK(IBIT) 3420 
1S2(1X,1Y)=OR(11,12) 3430 

14@ CONTINUE 3446 
15@ CONTINUE 3450 
RETURN 3460 
END 3470 
CX*XKKMACHINE DEPENDENT KEEKKKREKKRERERRERREKRREREKRKREEKEKRERRRRKKRRKKKRKKI 3486 
INTEGER FUNCTION AND(I,J) 3496 


C*#X*X*THIS VERSION FOR CDC 600G SERIES *&XXXRAKKEKEKKRRERKRERKRERRERERER 3500 
CRHERKKKKKK (AND. USED AS MASKING OPERATOR, *&KEKKKAKAKKARKAKAKAKKKEK 3516 


AND=I.AND.J 3526 
RETURN 3536 
END 3546 
CHXAKAMACHINE DEPENDENT *X&XXXKAAAAAAKAAAAKAKAKKRKAKKRERRREKARERERERRREK 3550 
INTEGER FUNCTION OR(I,J) 3560 


CXX**ATHTS VERSION FOR CDC6QGG SERIES *XAKAKAAKAKAAAKAAKAEKKKAKERRERERERK 35706 
CRERKKK .OR. USED AS MASKING OPERATOR, *#XARKAKKAKKAAKKKRKKKEKKKREERE 3580 


OR=1.0R.J 3590 

RETURN 3600 

END 3610 

SUBROUTINE SDLINE(X1,Y1,X2,Y2) 3620 
C PEN-UP MINIMIZING VERSION OF THE VISIBLE SURFACE PLOTTING PROGRAM 3630 
Cc ORIGINAL PROGRAM BY T. WRIGHT , COMMUN. ACM 17, 3(MARCH 1974) 3649 
Cc PP 152-155. 3650 
Cc AUTHORS , A.C.M. VAN SWIETEN (*) AND J.TH.M. DE HOSSON (**) 3660 
C 3670 
Cc (*) MATHEMATICAL INSTITUTE, STATE UNIVERSITY GRONINGEN, 3680 
Cc P.O. BOX 800, GRONINGEN, THE NETHERLANDS (PRESENT ADDRESS = 3690 
c VSSG,P.0. BOX 3632, LEYDEN, THE NETHERLANDS). 3700 
C (**) NORTHWESTERN UNIVERSITY, DEPT. MATERIALS SCIENCE AND 3719 
Cc ENGINEERING, THE TECHNOLOGICAL INSTITUTE, EVANSTON,ILLINOIS 69261, 3720 
G U.S.A. ( ON LEAVE OF ABSENCE, LABORATORIUM VOOR FYSISCHE METAAL- 3730 
Cc KUNDE, MATERIALS SCIENCE CENTRE, NIJENBORGH 18, GRONINGEN, 3740 
G THE NETHERLANDS , SEPT.1976- SEPT. 1977). 3750 
Cc 3760 
Cc IN THE ORIGINAL SUBROUTINE DANDR ONE HAS TO ADD THE 3770 
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C 
C 
Cc 
Cc 
C 
C 
C 
C 
C 
C 
C 
C 
Cc 
C 


AAMQMANANNANAANANAaANnnaAaAAa 


1¢ 
26 


4g 
50 


FOLLOWING FIVE STATEMENTS 
1)2) SS=6.04,NSEQ=@,BEFORE SLOPE=DX/DY WHICH INITIALIZE SDLINE. 
3) ADD COMMON/TOM1/NSEQ,SS TO DANDR. 

4) CALL SDLINE(X1,Y1,X2,Y2) INSTEAD OF LINE(X1,Y1,X2,Y2). 

5) CALL PLTOUT AFTER STATEMENT 130 CONTINUE IN DANDR. 


THE SUBROUTINE SDLINE(X1,Y1,X2,Y2) TEMPERARILY STORES THE 
SEGMENTS IN ORDER TO BUILT UP THE SEQUENCES. THIS IS DONE BY 
COMPARING THE LAST POINT OF EACH SEQUENCE WITH THE ENDPOINTS 
OF A SEGMENT. SDLINE IS ASSUMED TO DRAW A LINE FROM (X1,Y1) 

TO THE POINT (X2,Y2) UTILIZING THE SUBROUTINES PLTOUT AND PLOT. 


LOGICAL OPERATIONS .AND. , .OR. 


KKARKRARERER MACHINE DEPENDANT CONSTANTS *X&XXAAKAKRAKAKAAKKKKEKKE 


CDC 600% AND CDC 760¢@ SERIES. 

DIMENSION XX(8@, 20) ,YY (80,20) ,NN(20) 

COMMON/TOM/NN, XX, YY 

COMMON/TOM1/NSEQ,S 

IF(NSEQ.EQ.@) GOTO 2¢ 

SEARCH FOR CONTINUATION POINT. 

DO 16 ISEQ=1,NSEQ_ 

INN=NN (ISEQ) 

ISW=@ 

XL=XX (INN, ISEQ) 

YL=YY (INN, ISEQ) 

TRUE IN NEXT STATEMENT MEANS CONTINUATION POINT FOUND 
-AND. LOGICAL MULTIPLICATION. 

LF ( (ABS (X1-XL) .LE.S). AND. (ABS (Y1-YL).LE.S)) GOTO 5@ 

IF ( (ABS (X2-XL) .LE.S) .AND. (ABS (Y2-YL).LE.S)) GOTO 4@ 
LOGICAL EXPRESSION = INCLUSIVE .OR. 

IF((INN.GT.2).OR. (ISW.NE.@)) GOTO 106 

XBL=XX(1, ISEQ) 

YBL=YY (1, 1SEQ) 

XX(1,ISEQ)=XX(2 ,ISEQ) 

YY (1, ISEQ)=YY (2, ISEQ) 

XX (2, I1SEQ)=XBL 

YY (2, ISEQ)=YBL 

ISwW=1 

GOTO 5 

CONTINUE 

NEW SEQUENCE 

IF(NSEQ.EQ.2@) CALL’ PLTOUT 

NSEQ=NSEQt1 

XX (1,NSEQ)=X1 

XX (2 ,NSEQ)=X2 

YY (1,NSEQ)=Y1 

YY (2 ,NSEQ)=Y2 

NN (NSEQ)=2 

RETURN 

CONTINUE OLD SEQUENCE 

X2=X1 

Y2=Y1l 

INN=INN+1 

IF(INN.GT.8@) GOTO 2¢ 

XX (INN, ISEQ)=X2 

YY (INN, ISEQ)=Y2 

NN (ISEQ)=INN 

RETURN 

END 

SUBROUTINE PLTOUT 


INSERT CALL PLTOUT AFTER STATEMENT 13@ CONTINUE IN 
THE ORIGINAL DANDR SUBROUTINE. 


THIS SUBROUTINE PLOTS THE SEQUENCES TAKING INTO ACCOUNT 

THE MINIMUM DISTANCE BETWEEN BEGINNING AND ENDPOINTS OF THE 
SEQUENCES. THIS IS DONE BY ORDERING THE SEQUENCES IN AN 

APROPRIATE WAY AND BY INDICATING WHETHER THEY SHOULD BE PROCESSED 
IN THE NORMAL ORDER OR REVERSED. 


kkKKKK UNDEFINED EXTERNAL REFERENCES #4 XXXARAKAARAARARAKARKRAKRKERKERRRK 


SUBROUTINE PLOT(X,Y,IND)IS AVAILABLE IN THE CALCOMP PACKAGE. 
PLOT(X,Y,IND) = TO MOVE THE PEN FROM ITS CURRENT POSITION 

TO A NEW POSITION. 
X-COORDINATE, IN CM, OF NEW PEN POSITION RELATIVE TO ORIGIN. 
Y - COORDINATE, IN CM, OF NEW PEN POSITION RELATIVE TO ORIGIN. 
IND = IS USED TO CONTROL VERTICAL POSITION OF THE. PEN, THE 


x 
tou 


3780 
3790 
3800 
3810 
3820 
3830 
3840 
3856 
3860 
3870 
3880 
3896 
3900 
391¢ 
39206 
3930 
3940 
3950 
3960 
3970 
3980 
3996 
40OD 
4010 
4020 
4030 
4D4Q 
4950 
4060 
4670 
4080 
4090 
4106 
4110 
412¢ 
413¢ 
4140 
4150 
4166 
4170 
4180 
4199 
4200 
4210 
4226 
4230 
4240 
4250 
4260 
4270 
4280 
4290 
4300 
4310 
432@ 
4330 
4340 
4350 
4360 
4376 
4380 
4390 
4460 
4410 
4420 
4430 
4440 
4450 
4460 
4476 
4480 
4490 
4500 
4510 
4520 
4536 
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ESTABLISHING OF NEW ORIGINS, DUMPING OF THE BUFFER, AND 4546 


Cc 
Cc THE STARTING OF NEW BLOCKS. 455@ 
Cc 4560 
DIMENSION XX(8@, 20) , YY (80, 2@) ,NN(2@) , IND(20) , IDR(20) 457@ 
COMMON/TOM/NN, XX, YY 4586 
COMMON/TOM1/NSEQ,S 4590 
IF (NSEQ.EQ.@) RETURN 4600 
DO 1¢ I=1,NSEQ 461@ 
IND(1L)=1 4626 
16 CONTINUE 4630 
IDR(1)=1 4640 
LTEMP=NN (1) 4650 
OLDX=XX (ITEMP,, 1) 4660 
OLDY=YY (ITEMP, 1) 4670 
DO 3@ I=2,NSEQ- . 4680 
DMIN= 10900000. 4690 
DO 2@ J=1,NSEQ 4760 
-K=IND(J) 4716 
DX=XX(1,K)-OLDX 4720 
DY=YY (1,K)-OLDY 4730 
D=SQRT (DX*DX+DY*DY ) 4740 
IF(D.GE.DMIN) GOTO 15 4750 
DMIN=D. 4760 
MINJ=J 4770 
IDRT=1 4780 
15 ITEMP=NN(K) 4790 
DX=XX (ITEMP ,K) -OLDX 4800 
DY=YY (ITEMP ,K)-OLDY rs 4810 
D=SQRT (DX*DX+DY*DY) 4826 
IF(D.GE.DMIN) GOTO 2¢ 4830 
DMIN=D . 4849 
MINJ=J 4850 
IDRT=-1 4860 
2@ CONTINUE 4870 
L=IND (MINJ) 4880 
IND (MINJ)=IND(I) 4896 
IND (L)=L 4900 
IDR(1)=IDRT 49106 
IB=1 4920 
IF(IDRT.NE.1) IB=NN(K) 4930 
OLDX=XX(1IB,K) ; 4946 
OLDY=YY (IB,K) 4950 
3@ CONTINUE 4960 
DO 5@ I=1,NSEQ 4970 
K=IND(1) 4980 
N=NN (K) 4990 
IB=1 5000 
M3=IDR(1) 5010 
IF(M3.NE.1) IB=N 5020 
M1=IB+M3 5030 
Cx*e*eKR UNDEFINED EXTERNAL REFERENCE ¥®*® PLOT © RX¥RRKAREKEREKKEKRERKERKEKK 50406 
CALL PLOT(XX(IB,K) ,YY(IB,K), 3) 5050 
DO 40 L=2,N 5060 
CXRKKE UNDEFINED EXTERNAL REFERENCE *k PLOT KREREKEEERERERERKREKRKKR KKK 50670 
CALL PLOT (XX(M1,K) ,YY(M1,K), 2) 5080 
M1=M1+M3 , 5090 
4@ CONTINUE 51060 
NN(K)=@ 5119 
5@ CONTINUE 5120 
NSEQ=@ 5130 
RETURN 5146 
END ' : 515¢ 
FUNCTION ARCCOS(Y) 5160 
3 BECAUSE ACOS IS NOT A STANDARD FORTRAN FUNCTION THE PRESENT 5170 
C “UNCTION ROUTINE IS AN APPROXIMATION FOR IT. 5189 
PI=3.1415926 51990 
X=ABS (Y) 5200 
ARCCOS= (1.5707288-@. 2121144*X+0.674261*X*X-@. 618729 3*X*X*X) *SQRT (1 5210 
A.-X) 5220 
IF (Y.LT.@.) ARCCOS=PI-ARCCOS 5230 
RETURN . 52406 


END . 5256 
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Editor’s note: A/ gorithm 476 described here is available on magnetic 
tape from. the Department of Computer Science, University of 
Colorado, Boulder, CO 80302. The cost for the tape is $16.00 (U.S. 
and Canada) or $18.00 (elsewhere). If the user sends a small tape 
(wi. less than I 1b.) the algorithm will be copied on it and returned to 
him at a charge of $10.00 (U.S. only). All orders are to be prepaid 
with checks payable to ACM Algorithms. The algorithm is recorded 
as one file of BCD 80 character card images at 556 B.P.1., even 


parity, on seven track tape. We will supply algorithm at a density of 


800 B.P.I. if requested. Cards for algorithms are sequenced starting 
at 10 and incremented by 10. The sequence number is right justified in 
column 80. Although we will make every attempt to insure that the 
al gorithm conforms to the description printed here, we cannot guaran- 
fee it, nor can we guarantee that the algorithm is correct —L.D.F. 
and A.K.C. 
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Six Subprograms for Curve 
Fitting Using Splines Under 
Tension [E2] 


A.K. Cline 
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1470, Boulder, CO 80302 
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Author’s present address: Institute for Computer Applications 
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Key Words and Phrases: interpolation, splines, contouring, 
curve fitting 

CR Categories: 5.13, 8.2 

Language: Fortran 


Description 

The spline under tension package includes six subprograms: 
two in each of three problem areas. These implement the theory 
presented in [!]. The first pair, CURV1 and CURV2, solves the 
standard interpolation problem: determine a real-valued function 
that assumes values {}',}/-1 at abscissas {v,}/=1. The second pair, 
KURV1 and KURV2, solves the more general problem of passing 
a curve through a sequence of pairs {x;, ¥;}7=1 in the plane. The 
third pair, KURV PI and KURV P2, solves the same problem, but 
the solution curve is closed. 

CURV 1 and KURVI require additional endpoint slope condi- 
tions to determine the solution. The user may omit the information 
in which case values are produced internally based upon the other 
input information. If three or more points are to be interpolated, 
these internal slope values are the slopes given by a quadratic 
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polynomial interpolating the first three values for the initial slope 
and last three values for the terminal slope. If only two points are 
to be interpolated and no slope information is given, the resulting 
curve is a Straight line. The subprogram KURV PI! determines 
periodic splines under tension, and thus no additional slope infor- 
mation is required. 

In each pair of subprograms, the first is called only once, and 
sets up and solves the tridiagonal system to specify the spline. The 
second is used for the actual mapping of points. The function 
CURP2 returns an image point for a given real value. The sub- 
routines KURV2 and KURVP2 return the image pairs in their 
parameter sequences. Each of these subprograms, CURV2, KURV2, 
and KURV P2, first must determine which data points are adjacent 
to the input value. This search usually begins with the leftmost 
values and proceeds until the correct interval is found. However, if a 
sequence of input values is to be mapped, the search can be made 
more efficient by ordering these values left to right. The search can 
then proceed on one call from where it ended on the previous 
call. All three subprograms include an efficiency option which in 
effect says, ‘“You may proceed from where you stopped.” 

All the subprograms included require a natural exponential 
function named EXP. KURVI, KURV2, KURVP1, and KURV P2 
require a square root function SQR7. The subroutine KURV1 
requires the sine (S/N) and cosine (COS) functions, in addition to 
the function ATAN2 of two arguments which when given x and y 
(not both zero) returns an angle 6 which satisfies x = y X tan (6). 
All of these are basic Fortran external functions. 


References 
1. Cline, A.K. Scalar- and planar-valued curve fitting using 
splines under tension. Comm. ACM 17,4 (Apr. 1974), 218-220. 


Algorithm 


SUBROUTINE CURVI(Ns X, Ys SLPls SLPN» YP, TEMP, SIGMA) 
INTEGER N 
REAL X(N). YCN)s SLP1, SLPN» YP(N)s TEMP(N), SIGMA 
THIS SUBROUTINE DETERMINES THE PARAMETERS NECESSARY TO 
COMPUTE AN INTERPOLATORY SPLINE UNDER TENSION THROUGH 
A SEQUENCE OF FUNCTIONAL VALUES. THE SLOPES AT THE TkO 
ENDS OF THE CURVE MAY BE SPECIFIED OR OMITTED. FOR ACTUAL 
COMPUTATION OF POINTS ON THE CUFVE IT IS NECESSARY TO CALL 
THE FUNCTION CURV2. 
ON INPUT~- 
N 1S THE NUMBER OF VALUES TO BE INTERPOLATED (N.GE-2), 
X IS AN ARRAY OF THE N INCREASING ABSCISSAE OF THE 
FUNCTIONAL VALUES. 
Y IS AN ARFAY OF THE N ORDINATES OF THE VALUES, CILE.Y(K) 
IS THE FUNCTIONAL VALUE CORRESPONDING TO X(K)). 
SLP1 AND SLPN CONTAIN THE DESIRED VALUES FOR THE FIPST 
DERIVATIVE OF THE CURVE AT X(1) AND X(N), RESPECTIVELY. 
IF THE QUANTITY SIGMA IS NEGATIVE THESE VALUES WILL BE 
DETERMINED INTESNALLY AND THE USEF NEED ONLY FUFNISH 
PLACE-HOLDING PAPAMETERS FOR SLFIL AND SLPN- SUCH PLACE- 
HOLDING PARAMETERS WILL BE IGNORED BUT NOT DESTROYED, 
YP IS AN ARPAY OF LENGTH AT LEAST N 
TEMP 1S AN ARRAY OF LENGTH AT LEAST N VHICH IS USED FOR 
SCRATCH STORAGE, 
AND 
SIGMA CONTAINS THE TENSION FACTOR. THIS IS NON-ZERO AND 
INDICATES THE CURVINESS DESIRED. IF APSC(SIGMA) IS NEARLY 
ZERO (E.G. .@01) THE RESULTING CURVE 1S APPPOXIMATELY A 
CUBIC SPLINE. IF APBSCSIGMA) IS LARGE (E.G. 50.) THE 
RESULTING CURVE IS NEARLY A POLYGONAL LINE. THE SIGN 
OF SIGMA INDICATES WHETHER THE DERIVATIVE INFOPMATION 
HAS BEEN INPUT OR NOT- IF SIGMA IS NEGATIVE THE ENDPOINT 
DERIVATIVES WILL BE DETERMINED INTERNALLY. A STANDARD 
VALUE FOR SIGMA [S APPROXIMATELY 1+ IN ABSOLUTE VALUE. 
ON OUTPUT--~- 
YP CONTAINS VALUES PROPORTIONAL TO THE SECOND DERIVATIVE 
OF THE CURVE AT THE GIVEN NODES. 
N,»X2YsSLP1,SLPN AND SIGMA ARE UNALTERED. 
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NMI = N - 1 

NPi = N + 1 

DELXI = X(2) = XC¢1) 
DX1 = €YC€2)-YC1))/DELX1 


C DETERMINE SLOPES IF NECESSARY 
IF (SIGMA+LT.@-) GO TO 5@ 
SLPP! = SLP1I 
SLPPN = SLPN 


COLLECTED ALGORITHMS (cont.) 


C DENORMALIZE TENSION FACTOR 
16 SIGMAP ®# ABS(SIGMA)*FLOAT(N-1)/ (X(N) =XC1)) 


C SET UP RIGHT HAND SIDE AND TRIDIAGONAL SYSTEM FOR YP AND 
C PERFORM FORWARD ELIMINATION 
DELS = SIGMAP*DELX1 
EXPS = EXPC(DELS) 
SINHS = .S*CEXPS-1./EXPS) 
SINHIN = 1./(DELX1*SINHS) : 
DIAG! = SINHIN*®(DELS*.5#(EXPS+1./EXPS)-SINHS) 
DIAGIN = 1./DIAG1 
YPC1) = DIAGIN*(DX1-SLPP1) 
SPDIAG = SINHIN*(SINHS-DELS) 
TEMP(1) = DIAGIN*SPDIAG 
IF (N-EQ@.2) GO TO 38 
DO 28 [=2.NM1 
DELX2 = X¢CI4#1l) - XC) 
DX2 = CYCI+1)-Y¥C1L))/DELX2 
DELS = SIGMAP*DELX2 
EXPS = EXPC DELS) 
SINHS ® «S# CEXPS-1./EXPS) 
SINHIN #® 1./(CDELX@*SINHS) 
DIAG2 = SINHIN*(DELS*(.5«#CEXPS+1./EXPS))-SINHS) 
DIAGIN # 1-/¢(DIAGI+DIAG2-SPDIAG*TEMPCI-1)) 
YPCI) = DIAGIN* (DX2-DX1-SPDIAG*YPC(I-1)) 
SPDIAG = SINHIN*(SINHS-DELS) 
TEMP(I1) = DIAGIN*SPDIAG 
DXt = Dxe2 
DIAG] = DIAG2 
28 CONTINUE 
3@ DIAGIN = 146/¢DIAGI-SPDIAG*TEMP(NM1) > 
YP(N) = DIAGIN*(SLPPN-DX2-SPDIAG*YP(NM1) > 
C PERFORM BACK SUBSTITUTION 
DO 42 1=2,N 
IBAK = NPI - I 
YPCIBAK) = YPCIBAK) - TEMPCIBAK) *YPCIBAK+1) 
4@ CONTINUE 
RETURN 
5@ IF (NeE@.2) GO TO 68 
C IF NO DERIVATIVES ARE GIVEN USE SECOND ORDER POLYNOMIAL 
C INTERPOLATION ON INPUT DATA FOR VALUES AT ENDPOINTS. 
DELX2 = X(€3) - X¢2) 
DELX12 = X€3) - X11) 
C1 4 -CCDELX12+DELX1)/DELX12/DELX1 
C2 = DELXI2/DELX1/DELX2 
C3 = -=DELX!/DELX12/ DELX2 
SLPPL = Ci#YC1) + C2"YC2) + C3*YC3) 
DELN # X(N) - XCNMI) 
DELNM! = X(NM1) ~ X(N-2) 
DELNN # X(N) = X(N-2) 
Cl + (DELNN+DELN) /DELNN/ DELN 
C2 = -DELNN/DELN/ DELNM1! 
C3 = DELN/DELNN/DELNM! 
SLPPN = C3*YC(N-2) + CO*YCNMI1) + CI¥YCN> 
GO TO 18 
C IF ONLY TWO POINTS AND NO DERIVATIVES ARE GIVEN, USE 
C STRAIGHT LINE FOR CURVE 
6@ YP{ 1) = 6B. 
YP(2) = 6. 
RETURN 
END 3 


FUNCTION CURV2(T, Ns X» Ys YP, SIGMA, IT) 
INTEGER N, IT 
REAL Ts X(N). YC(N), YPC(N)s SIGMA 
THIS FUNCTION INTERPOLATES A CURVE AT A GIVEN POINT 
USING A SPLINE UNDER TENSION. THE SUBROUTINE CURVI SHOULD 
BE CALLED EARLIER TO DETEPFMINE CERTAIN NECESSARY 
PARAMETERS. 
ON INPUT-- 
T CONTAINS A REAL VALUE TO BE MAPPED ONTO THE INTERPO- 
LATING CURVE. 
N CONTAINS THE NUMBER OF POINTS WHICH WERE INTERPOLATED 
TO DETERMINE THE CURVE, 
X AND Y ARE ARRAYS CONTAINING THE ORDINATES AND ABCISSAS 
OF THE INTERPOLATED POINTS, 
YP IS AN ARRAY WITH VALUES PROPORTIONAL TO THE SECOND 
DERIVATIVE OF THE CURVE AT THE NODES 
SIGMA CONTAINS THE TENSION FACTOR (ITS SIGN IS IGNORED) 
IT 1S AN INTEGER SWITCH. IF IT 1S NOT 1 THIS INDICATES 
THAT THE FUNCTION HAS BEEN CALLED PREVIOUSLY (WITH NoX, 
YsYP, AND SIGMA UNALTERED) AND THAT THIS VALUE OF T 
EXCEEDS THE PREVIOUS VALUE. WITH SUCH INFORMATION THE 
FUNCTION IS ABLE TO PERFORM THE INTERPOLATION MUCH MORE 
RAPIDLY» IF A USER SEEKS TO INTERPOLATE AT A SEQUENCE 
OF POINTS, EFFICIENCY IS GAINED BY ORDERING THE VALUES 
INCREASING AND SETTING IT TO THE INDEX OF THE CALL. 
IF IT IS 1 THE SEARCH FOR THE INTERVAL (X(K),XCK+t1)) 
CONTAINING T STARTS WITH K=1t. 
THE PARAMETERS N»X,Y,YP AND SIGMA SHOULD BE INPUT 
UNALTERED FROM THE OUTPUT OF CURVI. 
ON OUTPUT-- 
CURV2 CONTAINS THE INTERPOLATED VALUE. FOR T LESS THAN 
XCL) CURV2 = Y(1). FOR T GREATER THAN X(N) CURV2 # YC(N). 
NONE OF THE INPUT PARAMETERS ARE ALTERED. 
S = X(N) - XC1) 
DENORMALIZE SIGMA 
SIGMAP = ABS(SIGMA) *FLOAT(N-1)/5S 
IF IT.NEs! START SEARCH WHERE PREVIOUSLY ‘TERMINATED. 
OTHERWISE START FROM BEGINNING 
IF CIT.EQ@.1) Il = 2 
C SEARCH FOR INTERVAL 
1@ DO 2@ 1#11sN 
IF (X€1I)-T) 28, 28, 36 
28 CONTINUE 
Is N 
C CHECK TO INSURE CORRECT INTERVAL 
3Q@ IF (XCI-1)-LE.T «OR. TeLE«X¢1)) GO TO 40 
C RESTART SEARCH AND RESET 11 
C ¢ INPUT '"'IT'* WAS INCORRECT >) 
Ti = 2. 
GO TO 18 
C SET UP AND PERFORM INTERPOLATION 
4@ DELI = T = X¢IK-}) 
DEL2 = X(I) - T 
DELS = XC€I) ~ XC(I-1) 
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EXPS1 = EXPCSIGMAP*DEL1) 

SINHD!] = .5*CEXPS1-1./EXPS1) 

EXPS = EXP(SIGMAP*DEL2) 

SINHD2 = .5*CEXPS-1./EXPS) 

EXPS = EXPSI*EXPS 

SINHS = .5*CEXPS-1./EXPS) 

CURV2 = (YPCI)*SINHDI+YPCI-1)*SINHD2)/SINHS + 
* COYCD)©YPCI) ¥DELI+CYCI-1)-YPCI-1))*DEL2)/DELS 
Il = 1 

RETURN 

END 


SUBROUTINE KURVI(N, X, Ys SLPI], SLPN, XP, YP, TEMP, S, 


* SIGMA) 

THIS SUBROUTINE DETERMINES THE PARAMETERS NECESSARY TO 
COMPUTE A SPLINE UNDER TENSION PASSING THROUGH A SEQUENCE 
OF PAIRS (¢XC1),YC1) ae eee CXCND LYON) ) IN THE PLANE. THE 
SLOPES AT THE TWO ENDS OF THE CURVE MAY BE SPECIFIED OR 
OMITTED. FOR ACTUAL COMPUTATION OF POINTS ON THE CURVE IT 
IS NECESSARY TO CALL THE SUBROUTINE KURV2. 
ON INPUT-- 
N IS THE NUMBER OF POINTS TO BE INTERPOLATED (N.GE.2), 
X IS AN ARRAY CONTAINING THE N X-COORDINATES OF THE 
POINTS, 
Y IS AN ARRAY CONTAINING THE N Y~COORDINATES OF THE 
POINTS, 
SLP! AND SLPN CONTAIN THE DESIPED VALUES FOR THE SLOPE 
OF THE CURVE AT (¢X(1),YC1)) AND CX(N)s4Y(N)). RESPEC- 
TIVELY- THESE QUANTITIES ARE IN DEGREES AND MEASURED 
COUNTERCLOCKWISE FROM THE POSITIVE X-AXIS- THE POSITIVE 
SENSE OF THE CURVE IS ASSUMED TO BE THAT MOVING FROM THE 
POINT 1 TO POINT Ne IF THE QUANTITY SIGMA IS NEGATIVE 
THESE SLOPES WILL BE DETERMINED INTERNALLY AND THE USER 
NEED ONLY FURNISH PLACE-HOLDING PARAMETERS FOR SLP1 AND 
SLPN+» SUCH PLACE-HOLDING PARAMETERS WILL BE IGNORED BUT 
NOT DESTROYED, 
XP,YP ARE ARRAYS OF LENGTH AT LEAST Ns 
TEMP IS AN ARRAY OF LENGTH AT LEAST N WHICH IS USED FOR 
SCRATCH STORAGE, 
AND 
SIGMA CONTAINS THE TENSION FACTOR. THIS 1S NON-ZERO AND 
INDICATES THE CURVINESS DESIRED. IF ABSC(CSIGMA) IS VERY 
LARGE (E.G. 50-) THE RESULTING CURVE IS VERY NEARLY A 
POLYGONAL LINE. THE SIGN OF SIGMA INDICATES WHETHER 
SLOPE IN FORMATION HAS BEEN INPUT OR NOT. IF SIGMA IS 
NEGATIVE THE END-POINT SLOPES WILL BE DETERMINED 
INTERNALLY. A STANDARD VALUE FOR SIGMA IS APPROXIMATELY 
1. IN ABSOLUTE VALUE. 
ON OUTPUT-- 
NsX,YsSLP1,SLPNs AND SIGMA ARE UNALTERED, 
XP AND YP CONTAIN INFORMATION ABOUT THE CURVATURE OF THE 
CURVE AT THE GIVEN NODES, 
AND 
S CONTAINS THE POLYGONAL ARCLENGTH OF THE CURVE. 

INTEGER N 

REAL X(N). YCN)s XPCN), YPC(N), TEMP(N), S, SIGMA 

DEGRAD = 3.1415926535897932/ 188. 
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NMI = N - 1 

NPL =N +! 

DELXI = X(2) - X(1) 

DELY! = YC(2) - YC1) 

DELS! = SQ@RT(DELX il *DELX1+DELY!*DELY1) 


DX1l = DELX1/DELSI 
DY1l = DELY!/DELS|} 

C DETERMINE SLOPES IF NECESSARY 
IF (SIGMA-LT.@-)> GO TO 78 
SLPP! = SLP1*DEGRAD 
SLPPN = SLPN*DEGRAD 


C SET UP RIGHT HAND SIDES OF TRIDIAGONAL LINEAR SYSTEM FOR XP 


C AND YP 
18 XPC1) = DXI - COSCSLPPI)> 
YPC1) = DYI = SINCSLPPI) 
TEMPC1) = DELS! 
S 2 DELS1 


IF (N.EQ.2) GO TO 30 
DO 20 1=2,NM1 
DELX2 = XCI+1) - XC€I) 
DELY2 = Y(I+#1) - YCI) 
DELS2 = SQRT(DELX2*DELX2+ DELY2*DELY2) 
DX2 = DELX2/DELS2 
DY2 = DELY2/DELS2 
XPCI) = DX2 - DX! 
YPCI) = DY2 - DY 
TEMPC(I) = DELS2 
DELX1 = DELX2 
DELY! = DELY2 
DELS1 = DELS2 
DXI = DxX2 
DY1 = DY2 
C ACCUMULATE POLYGONAL ARCLENGTH 
S = S + DELS1 
28 CONTINUE 
32 XPCN) = COSC(CSLPPN) - DXI 
YPCN) = SINCSLPPN) - DYI 
C DENORMALIZE TENSION FACTOR 
SIGMAP = ABS(SIGMA) *FLOAT(N-1)75 
C PERFORM FORWARD ELIMINATION ON TRIDIAGONAL SYSTEM 
DELS = SIGMAP*TEMP( 1) 
EXPS = EXPCDELS) 
SINHS = .5*(EXPS-1./EXPS) 
SINHIN = 1./(TEMPC1)*SINHS) 
DIAG! = SINHIN*(CDELS#.5*(CEXPS+1./EXPS)-SINHS) 
DIAGIN = 1./DIAGI 
XPC1) = DIAGIN*XP(1) 
YPC1) = DIAGIN®YPC1I) 
SPDIAG = SINHIN*(SINHS-DELS) 
TEMP(1) = DIAGIN*SPDIAG 
IF (N.EQ@.2) GO TO 5@ 
DO 48 1#2,NMI 
DELS = SIGMAP*TEMPCI) 
EXPS = EXPCDELS) 
SINHS = .«5*CEXPS-1./EXPS) 
SINHIN = 1¢/¢CTEMPC(1)*SINHS) 


0 


COLLECTED ALGORITHMS (cont.) 


C. 


DIAG2 = SINHIN*(DELS*¢.5*#CEXPS+1.6/EXPS))-SINHS) 
DIAGIN = 1./CDIAGI1+DIAG2-SFLIAG*TEMP(I-1)) 
XPCT) = DIAGIN®(XPCI)-SPDIAG#XPC(I-1)) 
YPCI) = DIAGIN*(YPCI)-SPDIAG#YPC(I=1)) 
SPDIAG = SINHIN*(SINHE~=DELS) 
TEMPCI) = DIAGIN*SPDIAG 
DIAG! = DIAG2 
46 CONTINUE 
5@ DIAGIN = 1i./¢(DIAG1I-SPCIAG*TEMPCNM1) ) 
XPCN) = DIAGIN*(CXPCN) -SPDIAG®XPC(NM1)) 
YPCN) = DIAGIN* (YP(N) -SPDILAG#YPCNM1)) 
PERFORM BACK SUBSTITUTION 
DO 6@ 1=2,N 


IBAK = NPI] - I 
XPCIBAK) = XPC(IBAK) = TEMPCIBAK)*XP( I BAK+1) 
YPCIBAK) = YPCIBAK) = TEMPCIBAK) *YPCIBAK+1) 
6@ CONTINUE 
RETURN 


70 IF (NwJEQ.2) GO TO 8@ 


CIF NO SLOPES ARE GIVEN, USE SECOND OPDER INTERPOLATION ON 


Cc 


Cc 
c 
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INPUT DATA FOP SLOPES AT ENDPOINTS 
CELS2@ = £QRTCCXC3)-XC2) )we2e (C3) -¥ C2) eK2) 
DELS!12 = DELS! + DELS2 
Cl = ~CCDELS12+DELS1)/DELS12/DELS1 
C2 = DELS12/DELS1/DELS2 
= -DELSI/DELS12/CELS2 
SX = CimXC1) + C2*X(2) + C3*X¢3) 
SY 3 Cl"YC1) + C2*Y¥C2) * C3*YC3) 
SLPP1 = ATAN2(SY, SX) 
DELNMI: = SQRTCCX(N-2)-XCNMI)) #24 CY CN-2)-YOCNMI) ) #2) 
DELN # SQRTCCXCNMI)-~XCN) #24 (CY CNMI) -YCN) ) #42) 
DELNN = DELNMI + DELN 


Cl = (CDELNN‘+DELN) /DELNN/ DELN 

Ce = -DELNN/DELN/DELNM1] 

C3 = DELN/DELNN/DELNM1 

SX = CB®X(N-2) + C2*X(NMI) + CI*X(N)D 
SY = C3*Y(N-2) + C2*YCNMI)D + CI®YC(N) 


SLPPN = ATAN2(SY,SX) 
GO TO 18 
IF ONLY TWO POINTS AND NO SLOPES ARE GIVEN, USE STRAIGHT 
LINE SEGMENT FOR CURVE 
BO XPCl) @ @. 


XP(2) = @. 
YP(1) = @. 
YP(2) ® 6. 
RETURN® 
END 


SUBROUTINE KURV2(T, XSs YS.» Ns Xs Ys XP» YPs Ss SIGMA) 

INTEGEP N 

PEAL T,. XSs YSs XCN), YCN)» XPCN)s YPCN), Ss SIGMA 
THIS SUBROUTINE PERFORMS THE MAPPING OF POINTS IN THE 
INTERVAL (@.,1.) ONTO A CURVE IN THE PLANE. THE SUBROUTINE 
KURV! SHOULD BE CALLED EARLIER TO DETERMINE CERTAIN 
NECESSARY PARAMETERS. THE RESULTING CURVE HAS A PARAMETRIC 
REPRESENTATION BOTH OF WHOSE COMPONENTS ARE SPLINES UNDER 
TENSION AND FUNCTIONS OF THE POLYGONAL ARCLENGTH PARAMETED 


ON INPUT--. 

T CONTAINS A REAL VALUE OF ABSOLUTE VALUE LESS THAN OR 
EGUAL T) 1. TO BE MAPPED TO A POINT ON THE CURVE. THE 
SIGN OF T 1S IGNORED AND THE INTERVAL (@.51.) IS MAPPED 
ONTO THE ENTIRE CURVE. IF T IS NEGATIVE THIS INDICATES 
THAT THE SUBROUTINE HAS BEEN CALLED PREVIOUSLY (WITH ALL 
OTHER INPUT VARIABLES UNALTE2ED) AND THAT THIS VALUE OF 
T EXCEEDS THE PREVIOUS VALUE IN ABSOLUTE VALUE. WITH 
SUCH INFORMATION THE SURROUTINE IS ABLE TO MAP THE POINT 
MUCH MORE RAPIDLY. THUS IF THE USER SEEKS TO MAP A 
SEQUENCE OF POINTS ONTO THE SAME CURVE, EFFICIENCY IS 
GAINED BY ORDERING THE VALUES INCREASING IN MAGNITUDE 
AND SETTING THE SIGNS OF ALL BUT THE FIRST, NEGATIVE, 

N CONTAINS -THE NUMBER OF POINTS WHICH WERE INTERPOLATED 
TO DETERMINE THE CURVE, 

X AND Y ARE ARRAYS CONTAINING THE X- AND Y-COORDINATES 
OF THE INTERPOLATED POINTS, 

XP AND YP ARE THE ARRAYS OUTPUT FROM KUPV2 CONTAINING 


> CURVATURE INFORMATION, 
> S CONTAINS THE POLYGONAL ARCLENGTH OF THE CURVE, 


SIGMA CONTAINS THE TENSION FACTOR (ITS SIGN IS IGNORED). 
THE PARAMETERS N,sXsYsXP,YP,S.AND SIGMA SHOULD BE INPUT 
UNALTERED FROM THE OUTPUT OF KURVI. 
ON OUTPUT-- . 
XS AND YS CONTAIN THE X-ANDY-COORDINATES OF THE IMAGE 
POINT ON THE CURVE. 
T»N.X,Y,XP,YP,5, AND SIGMA ARE UNALTERED. 
DENORMALIZE SIGMA 
SIGMAP = ABS(SIGMA)*FLOAT(N=1)/5 
STRETCH UNIT INTERVAL INTO ARCLENGTH DISTANCE 
TN = ABS(T#*S) 
FOR NEGATIVE T START SEARCH WHERE PREVIOUSLY TERMINATED, 
OTHERWISE START FROM BEGINNING 
IF (T-LT.@.) GO TO 18 
Il = 2 
= X¢€1) 
YS = YO1) 
SUM = @. 
IF (TeLE.@.) RETURN 
1@ CONTINUE 
DETERMINE INTO WHICH SEGMENT TN IS MAPPED 
DO 3@ I=1L1,N 
DELX = X¢I) - K¢I-1) 
DELY = YCL) - YCI-1) 
DELS = SQRT(DELX*DELX+DELY*DELY) 
IF (SUM*#DELS-TN) 20, 4@, 48 
20 SUM = SUM + DELS 
3@ CONTINUE 
IF ABS(T) IS GREATER THAN 1., RETURN TERMINAL POINT ON 
CURVE 
XS = X(N) 
YS = YCN) 
RETURN 
SET UP AND PERFORM INTERPOLATION 
4@ DEL! = TN - SUM 
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DEL2 = DELS - DELI 

EXPS1 = EXP(SIGMAP*DELI) 

SINHDI = .5*(CEXPSi-1./EXPS1) 

EXPS = EXP(SIGMAP*DEL2) 

SINHD2 = .5*CEXPS-1./EXPS) 

EXPS = EXPS1*EXPS 

SINHS = .5*CEXPS+1./EXPS) 

XS = (XPCI)*SINHDI+XPCI-1)*SINHD2)/SINHS + 
* CCXCL)=XPCI)#DELI4(OXCL-1)-XPCL=1)9*"DEL2)/DELS 
YS = (YPCL)*SINHDIFYPCI-1)*SINHD2)/SINHS + 
* COYCL)-YPCL) #DELI*#CYCI=1)-YPCI=1)9*DEL2)/DELS 
let 

RETURN 

END 


SUBROUTINE KURVPICN, Xs, Ys XP, YP, TEMP, S, SIGMA) 
INTEGER N 
REAL X(N). YCN)s XPCN)s, YPCN)s TEMPC 1), Seo SIGMA 
THIS SUBROUTINE DETERMINES THE PARAMETERS NECESSARY TO 
COMPUTE A SPLINE UNDER TENSION FORMING A CLOSED CUPVE IN 
THE PLANE AND PASSING THROUGH A SEQUENCE OF PAIRS 
CXCLIZYVCLI sae e ep CXCNIZYOCN)) © FOR ACTUAL COMPUTATION OF 
POINTS ON THE CURVE IT IS NECESSARY TO CALL THE SUBROUTINE 
KURVP2. 
ON INPUT-- 
N IS THE NUMBER OF POINTS TO BE INTERPOLATED (N.-GE-2), 
X IS AN ARRAY CONTAINING THE N X-COORDINATES OF THE : 
POINTS, 
Y IS AN ARRAY CONTAINING THE N Y-COORDINATES OF THE 
POINTS, 
XP,YP ARE ARRAYS OF LENGTH AT LEAST Nz 
TEMP IS AN ARRAY OF LENGTH AT LEAST 24#N VHICH IS USED 
FOR SCRATCH STORAGE, 
AND 
SIGMA CONTAINS THE TENSION FACTOR. THIS IS A NON-ZERO 
QUANTITY (WHOSE SIGN IS IGNORED) WHICH INDICATES THE 
CURVINESS DESIRED. IF ABSCSIGMA) IS VERY LARGE (E.G. 58. 
) THE RESULTING CURVE IS VERY A POLYGON. A STANDARD 
VALUE FOR SIGMA IS APPROXIMATELY |. IN ABSOLUTE VALUE. 
ON OUTPUT-- 
N»sX,Y., AND SIGMA ARE UNALTERED. 
XP AND YP CONTAIN INFORMATION ABOUT THE CURVATURE OF THE 
CURVE AT THE GIVEN NODES, 
AND 
S CONTAINS THE POLYGONAL ARCLENGTH OF THE CURVE. 
NML = N - 1 
NPI =N + 1 
SET UP RIGHT HAND SIDES OF TRIDIAGONAL (WITH CORNER 
ELEMENTS) LINEAR SYSTEM FOR XP AND YP 
DELXI = X€2) - XC1>) 
DELY! = YC2) - Yl) 
DELS! 2 SQRTC(DELX1*DELX!+DELY1*DELYI) 
DX1 3 DELXI/DELS1 
DY! = DELYI/DELS1 
XPCL> = DX1I 
YPC1) = DYI 
TEMPC(1) = DELSI 
S = DELS! 
DO {8 1L=2,N 
IPl =1 +1 
IF CI-EQ.N) IPIt = 1 
DELX2 = XCIPi) - X¢Id 
DELY2 = YCIPI) - YCI) 
DELS2 = SORT( DELX2* DELX2+ DELY 2* DEL Ye) 
DX2 = DELX2/DELS2 
DY2 2 DELY2/DELS2 
XPCI) = DX2 = Dx! 
YPCI) = DY2 - DyYl 
TEMPCI) = DELS2 
DELX1 = DELX2 
DELY!t # DELY2 
DELS| = DELS2 
DX! = [x2 
DYt = DY2 
ACCUMULATE POLYGONAL ARCLENGTH 
S = S + DELSt 
1@ CONTINUE 
XPC1) = XPC1) = DXi 
YPCL)>) = YPC1) - DY1 
DENORMALIZE TENSION FACTOR 
SIGMAP = ABS(SIGMA) *FLOAT(N)/S 
PERFORM FORWARD ELIMINATION ON TRIDIAGONAL SYSTEM 
DELS = SIGMAP*TEMPC(N) 
EXPS = EXPCDELS) 
SINHS = .5*CEXPS-1./EXPS) 
SINHIN # 1I./¢CTEMP(N)*SINHS) 
DIAGL = SINHIN* (DELS*.5*"(CEXPS+1./EXPS)-SINHS) 
DIAGIN = 1./DIAGI 
SPDIGI] = SINHIN*(SINHS-DELS) 
SPDIAG = @. 
DO 2@ I=1,4N 
DELS = SIGMAP*TEMPC(I) 
EXPS = EXPCDELS) 
SINHS = .5*#CEXPS-1./EXPS) 
SINHIN = IL./CTEMPCI)*SINHS) 
DIAG2 = SINHIN*(DELS*(.5*(CEXPS+1./EXPS))-SINHS) 
IF (1.EQ-N) GO TO 398 
DIAGIN = 1-/CDIAGI+DIAG2-SPDIAG*TEMPCI-1)) 
XPCI) = DIAGIN*® (XPCI)-SPDIAG*XPC(I-1)) 
YPCI)D = DIAGIN*®(YPCE)-SPDIAG*YPCI-1)) 
TEMPC(N+1) = -DIAGIN*® TEMPCNMI +1) "SF'DIAG 
IF CI1.EQ@.1) TEMPC(NP1) = =DIAGIN*SFIDIGI 
SPDIAG = SINHIN#*(SINHS-DELS) 
TEMP(I1) = DIAGIN*SPDIAG 
DIAG] = DIAG2 
20 CONTINUE 
38 TEMPCNMI) = TEMPCN+NM1) - TEMPCNMI) 
IF (N.EQ.2) GO TO 58 
PERFORM FIRST STEP OF BACK SUBSTITUTION 
DO 48 I=3,N 
IBAK = NPI - I 
XPCIBAK) = XPCIBAK) - TEMPCI BAK) *XP(IBAK+1) 
YPCIBAK) = YPCIBAK) - TEMPCIBAK) *YPCIEAK+ 1) 
TEMPC(CIBAK) = TEMP(N+I BAK) - TEMPCIBAK)*TEMPCIBAK+ 1) 
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4@ CONTINUE 
58 XP(N) = 
* CXPC(NJ-SPDIGI*®XPC1)-SPDIAG#XP(NM1))/(CDIAGI+DIAG2+SPDIGI*T 
* EMP(1)+SPDIAG*TEMPC(NM1 >) 
YPC(N) = 
* CYP(N)-SPDIGI*YP(1)-SPDIAG#YP(NM1))/(DIAGI*DIAG2+SPDIGI*T 
* EMP(1)+SPDIAG*TEMP(NM1)) 
C PERFORM SECOND STEP OF BACK SUBSTITUTION 
DO 6@ I=1t.NM!I 
XPCI) = XPCID + TEMPCI)*XP(N) 
YPCI) = YPCL) + TEMPC1) *YPCN) 
6@ CONTINUE 
RETURN 
END 


SUBROUTINE KURVP2(T, XS» YSs Ns Xo Ys XP» YP, Ss SIGMA) 

INTEGER N 

REAL Ts XSs YSs X(N)» YC(N), XPCN)s YPCN)» Ss SIGMA 
THIS SUBROUTINE PERFORMS THE MAPPING OF POINTS IN THE 
INTERVAL (@+,.1-) ONTO A CLOSED CURVE IN THE PLANE. THE 
SUBROUTINE KURVP1 SHOULD BE CALLED EARLIER TO DETERMINE 
CERTAIN NECESSARY PARAMETERS. THE RESULTING CURVE HAS A 
PARAMETRIC REPRESENTATIONBOTH OF WHOSE COMPONENTS APE 
PERIODIC SPLINES UNDER TENSION AND FUNCTIONS OF THE POLY- 
GONAL ARCLENGTH PARAMETER. 
ON INPUT-- 
T CONTAINS A REAL VALUE OF ABSOLUTE VALUE LESS THAN OR 
EQUAL TO l- TO BE MAPPED TO A POINT ON THE CURVE. THE 
SIGN OF T IS IGNORED AND THE INTERVAL (@.,1+) IS MAPPED 
ONTO THE ENTIRE CLOSED CURVE. IF T IS NEGATIVE THIS 
INDICATES THAT THE SUBROUTINE HAS BEEN CALLED PREVIOUSLY 
(WITH ALL OTHER INPUT VARIABLES UNALTERED) AND THAT 
THIS VALUE OF T EXCEEDS THE PREVIOUS VALUE IN ABSOLUTE 
VALUE. WITH SUCH INFORMATION THE SUBROUTINE IS ABLE TO 
MAP THE POINT MUCH MORE RAPIDLY» THUS IF THE USER SEEKS 
TO MAP A SEQUENCE OF POINTS ONTO THE SAME CURVE, 
EFFICIENCY IS GAINED BY ORDERING THE VALUES INCREASING 
IN MAGNITUDE AND SETTING THE SIGNS OF ALL BUT THE FIRST, 
NEGATIVE, ‘ 
N CONTAINS THE NUMBER OF POINTS WHICH WERE INTERPOLATED 
TO DETERMINE THE CURVE, 
X AND Y ARE ARRAYS CONTAINING THE X- AND Y-COORDINATES 
OF THE INTERPOLATED POINTS, 
XP AND YP ARE THE ARRAYS QUTPUT FROM KURVP1 CONTAINING 
CURVATURE INFORMATION. . 
S CONTAINS THE POLYGONAL ARCLENGTH OF THE CURVE, 
SIGMA CONTAINS THE. TENSION FACTOR CITS SIGN IS IGNORED). 
THE PARAMETERS NoXsYsXPsYP.S AND SIGMA SHOULD BE INPUT 
UNALTERED FROM THE OUTPUT OF KURVPI. 
ON OUTPUT-- : 
XS AND YS CONTAIN THE X- AND Y-COORDINATES OF THE IMAGE 
POINT ON THE CURVE. 
TsNsXsYsXP+YP,S AND SIGMA ARE UNALTERED. 
DENORMALIZE SIGMA 

SIGMAP = ABS(SIGMA) *FLOAT(N)/S 
STRETCH UNIT INTERVAL INTO ARCLENGTH DISTANCE 

TN = ABS(T*S) 


QagaNMAANANAMANNMAAANANANAAMANAAANAAAAAAANAAANN 


a 


C FOR NEGATIVE T START SEARCH WHERE PREVIOUSLY TERMINATED, 
C OTHERWISE START FROM BEGINNING 
IF (T.LT.G-)> GO TO 18 
2 
SUM = @. 
1@ IF CLIl-EQ@.1) GO TO 58 
C DETERMINE INTO WHICH SEGMENT TN 1S MAPPED 


DO 36 I=#l1,N 
DELX = XC€1I) = X¢CI-1) 
DELY * YCI) - YCI-1) 
DELS = SORTCDELX*DELX+DELY*DELY) 
IF (SUM+*DELS-TN) 280, 40, 498 
20 SUM = SUM + DELS 
38 CONTINUE 
Isl 


IMI = N 
DELS = S - SUM 
GO TO 5@ 


4@ IMI = 1 = 1 
C SET UP AND PERFORM INTERPOLATION 
5@ DEL! = TN - SUM 
DEL@ = DELS - DEL! 
EXPS1] = EXP(SIGMAP*DEL1!) 
SINHD] = .5*CEXPS1-1-/EXPS1) 
EXPS = EXP(SIGMAP*DEL2) 
SINHD2 = .5*CEXPS~-1./EXPS) 
EXPS = EXPSI*EXPS 
SINHS = .5S*CEXPS-1./EXPS) 
XS = (XPCI)*SINHDI+XPCIM!)*SINHD2)/SINHS + 
* COXCE)-XPCI) )®DELI+¢CXCIM1)-XPCIM1))*DEL2)/DELS 
YS = CYPCL)*SINHDI+YPCIMI)*SINHD2)/SINHS + 
* CCYCI)-YPCI) RDELI*CYCIMI) -YPCIM1))*DEL2)/ DELS 
Ti al 
RETURN 
END 
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Generator of Set-Partitions to 
Exactly R Subsets [G7] 


Gideon Ehrlich |Reed. 11 Dec. 1972 and 26 Feb. 1973] 
Department of Applied Mathematics, The Weizmann 
Institute of Science, Rehovot, Israel 


Key Words and Phrases: partitions, subset generation, 
permutations 

CR Categories: 5.39 

Language: Algol 


Description 

Purpose. Procedure PARTEXACT produces, by successive 
calls, a sequence of all S(,r) partitions of a set of # distinct ele- 
ments into exactly r mutually exclusive subsets. (S(#,r) is the Stirling 
number of the second kind, see {1].) We assume that 7 > r > 2. 

There is no distinction of order: neither within subsets nor 
among them. We assume the elements to be numbers 1, 2,..., #. 
(If this is not the case, we just index the elements.) We also assume 
that we have a sequence of r numbered cells in which the subsets 
are located.: The first cell contains the number 1 (together with the 
whole subset to which 1 belongs), then each cell contains the mini- 
mal element not contained in the preceding cells. Partitions are 
represented -by an address-array, a, of # components. Every / is 
located in the cell numbered a(/). It follows that: 


1 acs 4, 


2. a(/) < min (max a(m) + 1, r). 


m<f 
After each call to PARTEXACT we receive a new address-array, 
a, which differs from the old one in, at most, two components. A 
new partition is received from the old one by transferring s from 
the os cell to the ws cell, and if « + 0, then we have also to transfer u 
from the ow cell to the vu cell. Together with the last a we will get 
i = 1, and we must not call PARTEXACT again. 

The variables. n, r, k, z are global integers; p[2:n], ¢{1:] are 
global integer arrays; a{l:#] is an integer array. The space required 
by PARTEFACT is thus 37 approximately. 

Initialization. One can initiate PARTEXACT using the fol- 
lowing block: 


begin integer /, 
kKi:=n—rtl; 
for ; = | step | until k doal/) := pl/l:= 1; 
for /:= k + I step I until 7 do al/] := 1 +/ 
i:= ks kl: =k —1;k—-1]:=0;2:=1 
end 


a defines the first partition. In the case 7 = r we geti = 1, 
and we stop immediately. The variables must not be changed be- 
tween calls. 

PARTEXACT has the important feature of being loopless, so 
the computation time of the new partitions is uniformly bounded. 
There is no dependence on nv (or r). The computation time of the 
whole sequence is thus a linear function of its length --s(s,r). It is 


477-P 1- 0 


to be noted that much computation time is saved, provided the 
main program deals not with the entire newly generated partition 
but with the changed element(s) only. 

For r = 2, GRAY2 [2] has to be used with ‘‘0” and ‘‘1”’ speci- 
fying the first and the second cells, respectively. The initial address 
vector [A = (0,0,..., 0)] must not be used. Together with the last 
partition GRAY2 will set i = 1 (for the first time). 

Algorithm details. z and k are the minimal numbers such that 
a(k + 1), a(k+2),..., a(n) arez + 1,z + 2,..., 7, respectively, 
a(1), a(2),..., a@(@i—1) are not changed until a(i) takes all available 
values: that is, if i > A then no other value but its present one, 
else, all values between 1 and min(max,, < ; a(m) + 1, 7). All those 
values are ordered in a sequence starting at | and ending at 2 
(“a 1-2 path’) or vice versa (‘fa 2-1 path’’). Each sequence can 
be illustrated as moving a route of / along all available cells each 
time visiting one new cell. 

Each of the seven labels ONE...SEVEN, appearing in 
PARTEXACT, deals with a special segment of one of the two paths. 
It moves i to the appropriate new cell. ONE deals with the first 
move of an element initially located in the first cell. The roles of 
the other labels are illustrated in Figures 1 and 2. Each of the arrows 
describes the effect of the appropriate label. 

If i enters the cell z + 1, we transfer k + 1 from that cell to 
the first one (from which it starts a 1-2 path). On the other hand, 
if the move of 7 empties its old cell, we transfer & to that cell. For 
each i, p(i) denotes the segment of the /’s path according to which i 


Fig. 1. [-2 path. 


ae 


Fig. 2. 2-1 path. 
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is moved. After each move of 7, i + 1 moves a whole new route. 
After each move of i + 1, i + 2 moves a whole new route, and so 
on. Between two successive paths of i there will be a single move of 
some / < i. 

t and / contain the information about the queue of elements 
to be moved. If i completes a path, then NOGA updates r and i. 
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Otherwise, OF RA does the job. Full explanations about ¢, its up- 
dating, and a description of the whole method are included in (3]. 


References 


1. Even, S. Algorithmic Combinatorics. Macmillan, New York, 
1973, Ch. 3. 


2. Ehrlich, G. GRAY2—a binary reflected Gray Code Generator. 


(to be published). 

3. Ehrlich, G. Loopless algorithm for generating permutations 
combinations and other combinatorial configurations. J. ACM 
20 (July 1973), 500-513. 


Algorithm 
procedure PARTEXACT (a, 5, 05, 1s, u, ou, nu, i); 
integer array a; 
integer s, 05, ns, u, ou, nu, ft; 
begin 
switch L := ONE, TWO, THREE, FOUR, FIVE, SIX, SEVEN; 
S:= é,08:= als];u:=0,; 
go to L[ pli]; 
ONE: 
ns := ali] := z := 2; pli] := 7; 
if i = k then 
begin 
u:=k:=k+1;0u:= alul; nu:= alu) := 1; 
plk} = 6 
end; 
go to NOGA; 
Two: 
ns:= alij:i=z:=2—1; 
comment The old cell of i was emptied; 
u:= k;ou:=aluljnu:=alk]:=z4+1;k:=k -1; 
ifz = 2 then 
begin pli] := 7; go to NOGA end; 
plil := 3; goto OFRA; 
THREE: 
as = ali) := ali] — 1; 
if ns ~ 2 then go to OFRA; 
pli] := 7; goto NOGA, 


FOUR: 
u:= k;ou:= alul;nu:= alu) := 2; 
ziez-lhk:i=k —-41; 

FIVE: 
ns := ali] := 1; pli) := 6; 

NOGA: 


if i = k then begin i = (i); go to EXIT end: 
if [i] < 1 then 
begin if —/\i] + i — 1 then s(/—1] := sli); di] := i — Lend; 


if i =: k — 1 then begin 7{k] := & — 1; Ak—1]:= —i — lend; 
itd) := Ajji:=k; 
go to EXIT; 
SIX: 
if z =: rthen 
begin ns := ali) := r; pli] := 3 end; 
else 
begin 


ns := ali) :=z:= 24-1; pli] := 2; 
u:=k:=k+1;0u:= alu); au:= alk) :=11; plk] := 6 
end; 
go to OFRA; 
SEVEN: is 
ns :== ali} := ali] +1; 
if ns > z then 
begin 
if z = rthen pli] := 5 
else 
if ali] = z + 1 then 
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begin 
comment j enters the cell of A + 1; 
zi=zt+1;plil:= 4; 
u:=k:=k+4;0u:= alul;nu:= alk] := 3; 


pik] := 6 
end 
end; 
OFRA: 
if i = & then go to EXIT; 
kl:=ek —1; 
ifi ~ k — 1 then {k—1] = —i; 
i:=k; 
EXIT: 
end PARTEXACT 


0 
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and 8 May 1973] 
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Victoria, B.C., Canada 
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CR Categories: 5.13, 5.41 

Language: Fortran 


Description 
The algorithm calculates an /, solution to an overdetermined 
system of m linear equations in 1 unknowns, i.e., given equations 


ee Aig Ny 2 b; fori = 1, 2, 22 


the algorithm determines a vector x = {.x,} which minimizes the 
sum of the absolute values of the residuals 


e(x) = DE La: — Dip ais x, |. (1) 

A typical application of the algorithm is that of solving the 
linear /, data fitting problem. Suppose that data consisting of m 
points with co-ordinates (¢;, );) is to be approximated by a linear 
approximating function aid: (1) + ade (1) +++ + and, (2) in 
the 4, norm. This is equivalent to finding an /, solution to the system 
of linear equations 


ae ?; (tide; = Vi fori = ie Zi veg MN, 


If the data contains some wild points (i.c. values of the dependent 
variable that are very inaccurate compared to the overall accuracy 
of the data), it is advisable to calculate an /, approximation rather 
than an /, (least-squares) approximation, or an /, approximation. 

The algorithm is a modification of the simplex method of linear 
programming applied to the primal formulation of the /; problem. 
A feature of the routine is its ability to pass through several simplex 
vertices at each iteration. The algorithm does not require that the 
matrix {a;,;} satisfy the Haar condition, nor does it require that it 
be of full rank. Complete details of the method may be found in 
[1]. Computational experience with this and other algorithms indi- 
cates that it is the most efficient yet devised for solving the 4 
problem. 

The parameters M and N represent the number of equations 
and number of unknowns respectively. 442 and N2 should be set to 
M + 2and N + 2 respectively. The simplex iterations are carried 
out in the two dimensional array A of size (M2,N2). Initially the 
coefficients of the matrix {a;,;} should be stored in the first M4 rows 
and first N columns of A, and the right hand side vector {5,;} should 
be stored in the array 8B. These values are destroyed by the routine. 
TOLER is a real variable which should be set to a small positive 
value. Essentially the routine regards any quantity as zero unless 


Lj mam Don 
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its magnitude exceeds TOLER. In particular, the routine will not 
pivot on any number whose magnitude is less than TOLER. 
Computational experience suggests that TOLER should be set 
to approximately 10-%/3 where d represents the number of decimal 
digits of accuracy available (typically we run the routine on an 
IBM 370 using double precision (16 decimal digits) with TOLER 
set to 10711). On exit from the routine, the array X contains an 
1, solution {x,;} and the array E contains the residuals 
{b; — doh a;; x;}. The array S is used for workspace. The fol- 
lowing information is stored in the array 4 on exit from the routine: 


A(M+1,N+1), the minimum value of (1), i.e. the minimum sum 

of absolute values of the residuals. 

A(M-+1,N-+2)—the rank of the matrix {«;,;}. 

A(M+2,N+1)—exit code with the value 1 if a solution has been 
calculated successfully, and 2 if the calculations are terminated 
prematurely. This latter condition occurs only when rounding 
efrors cause a pivot to be encountered whose magnitude is 
less than TOLER, and in this event all output information 
pertains to the last completed simplex iteration. This condition 
does not occur too frequently in practice, and then only with 
a large ill-conditioned problem. Since an /, solution is not nec- 
essarily unique, the routine attempts to determine if other 
optimal solutions exist. An exit code of 1 indicates that the 
solution is unique, while an exit code of O indicates that the 
solution almost certainly is not unique (this uncertainty can 
only be resolved by a close examination of the final simplex 
tableau contained in A: we do not consider such an examina- 
tion to be warranted in practice). A solution may be nonunique 
simply because the matrix {a;,;} is not of full rank. 

A(M+2,N+2)—number of iterations required by the simplex 
method. 


References 

1. Barrodale, I., and Roberts, F.D.K. An improved algorithm for 
discrete /, linear approximation. SIAM J. Numer. Anal. 10,5 (1973). 
839-848 


Algorithm 

SUBROUTINE LI(M.N.M2,N2,A.B, TOLEP,XsE.5S) 
THIS SUBROUTINE USES A MODIFICATION OF THE SIMPLEX METHOD 
OF LINEAR PROGRAMMING TO CALCULATE AN Li SOLUTION TO AN 
OVER-DETERMINED SYSTEM OF LINEAR EQUATIONS. 
DESCRIPTION OF PARAMETERS. 
M NUMBER OF EQUATIONS. 


N NUMBER OF UNKNOWNS (M-GE.N). 

Me SET EQUAL TO M+2 FOR ADJUSTABLE DIMENSIONS. 
Ne SET EQUAL TO N+2@ FOR ADJUSTABLE DIMENSIONS. 
A TWO DIMENSIONAL REAL APRAY OF SIZE (M2,sN2). 


ON ENTRY, THE COEFFICIENTS OF THE MATRIX MUST BE 
STORED IN THE FIRST M ROWS AND N COLUMNS OF A. 
THESE VALUES ARE DESTROYED BY THE SUBROUTINE. 

8 ONE DIMENSIONAL REAL ARRAY OF SIZE M. ON ENTRY, B 
MUST CONTAIN THE RIGHT HAND SIDE OF THE EQUATIONS. 
THESE VALUES ARE DESTROYED BY THE SUBROUTINE. 

TOLER A SMALL POSITIVE TOLERANCE. EMPIRICAL EVIDENCE 
SUGGESTS TOLER=1@#*(-D*2/3) WHERE D REPRESENTS 
THE NUMBER OF DECIMAL DIGITS OF ACCURACY AVALABLE 
(SEE DESCRIPTION). 


x ONE DIMENSIONAL REAL ARRAY OF SIZE N- ON EXIT. THIS 
ARRAY CONTAINS A SOLUTION TO THE LI PROBLEM. 

E ONE DIMENSIONAL REAL ARRAY OF SIZE M. ON EXITs THIS 
ARRAY CONTAINS THE RESIDUALS IN THE EQUATIONS. 

s INTEGER ARRAY OF SIZE M USED FOF WORKSPACE. 


ON EXIT FROM THE SUBROUTINE, THE ARRAY A CONTAINS THE 
FOLLOWING INFORMATION. 
ACM+1,N¢1) THE MINIMUM SUM OF THE ABSOLUTE VALUES OF 
THE RESIDUALS. : 
THE RANK OF THE MATRIX OF COEFFICIENTS. 
EXIT CODE WITH VALUES. 
@ - OPTIMAL SOLUTION WHICH IS PROBABLY NON- 
UNIQUE (SEE DESCRIPTION). 
1 - UNIQUE OPTIMAL SOLUTION. 
2 - CALCULATIONS TERMINATED PREMATURELY DUE TO 
ROUNDING ERRORS. 
ACM+2,N¢2) NUMBER OF SIMPLEX ITERATIONS PERFORMED. 
DOUBLE PRECISION SUM 
REAL MIN, MAX» ACM2.N2), X(N), ECM), BCM) 
INTEGER OUT, S(M) 
LOGICAL STAGE. TEST 


ACM+1,N42) 
ACM*2,N41) 


MQaanaanaanaanaananoaraananaanacaaanaaaananaanannaaaaa 
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C BIG MUST BE SET EQUAL TO ANY VERY LARGE REAL CONSTANT. 
C ITS VALUE HERE IS APPROPRIATE FOR THE IBM 378. 


DATA BIG/1.E75/ 


C INITIALIZATION. 


Ml = M+ 1 

Nlene lt 

DO 16 J2I.N 
ACM2,J) = J 
XCJ) 7 Be 


18 CONTINUE 


26 
3@ 
46 


DO 48 I=1,M 
ACILN2) = N+ I 
ACILNI) = BCI) 
IF (BC1).GE.8.) GO TO 398 
DO 26 Je#l,N2 
ACI.J) = -ACI,J) 
CONTINUE 
ECCI) = 6. 
CONTINUE 


C COMPUTE THE MARGINAL COSTS. 


DO 68 J#I.NI1 


SUM = 6.D@ 
DO S5@ I=1,.M 
SUM = SUM + ACI.J) 
58 CONTINUE 
ACM1,J) = SUM 
6@ CONTINUE 
C STAGE |. 


C DETERMINE THE VECTOR TO ENTER THE BASIS. 


a) 


88 


98 


STAGE = . TRUE. 


KOUNT = @ 
KR = 1 
KL = 1 
MAX = -1. 


DO 88 J#=KR,N 
IF CABSCACM2,J)).GT»N) GO TO 88 
D = ABSCACM!,J5)) 
XF (DeLE.«MAX) GO TO 88 
MAX = D 
IN = J 
CONTINUE 
IF CACMI,IN)-GE.@.) GO TO 188 
DO 98 I=1,.M2 
ACI,ZIN) = -ACI,IN) 
CONTINUE 


C DETERMINE THE VECTOR TO LEAVE THE BASIS. 


128 


138 


148 


K = @ 
DO 110 I=KL.M 
D = ACILIN) 
(F (D.-LE.TOLER) GO TO 118 
K = K + 1 
BCK) = ACILNI)/D 
SCK) = I 
TEST = .TRUE.- 
CONTINUE 
IF (K.GT.-@) GO TO 138 
TEST = .FALSE. 
GO TO 156 
MIN = BIG 
DO 148 121,K 
IF (BCI).GE-MIN) GO TO 148 
J = I 
MIN = BCI) 
OUT = S¢1) 
CONTINUE 
BcJ) = BCK) 
S«J) = SCK) 
Ke K- 1 


C CHECK FOR LINEAR DEPENDENCE IN STAGE I. 


158 


168 


1768 


198 


IF (TEST .OR- «NOT.-STAGE) GO TO 178 
DO 168 I=1,M2 
D = ACI.KR) 
ACILKR) = ACILZIN) 
ACI,IN) = D 
CONTINUE 
KR = KR + 1 
GO TO 268 
IF (TEST) GO TO 188 
ACM2,N1) = 2. 
GO TO 358 
PIVOT = ACOUT.IN)D 
IF CACMI,IN)-PIVOT-PIVOT-LE-TOLER) GO TO 288 
DO 198 J=KR.NI 
D = ACOUT.,J) 
ACMI,J) = ACM1,J) - D- D 
ACOUT.J) = -D 
CONTINUE 
ACOUT,N2) = -ACOUT,N2) 
GO TO 126 


C PIVOT ON ACOUT,IN)- 


206 


216 


220 
238 


248 


DO 218 J#KR.NI 
IF CJ-EQ.IN) GO TO 218 
ACOUT,J) = ACOUT.J)/PIVOT 
CONTINUE 
DO 230 [=1,M1 
IF ¢1-EQ.0UT) GO TO 238 
Ds ACI,ZIN) 
CO 228 J=KR.N1 
IF (J-EQ-IN) GO TO 2208 
ACI,J) = ACIZJ) - D*¥ACOUT.J) 
CONTINUE 
CONTINUE 
DO 248 f=1,Mi1 
IF (I1-EQ@.OUT) GO TO 24¢ 
ACIZIN) = -ACI,ZIN)/PIVOT 
CONTINUE 
ACOUT,ZIN) = L-/ PIVOT 
D = ACOUT.N2) 
ACOUT.N2) = ACM2, IN) 
ACM2,IN) = D 
KOUNT = KOUNT + 1 
IF ¢€.NOT-STAGE) GO TO 276 
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C INTERCHANGE ROWS IN STAGE I. 
KL = KL + I 
DO 25@ J=KR,N2 
D = ACOUT.J) 
ACOUT, J) = ACKOUNT, J) 
ACKOUNT,J) = D 
250 CONTINUE 
260 IF (KOUNT+KR.NE.NI) GO TO 72 
C STAGE 11. 
STAGE = «FALSE. ’ 
C DETERMINE THE VECTOR TO ENTER THE BASIS. 
278 MAX = -BIG 
DO 298 J=KR.N 
D = ACMI,J5) 
IF (D.GE.@.) GO TO 280 
IF (D.GT.¢-2.)) GO TO 299 


D= -D - 2. 

288 IF (DeLE+MAX) GO TO 290 
MAX = D 
IN 2 J 


298 CONTINUE . . 
IF (MAX.LE.TOLER) GO TO 318 
IF CAC(M1,IN).GT.@-) GO TO 168 
DO 388 121.M2 
ACI,ZIN) = -ACI,IN) 
388 CONTINUE 
ACMI,IN) = ACMIZIN) - 2. 
GO To 108 
C PREPARE OUTPUT. 
318 L = KL - } 
DO 330 1@1,L 
IF CACILJNI)-GE-8-) GO TO 338 
DO 328 J=KR.N2 
ACIsJ) # -ACIsJ) 
328 CONTINUE 
336 CONTINUE 
ACM2,N1) = @- 
IF (KR-NE.t) GO TO 358 
DO 348 J=1,N 
D = ABSCACMI.J)?) 
IF (D.LE.TOLER .OR. 2--D-LE-TOLER) GO TO 358 
348 CONTINUE 
ACM2,N1) = 1. 
358 DO 388 1=1.M 
K = ACI,N2) 
D = ACILNID 
IF (KeGT.@) GO TO 368 
K = -K 
D = -D 
368 IF (1.GE.-KL) GO TO 37@ 
X(K) = D 


378 K = K - N 
ECK) = D 
38@ CONTINUE 
ACM2,N2) = KOUNT 
ACMI,N2@) = NI - KR 
SUM = 6.D@ 
DO 396 I=KLsM 
SUM = SUM + ACILNID 
398 CONTINUE 
ACMILN1) = SUM 
RETURN 
END 


Footnote to Algorithm 478 

The major portion of the computation performed by the above 
subroutine is transforming the two-dimensional array A at each 
iteration. We have experimented with a modified code which trans- 
forms the columns of A, one at a time, by passing each column to 
a second subroutine which involves only one-dimensional arrays. 
Savings in time of about 25 to 40 percent are normally achieved by 
this modification. This is because Fortran stores two-dimensional 
arrays columnwise. 

To implement this modification in the above subroutine, the 
user should: (i) delete the eight lines immediately following state- 
ment number 20 up to and including statement number 22; (ii) re- 
place these eight lines by 


DO 22J=KR,NI 
IF(J.EQ.IN) GO TO 22 
CALL COL (A (1,J),A(,IN),A(OUT,J),M1,OUT) 
22 CONTINUE 


and (iii) include the following subroutine 


SUBROUTINE COL (V1,V2,MLT,M1,IOUT) 
REAL V1(M1),V2(M1),MLT 
DO 1 I=1,MI 
IF(ILEQ.IOUT) GO TO 1 
VI (1) = V1 (1) — V2(1)*MLT 
1 CONTINUE 
RETURN 
END 
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Remark on Algorithm 478[F4| 


Solution of an Overdetermined System of Equations in 
the 4, Norm [I. Barrodale and F.D.K. Roberts, Comm. 
ACM 17 (June 1974), 319-320] 

Fred N. Fritsch and Alan C. Hindmarsh [Recd 23 
Sept. 1974], Numerical Mathematics Group, Lawrence 
Livermore Laboratory, University of California, 
Livermore, CA 94550 


This note is to point out an error in the‘ Footnote to Algorithm 
478.” To correspond to the published listing, the statement numbers 
in (1) of the second paragraph of the footnote should be 210 and 230, 
rather than 20 and 22. To be consistent with the published statement 
numbering, we would also recommend that statement number 22 
be changed to 220 in the three places it occurs in the replacement 
coding of (ii). 
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Algorithm 479 


A Minimal Spanning Tree 
Clustering Method [Z] 


R.L. Page [Recd. 18 Feb. 1972, 8 Feb. 1973, and 29 Mar. 
1973] 


Department of Mathematics and Computer Science. 
Colorado State University, Fort Collins, CO 80521 


Key Words and Phrases: clustering, pattern recognition. feature 
selection, minimal spanning trees . 

CR Categories: 3.63, 5.39, 5. 5 

Language: Fortran 


Description : 

Zahn [2] describes a method for automatically detecting clusters 
in sets of points in N-space. The method is based on the construction 
of the minimal spanning tree of the complete graph on the input set 
of points. The motivation for using the minimal spanning tree in- 
cludes some evidence (cited in {2]) that it is related to human per- 
ception of dot pictures in two dimensions, but the method is appli- 
cable in any dimension. 

Advantages of the method are that it requires little input other 
than the data points, it is relatively insensitive to permutations in 
the order of the data points, and the clusters it produces in two di- 
mensions closely parallel clusters detected visually by humans when 
the data is displayed as a dot picture. 

Storage requirements increase linearly with the 7, the aac of 
points. The minimal spanning tree is constructed using an algorithm 
due to Prim and Dijkstra as implemented by Whitney |1]. The time 
needed is approximately proportional to ?. (Time also increases 
slowly with NV.) Whitney's algorithm is repeated here because we 
need to keep some information about the tree structure which his 
algorithm does not retain in a convenient form. 

The basic idea is to detect inherent separations in the data by 
deleting edges from the minimal spanning tree which are significantly 
longer than nearby edges. Such an edge is called inconsistent. Zahn 
suggests the following criterion: an edge is inconsistent if (1) its 
length 1s more than f times the average of the length of nearby 
edges, and (2) its length is more than s standard deviations larger 
than the average of the lengths of nearby edges (standard deviation 
computed on the lengths of nearby edges). The real numbers-f and's 
may be adjusted by the user. The question of determining which 
edges are “nearby” is also answered by the user. We will say point 
P is nearby point Q if point P is connected to point Q by a path in 
the minimal spanning tree containing d or fewer edges (d is an in- 
teger determined by the user). 

Deleting the inconsistent edges breaks up the tree into several 
connected subtrees. The points of each connected subtree are the 
members of a cluster. 

Use of the program. There are two steps involved in clustering 
a point set using this Fortran implementation of Zahn’s algorithm. 

Step 1. Call the subroutine GROW to construct the minimal 


Funds for computer time used in development of this algorithm 
were provided by National Science Foundation Grant GJ561. 
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spanning tree of the point set. GROW needs four parameters: (1) an 
array of real numbers specifying the point set; (2) an integer specify- 
ing the dimension of the space in which the points lie; (3) an integer 
specifying the number of points in the set; and (4) a logical value, 
true if the user ‘would like a description of the minimal spanning 
tree to be printed on unit 6, and false otherwise. The array of param- 
eter (1) is treated as if it were a matrix (stored by columns) in which 
each column represents a point in the input point set. To be more 
specific, the array must be arranged so that its (K—1)*DIMEN + 
Ith value is the /th component of the Kth vector in the point set. 
(DIMEN stands for the dimension of the space in which the points 
lie.) 

Step 2. Call the subroutine CLUSTR to determine the clusters 
in the point set. CLUS7TR needs six parameters: (1) the integer d de- 
fining the term “‘nearby”’; (2) the real number f described above; (3) 
the real number s described above; (4) an array to be used for out- 
put; (5) the declared length of the output array; and (6) a logical 
value, true if the user desires a description of the clusters determined 
to be printed on unit 6, and false otherwise. If parameter (5) is zero, 
the output array (parameter (4)) will not be used. Otherwise, the 
output array, which we call C here, will be filled with integers as 
follows: the first element will be the number of clusters detected; the 
remaining elements will be arranged in blocks of varying length, 
each block describing one cluster the first element in each block 
being the number of points in the cluster, and the remaining ele- 
ments of the block being the labels of the points in the cluster (a 
point's label will be its relative position in the input point set; thus 
the first point in the input has label 1, the second, label 2, etc.). 

Once step | has been completed for a particular point set, step 
2 may be repeated with different parameters without repeating 
step I. 

Restrictions. (1) As written, the program will handle only 100 
data points, but that can be easily changed by increasing the dimen- 
sions of three arrays in GROW and five arrays in CLUSTR (see 
program for directions). (2) The first parameter in CLUSTR must 
not be larger than 18. This too can be easily changed by increasing 
the dimension of two arrays in CLUSTR (see program). (3) Blank 
common is used to store the minimal spanning tree. 

Tests. The program has been tested on a CDC 6400 with several 
different input point sets of varying size and dimension, both artifi- 
cially generated and real data. The artificially generated data in- 
cluded three two dimensional point sets with two, four, and five 
clusters and one three-dimensiona! point set with eight clusters as 
well as some higher-dimensional, larger point sets used for timing 
analysis. Time to run GROW increases hike #7; time to run CLUSTR 
normally increases like , but in the worst case increases like 1°. 
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TO CLUSTER A POINT SET USING THIS ALGORITHM, TVO THINGS 
NEED TO BE DONE. (1) BUILD THE MINIMAL SPANNING TREE BY 
CALLING GROW, AND (2) DELETE ITS INCONSISTENT BRANCHES BY 
CALLING CLUSTR» ONCE STEP (1) HAS BEEN DONE. STEP (2) CAN 
BE REPEATED OVER AND OVER WITH DIFFERENT PARAMETEFS. 

SEE THE BEGINNINGS OF GROW AND CLUSTR FOR EXPLANATIONS OF 
THE PARAMETERS. 

CURRENTLY. THE ARRAYS ARE DIMENSIONED TO HANDLE UP TO 100 
POINTS. TO CHANGE THIS, SIMPLY CHANGE THE SIZE.OF THE 
ARRAYS MST. NIT, AND UI IN GROV AS DIRECTED EELOW THEIP 
DECLARATIONS. ALSO. CHANGE THE LENGTHS OF 
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THE ARRAYS EDGE ST, EDGE PT. AVE, S@, AND NUMNEI AS 
DIRECTED IN THE SUBROUTINE CLUSTR. IN ADDITION, IF THE 
PARAMETER: D IN CLUSTR WILL BE LARGER THAN 18, CHANGE THE 
LENGTHS OF THE ARRAYS NEIG ST AND NEIG PT AS DIRECTED. 
SUBROUTINE GROWCDATA, DIMEN, NUMPTS, PRINT) 
INTEGER DIMEN, NUMPTS 
DIMENSION DATAC1) 
LOGICAL PRINT 
THIS SUBROUTINE COMPUTES THE MINIMAL SPANNING TREE OF THE 
COMPLETE GRAPH ON THE NUM PTS POINTS IN ARPAY DATA 
EACH POINT 1S A VECTOR WITH DIMEN COMPONENTS STOREL IN 
CONTIGUOUS LOCATIONS IN THE ARRAY CATA. SPECIFICALLY, 
DATAC (K-1)*DIMEN +1 ) IS ‘THE 1-TH COMPONENT OF THE K-TH 
VECTOR. THE ARRAY DATA MAY CONTAIN NUMEEFS IN EITHER 
INTEGER OR FLOATING POINT FORMAT AS LONG AS THE FORMAT IS 
CONSISTENT WITH THE TYPE SPECIFICATION OF THE PARAMETERS 
IN THE FUNCTION DIST. 
IF THE PARAMETER PRINT HAS THE VALUE .TRUE., THEN A 
A DESCRIPTION OF THE MINIMAL SPANNING TREE IS PRINTED ON 
UNIT 6. EACH NODE IS LABELED VITH AN INTEGER INDICATING 
ITS RELATIVE POSITION IN THE APRAY DATA. 
INTEGER DIM, Ns MST(8@Q)s LOC(1), NBR(1), NXTC1) 
REAL WTC) 
EQUIVALENCE (MST,LOC,NBR.WTLNXT) 
COMMON DIM, N. MST 
INTEGER LASTPT, FREE, PT 
MST CALIAS LOC. NER, kT, NXT) IS A DESCRIPTION OF THE 
MINIMAL SPANNING TREE. IT CONTAINS ONE LIST FOR EACH NODE. 
THE POINTERS TO THE HEADS OF THESE LISTS ARE STOPED IN THE 
FIRST N=NUM PTS LOCATIONS OF MST AND GO BY THE NAME MST. 
THE FIRST ELEMENT OF EACH LIST CONSISTS OF FOUR FIELDS 
STORED IN CONTIGUOUS WORDS OF MST. EACH FIELD IS CALLED BY 
A NAME WHICH IS AN ALIAS OF MST. 
FIELD 1: LOCATION IN DATA OF THE NOCE (LOC) 
FIELD 2: NAME OF NEIGHBORING NODE (NBP) 
FIELD 3: WEIGHT OF THIS BRANCH (VT) 
FIELD G: POINTER TO NEXT NEIGHBOP OR END MAPK2@ (NXT) 
EACH ADDITIONAL ELEMENT OF THE LIST CONSISTS OF THREE 
FIELDS. FIELD 1 ABOVE IS OMITTED. 
THE LENGTH OF THE ARRAY MST MUST BE AT LEAST 8*N . 
THE MINIMAL SPANNING TREE {1S COMPUTED USING THE ALGORITHM 
OF PRIM AND DIJKSTRA AS IMPLEMENTED BY WHITNEY (CACM 15, 
APR 1972). 
EACH COLUMN OF NIT IS A PAIR (NITCILI)NITC2,1)9,121,NITPD 
DENOTING A NODE NOT (YET) IN THE TREE AND ITS NEAFEST 
NEIGHBOR IN THE CURRENT TREE. UI(1) 1S THE LENGTH OF THE 
EDGE (NITCI.LI)DsNITC2,1)>. THE LENGTH OF THE APPAY UI AND 
THE NUMBEP OF COLUMNS OF NIT CANNOT PE LESS THAN N. 
INTEGER NIT(2,1@@) - 
REAL UL( 188) 
DIM = :DIMEN 
N = NUMPTS 
COMPUTE MINIMAL SPANNING TREE USING ALGOPITHM OF WHITNEY 
INITIALIZE NODE LABEL ARRAYS ANC SET UP LIST FOP NODE N=KP 
NITP =N - 1 
KP = N 
KPDATA = (KP-1}) "DIM + 1 
DO 1@ L=1,.NITP 
IDATA = (I-1)"DIM + 1 
NITCILI) = I 
UICI) = DISTCDATACIDATA), DATACKPDATA), DIM? 
NITC2,1) = KP 
1@ CONTINUE 
FREE ="N + 1 
MSTCKP) = FPEE 
LOC( FREE) = (KP=-J])*DIM + I 
FREE = FREE + 1 
NXTCFFEE+2) = @ 
UPDATE LABEL OF NODES NOT YET IN TREE. 
28 KPDATA = CKP-1)*CIM + 
DO 3@ I=1,NITP 
IDATA = (NITCI1,1)-1)*" 01M + 1} 
D = DISTCDCATACI DATA), DATACKPDATA) CIM) 
IF CUICI)-LE.D) GO TO 38 
UlCI) = D 
NITC2,1) = KP 
3@ CONTINUE 
FIND NODE OUTSIDE TREE NEAREST TO TREE 
UK 2 UIC) 
DO 4@ L=1,NITP 
IF (UrI¢C1).GT+-UK) GO TO 4@ 
UK = UIC) 
K = L 
4@ CONTINUE 
ADD NEW EDGE TO MST 
ADD NEIGHBOR TO LIST OF NODE NITC2,K) 
CHANGE END OF LIST MAPK TO POINT TO NEXT NEIGHEOP 
PT = LASTPTC(NITC2,K)) 
NXTCPT) = FREE 
ENTER NAME OF NEIGHBOR 
NBER(FREE) = NITCI.K) ; 
ENTER WEIGHT OF THIS BRANCH (OFFSET PICKS UP WT FIELD) 
WTCFREE+1) = ULCK) 
PUT IN END OF LIST MARK (OFFSET PICKS UP POINTER FIELD) 
NXTCFREE*2) = @ 
FPEE = FREE + 3 
NEW NODE--CREATE ITS NEIGHBOF LIST 
SET UP HEAD POINTER 
NODE = NITC1LK) 
MSTCNODE) = FREE 
ENTER LOCATION OF THIS NODE IN DATA 
LOC(FREE) = (NODE-1)*DIM + 1} 
ENTER NAME OF NEIGHBORING NODE (OFFSET PICKS UP NEF FIELD) 
NBR(FREE+1) = NITC2,K) 
ENTER WEIGHT OF THIS BRANCH (OFFSET PICKS UP WT FIELD) 
WTC FREE+2) = UICK) 
ENTER END OF LIST MARK (OFFSET PICKS UP POINTER FIELD) 
NXTCFREE+#+3) = @ 
FREE = FREE + & 
KP = NITC1,K) 
DELETE NEW TREE NODE FFOM ARPAY NIT 
UICK) = ULCNITP) 
NITCILK) = NITCILNITP) 
NITC2,K) = NITC2,NITP) 
NITP = NITP - 1 
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C THE MST IS FINISHED WHEN IT CONTAINS ALL NODES 
IF (NITP.NE.@) GO TO 28 
IF (PRINT) CALL PPTREE 
RETURN 
END 


SUBROLTINE CLUSTR(D, FACTOR, SPREAD,» Cs CLEN, PRINT) 
INTEGER D,» CLEN, C(CLEN) 
REAL FACTOR, SPREAD 
LOGICAL PRINT 
THIS SUBROUTINE FINDS THE CLUSTERS OF A POINT SET USING 
A MINIMAL SPANNING TREE CLUSTERING METHOD OF ZAHN. THE 
MINIMAL SPANNING TREE, COMPUTED BY SUBROUTINE GPOW, IS 
STORED IN BLANK COMMON. % 
THE ZAHN ALGORITHM FINDS CLUSTERS. BY DELETING INCONSISTENT 
EDGES FROM THE MINIMAL SPANNING TREE. AN INCONSISTENT EDGE 
BEING ONE WHOSE WEIGHT IS SIGNIFICANTLY LARGER THAN THE 
AVERAGE WEIGHT OF NEARBY EDGES. 
NEARBY MEANS CONNECTED TO THE EDGE IN QUESTION BY A 
PATH CONTAINING D OR FEWER EDGES. 
SIGNIFICANTLY LARGER MEANS 
WEIGHT .GT. FACTOR * AVERAGE 
AND VEIGHT .GT. AVERAGE + SPREAD * STANDAPD DEVIATION 
WHERE THE AVERAGE AND STANCARD DEVIATION ARE COMPUTED ON 
THE WEIGHTS OF NEARBY EDGES. 
THE OUTPUT VECTOR Cc DESCRIBES THE CLUSTERS DETERMINED. 
IT IS ARRANGED IN BLOCKS, EACH BLOCK DESCRIBING ONE 
CLUSTER- THE FIRST ELEMENT IN EACH BLOCK IS THE NUMBER 
OF NODES IN THE CLUSTER. THE REMAINING ELEMENTS ARE THE 
LABELS OF THE NODES IN THE CLUSTER, THE LABEL INDICATING 
THE RELATIVE POSITION OF THE NODE IN THE ARRAY DATA. THE 
FIRST BLOCK STARTS, AT C(2)- 
C(1) IS THE NUMBER OF CLUSTERS FOUND BY THE ALGORITHM. 
THE VALUE OF C LEN SHOULD BE THE TRUE SIZE OF 
THE ARRAY C. IT IS USED TO PREVENT INVALID SUBSCRIPTS. 
IF C LEN IS ZERO. THE ARRAY C WILL NOT BE USED. 
IF THE PARAMETER PRINT HAS THE VALUE -.TRUE.» CLUSTERS 
ARE PRINTED OUT ON UNIT 6. 
INTEGER EDGEST(1@1), EDGELN, EDGEPT«1@1) 
REAL AVEC(198), SQ(180), SUPPWT. W 
INTEGER NUMNEI (C120) 
INTEGER NEIGST(2@), NEIGLN. NEIGPTC28) 
THE ARRAY EDGE ST (EDGE STACK) IS A STACK OF NODES USED TO 
DIRECT THE SEARCH THROUGH THE TREE FOR INCONSISTENT EDGES. 
ITS LENGTH (EDGE LN) CAN GROW AS LARGE AS ONE MORE THAN 
THE NUMBER OF NODES IN THE TREE. 
THE ARRAY EDGE PT (EDGE POINTERS) IS A STACK OF POINTERS 
TO THE NEXT UNEXAMINED NEIGHBORING NODE OF THE NODE IN THE 
SAME POSITION IN EDGE ST. THUS THE LENGTH OF EDGE PT IS 
ALWAYS THE SAME AS THAT OF EDGE ST. 
THE ARRAY NEIG ST (NEIGHBOR STACK) IS A STACK OF NODES 
USED TO DIRECT THE AVERAGING OF THE WEIGHTS OF NEARBY 
EDGES. ITS LENGTH (NEIG LN) CAN GROW AS LARGE AS D+2. 
THE ARRAY NEIG PT 1S USED IN CONJUNCTION WITH NEIG ST. ITS 
LENGTH CAN GROW AS LARGE A D+2. 
THE ARRAYS AVE AND.SQ@ ARE USED TO EXPEDITE THE CALCULATION 
OF AVERAGE WEIGHTS. SPECIFICALLY, AVECI) STORES THE SUM OF 
THE WEIGHTS OF EDGES EXTENDING FROM THE I-TH NODE AND 
SQCI) STORES THE SUM OF THE SQUARES. SIMILARLY. NUMNEI(1) 
STORES THE NUMBER OF NEIGHBORS OF THE L-TH NODE. THUS EACH 
OF THESE ARRAYS MUST BE AS LONG AS THE NUMBER OF NODES, 
INTEGER FINDCN, A. B, DLESS! 
INTEGER CLS, INCLS(1), PARENT(1). BAKWRD, BEGCLS 
EQUIVALENCE CINCLS,EDGEST)s ¢PARENT» EDGEPT) 
INTEGER CP. OTHEND 
INTEGER DIM, N, MST(1)s LOCC1)s NBRC1), NXTC1) 
REAL WTC1) 
EQUIVALENCE (MST.LOC,.NBR,WTLNXT) 
COMMON DIM, Ns MST 
IF (PRINT) WRITE (6.99998) Ds FACTOR» SPREAD 
DLESS! = D- 1 
C COMPUTATION SECTION 
C SUM BRANCH WEIGHTS OFF EACH NODE (DEPTH 1) 
DO 28 NODE#1.,N 
NUMNEICNODE) = } 
K = MST(NODE) 
AVECNODE) = WTCK+2) 
SQ(NODE) = WTCK+2)%e2 
K = NXT(K#3) 
18 IF (K.EQ.8) GO TO 2e 
AVE(NODE) = AVEC(NODE) ¢ WTCK+1) 
SQ(NODE) = SQC(NODE) + WTCK+1) #2 
NUMNEI(NODE) = NUMNEICNODE) + 1 
K 2 NXTCK+#2) 
GO TO 18 
2@ CONTINUE 
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C INITIALIZE ECGE STACK WITH NODE 1 SURROUNDED BY ITS FIRST 
C TWO NEIGHBORS. SINCE THE TOP TWO ELEMENTS OF THE STACK 
C INDICATE THE DIRECTION OF TRAVEL ALONG A BRANCH, THE 
C SEARCH WILL FIRST BE DIRECTED AWAY FROM NODE | IN THE 
C DIRECTION OF ITS FIRST NEIGHBOR. WHEN ALL THE TPEE IN THAT 
C DIRECTION IS SEARCHED, THE SEARCH WILL PROCEDE AWAY FROM 
C ITS FIRST NEIGHBOR TOWARD NODE |. 
C THE EDGE PT STACK IS USED TO KEEP TRACK OF THE NEIGHBORS 
C OF THE CORRESPONDING NODE IN EDGE ST WHICH HAVE ALREADY 
C BEEN SEARCHED. EDGE PT(I) POINTS TO THE LOCATION OF 
C EDGE STC(I*l) IN THE LIST OF NEIGHBORS OF EDGE STCI> 
EDGELN = 3 
K = MSTCI1) 
EDGEST(2) = LOCCK)/DIM + 1 
EDGEST( 1) * NBRCK+1) 
EDGEST(3) = NBRCK+1) 
EDGEPT(1) = FINDCNCEDGEST( 1), EDGEST¢2)) 
EDGEPT(2) = K + 1 
EDGEPT(3) = -1 


C CLIMB TREE TO NEXT UNTESTED BRANCH 
30 CALL CLIMBCEDGEPT, EDGEST, EDGELN, Ni 
IF CEDGELN-LE-2) GO TO 78 
C CHECK THE EDGE BETWEEN NODE EDGE ST(EDGE LN -1!) AND 
C NODE EDGE STCEDGE LN) FOR INCONSISTENCY. 
A = EDGESTCEDGELN-1) 
B = EDGESTCEDGELN > 
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C SUM WEIGHTS OF ALL BRANCHES NEARBY BRANCH A--B REAL FUNCTION DISTCA, B, N) 
NEARBY = @ INTEGER N 
AV = 6. : REAL ACN). BCN) 
STDDEV = @. C THIS FUNCTION COMPUTES THE WEIGHT OF THE BRANCH BETWEEN 
C INITIALZE NEIG ST TO SUM WEIGHTS HEADING OFF NODE E C NODE A AND NODE B. IT SHOULD BE WRITTEN TO SUIT THE DATA. 
NEIGLN = 2 C THE TYPE DECLARATION OF A AND B SHOULD MATCH THE DATA. 
NEIGST(1) = A C THIS VERSION COMPUTES THE USUAL EUCLIDEAN DISTANCE. 
NEIGPTC1) = EDGEPTCEDGELN-1!)> 3 DIST = CACI)-BC1L) > a"2 
NEIGST(2). = B 5 - : DO 1@ 122,N 
NEIGPT(C2) = -1 DIST = DIST + CACI)-BCL)) a2 
ASSIGN 5@ TO OTHEND ae 18 CONTINUE 
C GO OUT TO DEPTH D-! ALONG BRANCHES NOT YET ADDED . DIST = S@RTCDISTD 
40 CALL CLIMB(NEIGPT, NEIGST, NEIGLN, DLESS1) RETURN 
C ADD WEIGHTS OF BRANCHES OFF THE TOP NODE LESS THE WEIGHT END 
C OF THE BRANCH SUPPORTING IT 
K-= NEIGPTCNEIGLN-1) 
SUPPWT = UTCKtL) 
K = NEIGST(NEIGLN) SUBROUTINE CLIMBCPOINTR, STACK, LN. D) 
AV = AV + AVECK) - SUPPYT INTEGER POINTR(1), STACK(1), LN.» D 
STDDEV = STDDEV + SG@C(K) = SUPPWT*«2 INTEGER SPACE(2), MST(1). NBRCL), NXTC1D 
NEARBY = "NEARBY + NUMNEICK) - 1 EQUIVALENCE (MST,NBR,NXT) 
C WHEN DEPTH OF STACK RETURNS TO 2, ALL BRANCH WEIGHTS OFF COMMON SPACE, MST 
C THIS END HAVE BEEN ADDED C STARTING FROM THE NODE ON TOP OF THE STACK, CLIMB OUT 
IF (NEIGLN-LE-2) GO TO OTHEND, (58,68) C TO DEPTH D OR TO A TERMINAL NODE. WHICHEVER OCCURS FIRST 
NEIGLN = NEIGLN - 1 ; 10 IF (CLN-EQ.D+2) RETURN 
GO TO 4@ a . K = POINTR(LN) 
C INITIALZE NEIG ST TO SUM WEIGHTS HEADING OFF NODE A IF (K) 20, 38, 48 
5@ NEIGLN = 2 C SET POINTER TO FIRST NEIGHBOR OF TOP NODE 
NEIGST(1) = B 2@ NODE = STACKCLN) 
NEIGPT(1) = FINDCN(CB,A) POINTR(LN) = MST(NODE) + 1 
NEIGST(2) = A GO TO 5@ 
NEIGPT(2) = -1 C BACK DOWN FROM TERMINAL NODE 
ASSIGN 68 TO OTHEND 30 LN = LN - 1 
GO TO 48 C CLIMB OUT ON NEXT NEIGHBOR IF POSSIBLE 
C TEST BRANCH A--B FOR INCONSISTENCY. 4@ POINTR(LN) = NXTCK+2) 
6@ AV = AV/FLOAT (NEARBY) 7 IF CPOINTRC(LN).£0.@) RETURN 
STDDEV = SQRTCABS(STDDEV/FLOAT(NEARBY) -AVxx2) ) C CHECK DIRECTION 
K = EDGEFTCEDGELN-1) 50 K = POINTRCLN) 
wos UTCKEL) NEIGHB = IABS(NBR(K)) 
EDGELN = EDGELN - 1 IF (NEIGHB-EQ-STACK(LN-1)) GO TO 4@ 
IF (W.LE.AV+SPREAD*STDDEV «OR. WeLE-FACTOR*AV) GO TO 30 C CLIMB OUT ON NEIGHBORING NODE 
C BRANCH A--B IS INCONSISTENT. DELETE IT. LN = LN + 1 
NBR(K) = -ILABS(NBR(K)) STACK(LN) = NEIGHB 
K = NEIGPTC1) POINTR(LN) = -1 
NBR(K) = -LABS(NBRCK) ) GO TO 18 
GO TO 3@ END 
C OUTPUT SECTION 
C WE COLLECT THE CLUSTERS AS FOLLOWS: 1..START WITH FIRST . 
C NODE. 2. THROW IN ITS NEIGHBORS. 3. THROW IN NEIGHBORS 
C OF NEIGHBORS UNTIL NO NEW ONES CAN BE FOUND. 4. EACH INTEGER FUNCTION LASTPT(NODE) 
C TIME A DELETED BRANCH IS ENCOUNTERED, PUT OTHER END IN A C THE VALUE OF THIS FUNCTION POINTS TO THE END OF THE LIST 
C LIST OF UNUSED NODES (AT TOP OF ARRAY IN CLS). 5. WHEN C OF NEIGHBORS OF NODE. 
C A FULL CLUSTER IS COLLECTED , OUTPUT IT. 6+ START AGAIN INTEGER SPACE(2), MSTC1)s NXTC1) 
C AT STEP 2 WITH A NODE FROM THE LIST OF UNUSED NODES. : EQUIVALENCE (MST,NXT) 
78 NUMIN = @ ; : COMMON SPACE, MST 
CLS = @ C OFFSET PICKS UP POINTER FIELD 
CP =) ; LASTPT = MSTCNODE) + 3 
K = MSTC1) 10 LF (NXTCLASTPT)+EQ-@) RETURN 
NXTCLS = N . LASTPT = NXTCLASTPT) + 2 
INCLS(NXTCLS) = LOCCK)/DIM + 1 GO TO 18 
PARENT(NXTCLS) = @ END 
BAKWPD = @ 
C START CLUSTER WITH NEXT AVAILABLE UNUSED NODE 
8@ CLS = CLS + 1 
NUMIN = NUMIN + 1 INTEGER FUNCTION FINDCN(A, 8B) 
BEGCLS = NUMIN INTEGER A, B 
NXTCN = NUMIN INTEGER SPACEC2), MST(1), NBRC1). NXTC1) 
NODE = INCLSC(NXTCLS) EQUIVALENCE (MST,NBR,sNXT) 
INLIST = PARENTC(NXTCLS) COMMON SPACE, MST 
INCLS(NUMIN) = NODE C THIS FUNCTION LOCATES NODE B IN THE LIST OF NEIGHBORS OF A 
NXTCLS = NXTCLS + 1 C OFFSET PICKS UP NEIGHBOR FIELD 
C LET K POINT TO FIRST NEIGHBOR OF NODE FINDCN = MSTCA) + 1 
98 K = MST(NODE) + 1 10 IF CIABS(NBR(FINDCN)).EQ-B) RETURN 
C ADD NEIGHBOR TO CLUSTER AND RECORD IT ANCESTRY FINDCN = NXTCFINDCN+2) 
1808 NXTNBR = NBRCK) IF (FINDCN-NE-@) GO TO 18 
IF (NXTNBR-eLT-@) GO TO 118 WRITE €6,99999) Bs A 
IF (NXTNBR-EQ-BAKKRD) GO TO 120 99999 FORMAT(5H@NODE, 13, 26H 1S NOT A NEIGHBOP OF NODE. 13) 
NUMIN ® NUMIN + | RETURN 
INCLS(NUMIN) 2 NXTNBP END 
PARENTCNUMIN) = NODE 
GO TO 128 
C THIS NEIGHBOR IS IN A DIFFERENT CLUSTER--ADD TO UNUSED 
118 NXTNBR = -NXTNBR SUBROUTINE STORECVALUE, ARRAY, LOC, N) 
IF (NXTNBR-EQ@-INLIST) GO TO 128 INTEGER VALUE, ARRAY(N). LOC, N 
NXTCLS = NXTCLS - J C THIS SUBROUTINE IS USED TO STORE VALUES INTO THE ARRAY 
INCLS(NXTCLS) = NXTNBR C WHICH IS THE FOURTH PARAMETER OF CLUSTR- 
PARENTCNXTCLS) = NODE IF (N.EQ.@) RETURN 
C GET NEXT NEIGHBOR LOC = LOC + 1 
128 K = NXTCK+2) . IF (LOC-GT»N) GO TO 18 
IF (K.NE.®@) GO TO 188 ARRAY (LOC) = VALUE 
C ADD LIST OF NEIGHBORS OF NEXT ELEMENT OF THIS CLUSTER RETURN 
NXTCN = NXTCN ¢+ 1 10 WRITE (€6,99999) VALUE 
IF (NXTCN.GT.NUMIN) GO TO 130 99999 FORMATC41H THE ARRAY USED TO STORE A DESCRIPTION OF/3H TH, 
NODE = INCLS(NXTCN) * 3@HE CLUSTERS IS NOT LONG ENOUGH /15H iTS NEXT VALUE, 
BAKWRD = PARENT(NXTCN) * 11H SHOULD BE . 118) 
GO TO 98 RETURN 
C END OF CLUSTER--DO OUTPUT END 
13@ CALL STORECNUMIN-BEGCLS+!, C, CP, CLEN) 
IF (PRINT) WRITE (6.99999) CLS 
DO 148 I=BEGCLS,NUMIN 
IF (PRINT) WRITE (6.99997) INCLSCI) : SUBROUTINE PRTREE 
CALL STORECINCLSC(I), C, CP. CLEN) C THE DESCRIPTION OF THE MINIMAL SPANNING TREE PRINTED HERE 
140 CONTINUE C LABELS EACH NODE SEQUENTIALLY AS IT OCCURS IN DATA 
IF (NUMIN-LT.N) GO TO 8@ INTEGER DIM, Ns MSTC1)s LOC(1), NBR(1), NXTC1) 
CP a @ REAL WTC 1) 
CALL STORECCLSs GC. CP, CLEN) EQUIVALENCE (MST,LOC,NBR, WTsNXT) 
CALL FIXMST COMMON DIM, N, MST 
RETURN DO 28 NODE=1,N 
99999 FORMATCIH@/8H@CLUSTER, 15, t2H CONSISTS OF) WRITE (€6,99999) NODE 
99998 FORMATC44HITHE TREE HAS BEEN CLUSTERED SEARCHING TO A K = MST(NODE) + 1 
* BHDEPTH OF, 13/711X, 28HINCONSISTENT EDGES HAVE BEEN, 18 WRITE (6.99998) NBR(K), WTCK+1) 
* 27H DETERMINED BY A FACTOR OF », Gil.d4/1!X, I@HAND A SPRE, K = NXTCK+t2) 
* 6HAD OF ». Gl!.4, 21H STANDARD DEVIATIONS.) ; IF (K-NE.@) GO TO 16 
99997 FORMATC1@X, 4HNODE, 15) 28 CONTINUE 


END RETURN 
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99999 FORMAT ( SH8NODE, 


13/ 16H NEL GHBORS ARE} 


99996 FORMATCI@X, SHNODE. 15. 14H AT DISTANCE . Git. 4) 


END 


SUBROUTINE FIXMST 


INTEGER DIM, N, 


MSTC1). NBRC1), NXTC1) 


EQUIVALENCE (MST.NBRLNXT) 
COMMON DIM. N. MST 


DO 2@ [#I.N 
K = MSTCI) « 1 
16 NBRCK) = [TABS( 
K ® NXTC(Ke2) 
IF (K»NE.@) GO 
2@ CONTINUE 
RETURN 
END 


NBRCK)) 


TO 18 


Remark on Algorithm 479 [Z] 
A Minimal Spanning Tree Clustering Method 
([R.L. Page, Comm. ACM 17 (June 1974), 321-323] 
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H.S. Magnuski [Recd 19 July 1974] Stanford Elec- 
tronics Laboratories, Stanford University, Stanford 
CA 94305 


The implementation of this algorithm assumes that both 
real and integer variables occupy the same amount of storage, 
which is not true of many Fortran systems. The algorithm 
assumes that real array W7 and integer array MST are exactly 
the same length, and intermixes floating point and integer vari- 
ables in creating the linked lists contained in these arrays. The 
simplest (but not best) solution is to define array WT in its own 
common block. The correct solution requires rewriting of the 
algorithm so that the linked lists tan properly handle floating 
point numbers. 

{Prof. Page informs me that he has a revised version which 
follows the suggestion of the last sentence above. -L.D.F. } 


ACM Transactions on Mathematical Software, Vol. 2, No. 1, March 1976, Pages 110 -111 
REMARK ON ALGORITHM 479 
A Minimal Spanning Tree Clustering Method [Z]} 

[R.L. Page, Comm. ACM 17, 6(June 1974), 321-323] 


G.M. White, S. Goudreau, and J.L. Legros [Reed 5 Aug. 1975 ] 


Computer Science Department, University of Ottawa, Ottawa, Ont. Canada 
KIN 6N5 


The algorithm as given generally yields a large number of clusters containing 
only one point. These are not likely to be of much use. Clusters not containing at 
least MINPTS points can be eliminated by making the following changes to the 
subroutine CLUTR. 


1. 


The first statement should read 


SUBROUTINE CLUTR(D, FACTOR,SPREAD,C,CLEN, PRINT, MINPTS) 
The statement beginning IF(PRINT) following the COMMON statement 


COMMON DIM, N, MST 


should be removed. 


The following statements should be inserted immediately after the COMMON 


statement: 
IF(MINPTS.LE.N) GO TO 5 
C(1) = 0 
RETURN 


5 IF(PRINT) WRITE(6,99998)D, FACTOR,SPREAD 
IF(PRINT) WRITE(6,99996)MINPTS 
99996 FORMAT(iHb, 10X , 39 HMINIMUMbNUMBERbOFbPOINTSbPERbCLUSTERb 


*«IS,I9) 


Statement number 130 should be replaced by the following: 


130 IF((NUMIN —BEGCLS+1).LT.MINPTS) GO TO 150 
CALL STORE (NUMIN —BEGCLS+1,C, CP, CLEN) 


The statement following statement 140 should be replaced by 


GO TO 160 
150 CLS=CLS—1 
160 IF(NUMIN.LT.N) GO TO 80 


With these changes, the program will produce the same results as the original 
program if MINPTS is set equal to 1 at the point of invocation. 
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The algorithm with the above modifications has been tested successfully using 
G and H (opt=2) level Fortran compilers on an IBM 360/65 under o.s. level 21.8. 
With this configuration, the qualifications mentioned by Magnuski [1] are not 
applicable. 

The program has been used to detect artificially generated clusters superimposed 
upon a background of noise and to detect stars in nuclear emulsions. The algorithm 
seems particularly well suited for identifying nuclear events in three dimensions 
using data obtained automatically from emulsions by flying spot scanners. 


REFERENCES 
[1] Maanuskt, H.S. Remark on Algorithm 479. Comm. ACM 18, 2(Feb. 1975), 119. 
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Algorithm 480 


Procedures for Computing 
Smoothing and Interpolating 
Natural Splines [E1] 


Tom Lyche* and Larry L. Schumakerf [Recd. 18 Oct. 
1971 and 9 Apr. 1973] 

Department of Mathematics, The University of Texas 
at Austin, Austin, TX 78712 


* Supported in part by Grant DA-ARO(D)-31-124-61050, Army 
Research Office, Durham, and National Science Foundation Grant 
GP-23655. 

t Supported in part by Grant USAFOSR 69-1812B. 


Key ‘Words and Phrases: approximation, interpolation, spline, 
natural spline, spline smoothing 

CR Categories: 5.13 

Language: Algol 


procedure SPLINECOEFF (m,n,X,Y,W,C,q,S,eps,mach,maxit, fail); 
value m,n,maxit, integer m,1,q,maxit; real S,eps,mach, 
array X,Y,W,C; label fail; 

comment |. The purpose of this procedure is to generate the coeffi- 
cients {c;}1* in the representation 


s(x) = > c:B;(x) (1) 
tal 

of a natural spline of degree 2m — 1 (in terms of a local basis 
{ B.(x)}1") for the splines which solve certain data smoothing and 
interpolation problems. It is based on algorithms described in [2]. 
To describe the problems, let m and n be integers (m,n> 1) and 
suppose {x;}1", {yi }:" and {w,]." are prescribed real numbers, with 
Xi <X2< +++ <Xn and w;>0, i= 1,2,...,.2. Suppose p>O and S>0. 
For appropriately smooth / we define 


io) 
Jf) = i (f™ (x))?dx (2) 
I 00 
E(f) = 20 wilyi—S(xi))?. (3) 
tal 
The spline interpolation problem is 
minimize J(f) subject to E(f) = 0. (4) 
We can solve either of two data smoothing problems: 
minimize [J/(f)-+-pE(/)} (5) 
or 
minimize J(/) subject to E(/) <S. (6) 


In all cases, the solutions are certain natural splines of degree 
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2m — 1 with knots {x;}.:" which can always be represented in the 
form. (1). We assume that » > 2m, in which case the solutions 
are unique, and there is a convenient basis {B;(x) }1". 

Determining the {c;}:" in problem (4) involves setting up and 
solving a system of n equations with a 2m — 1 banded matrix. 
Similarly: (5) leads to a system with a 2m + 1 banded matrix. 
Solving problem (6) depends on the fact that for small S there is 
a unique p = p(S) such that the solution of (5) for this p is the 
solution of (6). The parameter p(S) is the unique positive solu- 
tion of 


P(p) = Els) = S, (7) 


where s, is the solution of (5) corresponding to p. Equation (7) 
is solved by Newton’s method applied to 


PD = SF (8) 


Then (6) is solved approximately in the sense that a spline s is de- 
termined so that 


|E(s)\—S| < eps/n; (9) 


comment 2. We describe the parameters of SPLINECO EFF. The in- 
tegers m and » must satisfy m>1,n>4m—1. The real arrays 
X[l:ia], Yil:aj, and W[l.v1] must satisfy X[1]<-+-<X[”] and 
Wi] >0, i= 1,2,...,7. The integer gq has nonnegative values. In case 
q = 0, the procedure solves (4)—i.e. produces the coefficients of 
the natural interpolating spline (1) of degree 2m ~— 1 with knots 
at the X[i|’s. The coefficients are returned in the array of real num- 
bers Cl| 1.7]. 

If g = 1, problem (5) is solved with smoothing parameter 
p := 8S, a specified positive real number. Again the coefficients 
are returned in array C. Finally, if g = 2 the iterative process 
described in comment | is carried out to determine a spline s sat- 
isfying (9). S and eps must be positive real numbers. The param- 
eter maxii should be a positive integer specifying the maximum 
number of iterations desired in solving (8). 

The parameter mach is to be the largest machine number such 
that 1 + mach = 1: It is machine dependent, of course. The 
label fail is for the purpose of exiting from SPLINECOEFF if cer- 
tain situations arise (e.g. if maxit is exceeded). These are explained 
in detail in comments 7, 11, and 15; 

comment 3. SPLINECOEFF calls on four other procedures called 
BANDET, and BANSOL, ENDBASIS, MIDBASIS. \t is as- 
sumed these procedures are defined in the driver program—we de- 
scribe their bodies later. The driver program should provide two 
arrays for workspace, namely, XXR,XX[1:n,1:2m]; 
begin 
integer 4,k1;k := m+m;kl:= k--1}; 
begin 
integer a,i,j,/,i},i2,m1,m2,r,v,g¢,/1,/2; 
real FFF, f1,s2,p,d,h,h1; 
array E,B,BWE[1:n,—m:m],LB[1:n,1:m],NIK,T(O:n),Z,U([1:k]; 
integer array INT [1:n]; 
l:=n,a:=k+kjr:= ifn>a then a else; 
for 7 := 1 step 1 until k do 
begin 
l:=/-l;r:=r-—-]; 
for i := 1 step 1 until / do 
XX(i,j) := Xli+/—Xld; 
for i := 1 step 1 until rdo 
XXR{ii,j) = XX(n—-—i-J/+1,4); 
end /; 
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for i := 1 step 1 until 1 do 
for / := —m step 1 until m do 
Bii,j] := 0; 


comment 4. The array B is to contain the values of B;(x,), 
where B;(x) are the local basis elements of (1). There are es- 
sentially three kinds of basis functions, namely (see [2]) 

Qom.i(x), i=1,2,...,m 

Nom, i(x), i= m-+1,.. 

Oo, i(x), i=n— —m-+],.. 

Let B = (B;(x;)). Because of aie support properties of 
the B;(x), Bis 2m—1 banded and we may store it as follows: 


Bix) = .—m 


—mi-m | 0 m-| m 
1] 0 0 By(x1) Bn (X1) 0 
2 1(X2) B2(X2) 
n|0 Bn (Xn) 1) 


Specifically, Bij =" Bi4;(x2) = Bais; for j= max(l—m 


1—i),...,minQm—1,n—i), = 1,2,...,175 
for / := 1 step 1 until k1 do 
begin 

for j := 1 step 1 until /— {do 


Ti) := XX(il—/; 
T(l] := 0; 
12 := if/+ki > ntheny else/+k1; 
for 7 := /+1 step 1 until /2 do 
Tj] := XX(L,j-]; 
ENDBASIS (k,l,n,T,XX,NIK) ; 
/1:= if! > mthen/ else m; 
for j := /1 step 1 until /2 do 
Bil,j—m—l+1) := NIK{(J); 
end /eftpoints; 
for / := k step | until »—k do 
begin 
for j := /—k]1 step 1 until /—1 do 
Tlf] = XXU/—J); 
T{/] := 0; 
for j := /-+1 step 1 until /+-41 do 
Ti) := XXU,j—; 
MIDBASIS (k,/,n,7,XX,NIK); 
for j := /—k1 step 1 until /—1 do 
Bil,j+m—l] :=NIK{J\; 
end midpoints; 
for /:=1 step 1 until k do 


_begin 


for j := 1 step 1 until /—1do 
Tij) := XXRU/—/); 
Til] := 0; 
12:= if/+k1 > nthennelse/+k1; 
for j := /+-1 step 1 until /2 do 
Tif] := XXRLj—-]; 
ENDBASIS (k,1,n,T,XXR,NIK) ; 
/1:= if! > mthen/ else m; 
for j := /1 step 1 unti) /2 do 
Bln—l+1,m+/—j—1]:= NIK{[j] 
end rightpoints; , 
comment 5. When g = 0 or if g was changed from 2 to 3 in 
attempting to do smoothing (see comment 9), the coefficients 
{c:}," of the interpolating spline are computed from the 
linear system BC = Y; 
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interpol: 
if g=0 VV q=3 then 
begin 
m1 := m—1; 
for i := 1 step 1 until 7 do 
for; := —/ml step 1 until m1 do 
BWEi,j) := Bli,J\; 
goto /insol 
end; 
comment 6. For g = 1,2, or 4 (see comment 12) the C array 


is computed from the linear system — 


(B+p7E)C = Y, (10) 
where 
Ei; = wi'Bu, 
min(n.j+m) 
;=fl1; JI 1/1), J Lye F= max(1, jm) 
t=max(l.j—m 
min(n, j-+m), and 
1, J=1,2,... oe 
fl; = (-1)™"(2m—-1)! Coe: j=m+i,.. a 
(—1)>t™7, peers 


The #’s are the coefficients of certain divided differences. The 
array Fis 2n+1 banded and is stored in £ in a form similar 
to B. The quantity d is an estimate for ||£}|,; 
fl := -Iljv:=k-1;i1 := 1;12:= m;d:=0; 
for i := 2step 1 until mdo/fl := —f1 Xj; 
for {:= m-+1 step | until vdo fl := fi Xi; 
for 7 := 1 step 1 until 7 do 
begin 
if j>—m then begin f1 := —fl;f:=/1 end 
else if < mthen f:= /1 
else f:= f1 X XX[j—m,k]; 
if 7>m+1 then 7] := i1+1; 
if (2<n then i2 := i2+1; 
for / := il step 1 until i2 do 
begin 
K:=fiv:= 1-1; 
for i := il step 1 until vy do 
SF = fF/XXlit — i); 
for i := /+-1 step | until i2 do 
= —S/XX(Li-N; 


El,j—!) := F/ Wil); 
d:= d+abs(E{l,j—/)) 
end /; 
end E matrix; 
d:=d/n; 
mi := myr:= —1; 82 := sqrt(S);m2:= m—1; 
if g=2 then p:= 10 X mach Xd 


else if S<10 Xd X mach then 


begin 

q := 7; goto fail 
end 
else p := S; 


comment 7. The matrix & is singular. Hence in the caseg = 1, 
if p<10 X mach X |\|E\l,, the matrix (10) will be very close to 
singular since ||B||1 = 1. In this case we exit and set g = 7; 

comment 8. If g = 2 we need to carry out the iteration de- 
scribed in comment 1. Since f!(p) in (8) is concave (see (3, 
4]), we want to choose the first guess p® for Newton’s method 
such that f~!(p°) <<S~}. We choose p® = 10 X mach X ||E\l, 
(see comment 7); 

nextit: 

comment 9. When p > d/10 mach, the matrix p—£ is consid- 
ered insignificant in (10) and the smoothing problem (5) is re- 
placed by the interpolation problem. In this case we set g = 3; 

if p > d/\0/mach then 
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begin 

q := 3; goto interpol 
end; 
r:=r+l; 


ifr > maxit then 

begin g := 6; goto fail; end; 

for i := 1 step 1 until 1 do 

for 7 := —mstep | until m do 
BWE|i,/| := Bl, J+ E(t /)/p; 


linsol: 


BANDET(BWE,LB,INT,n,m1); 
for i := 1 step 1 until 7 do 
Cli] := Yd; 
BANSOL(BWE,LB,C,INT,n,m1}); 
ifg < 2 V q = 3 then goto exis; 
comment 10. We now calculate F = /2(p) and check condi- 
tion (9); 
F:=0;1:= m2;i1 := 0; 
for i:= 1 step 1 until 7 do 


begin 
if />n—m2 then/ := /—1; 
if {1 > —m2 then i! := jl—1;FF:= — Y{ij; 


for 7 := il step 1 until / do 
FF := FF+ Bii,j) X Cli-+/]; 
F:= F+FF X FF X W(i|; Tli] := FF; 
end; 
if abs(F—S) <eps X sqrt(n X abs(S) then 
begin S := F; goto exit end; 
comment 11. It may happen that the choice of p* (see comment 
8) leads to s,° with f-!(s,°) >S—}. In this case we setg = 5 
and exit. 
This means the initial choice of S is too large; 
comment 12. In some cases the iteration may lead to s, with 
f(s.) >S~?. (Because of the concavity of /-! this is theoreti- 
cally impossible.) We set g = 4 and exit. See also comment 
15; 
if < S then 
begin 
if r=0 then begin g= 5; goto fail end 
else begin g=4; S := F; goto exit end 
end; 
comment 13. We now compute FF=/(p) < /’(p) and carry out 
one step of the Newton process; 
for i := 1 step 1 until 1 do 
Cli] := Wii] < TTi); 
BANSOL (BWE,LB,T,INT,n,m); 
FF := 0;/ := m2;i1 := 0; 
for i := 1 step 1 until do 
begin 
ifi > n—m2then/ :=/-—1; 
if (1 > —m2 then jl := i1—1; fl := 0; 
for / := i1 step 1 until / do 
fl := f1+Bli,j] X Tli-/); 
FF := FF—C(i] Xf 1; 
end; 
p:= pX (A+F X (s2--sqrt(F)) /s2/FF); 
goto nextit; 


ll 


exit: 


end; 

comment 14, Choice of parameters. It is known that the condition 
number of the system BC = Y for spline interpolation in- 
creases at least exponentially with m (see de Boor [1]). It is also 
related to the spacing of the {x;}:". We have computed splines 
to order 20 (m= 10) with knot spacing 


max,(x ry1— xX i) 
i 
min xis. — Xi) 


up to 1000, without difficulty. For many problems a choice of 
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a small m is desirable—e.g. m = 2, 3 lead to cubic and quintic 
splines, respectively. The size of the parameter » is naturally 
limited by the storage capability of the machine and the time 
available for computation—it seems to have little or no effect on 
conditioning. 

The choice of {w;}:" and S for smoothing depends on the 
confidence we have in the data {y,}.". It has been suggested [3] 
that w; should be chosen as éy; 2, where dy, is an estimate of the 
standard deviation of the ordinate y;. A practical upper bound 
for the choice of w; is (mach)~*, where mach is defined in com- 
ment 2. If we have more confidence than this in the data, then it 
is probably accurate to machine word length, and we should set 
q = 0 and do interpolation rather than smoothing. When 
q = 1, the choice of p (input through S) for problem (5) is 
problematical. There really is no dependable scheme for choos- 
ing it (see the remarks in [4]) unless more is known about the 
problem. For gq = 2, it is recommended [3] that S be chosen 
in the interval «2 — (22)! < S < nm + (2n)!. The param- 
eters eps and maxit influence each other. For most applications 
it would seem that eps should not be too small—we often used 
107'; 

comment 15. Summary of output after execution. After the execu- 
tion of SPLINECOEFF, the values of g¢,S provide information 
on the computation. If g = 0,1, 2, then computation pro- 
ceeded normally, and the desired coefficients are stored in array 
C. lf q = 3 (see comment 9) interpolation instead of smooth- 
ing has been carried out (if the user insists on doing smoothing, 
S must be increased). If g = 4 (see comment 12) the program 
delivered the solution of problem (6) with the S returned in the 
output. (If the user insists on a solution of (6) with the pre- 
scribed S, then the problem can be rerun with a write statement 
providing the values of p and / in each iteration. Then an ap- 
propriate p can be estimated by interpolation and the program 
reentered with g = 1.) If gq = 5 (see comment 11), the user 
must either reduce S or consider doing a least squares fit. If 
q = 6, maxit has been exceeded. If g = 7 (see comment 7), 
then the initial value of p prescribed for problem (5), i.e.g = 1 
initially, is too small. The value of p can be increased or a least 
squares fit should be used; 
end SPLINECOEFF; 
real procedure SPLINEDER (v,X,/,C,m,n,arg); 
value v,/, m,n, C, arg; 
integer v,/, m,n; real arg; array X, C; 
comment 16. Given a spline s of the form (1) with coefficients {c;}." 
SPLINEDER produces the value s‘” (arg) of the vth derivative of 
s for the argument arg. 

s‘ (arg) is computed by evaluating certain local basis splines 
corresponding to degree 2m — y. The procedures MIDBASIS 
and ENDBASIS are used here. Then s‘” (arg) is a linear combina- 
tion of these quantities with coefficients {cf} ” (see [2, Lemmas 

5.1 and 5.2]). The c{” are computed from the c,’s by certain re- 
cursions, carried out by procedure CV bclow; 


comment 17. We note that s®”"-") is precewise continuous with pos- 


sible discontinuities at the knots {x;},". The procedure always re- 
turns s2"-)(x;-+) if called with arg = x, a knot; 
comment 18. We describe the parameters of SPLINEDER. The in- 
tegers mand n and the array X[1:] are as in procedure SPLINE- 
COEFF. The array C[1:] is the output of SPLINECOEFF. The 
integer v must satisfy 0 < v < 2m -- 1. The real number arg and 
the integer/ satisfy1 </ <n — Jand X{/] < arg < X(/+1]; 
begin 
integer k;k:= m+ m — ¥; 
begin 
array 7, NIK(0:n],Z, QIK, PIK(0:k]; real s; 
integer i, /, i1, i2, pvl, qvl, rvl, mv, lu, 11, 12; 
procedure CV(C, X, r,s, n, m, v); value r,s, 1, m, v; 
integer r, 5,1, m, v; array C, X; 
comment 19. CV computes {c;°’}i~,. It should be noted that 
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CV is a recursive procedure; 
begin 
integer /, rl, s1; 
if v = OQ then goto exit else if y < m then 
begin 
CV(C, X,r, s+1,2, m, v—1); 
for j := rstep | until s do 
CL] := iff < m — v then —C{j] 
else if 7 < 2 — m then 
(Cij+1] —CUi])/(Xim+/] — X[j—m-+-v]) 
else C[L/+1] 
end 
else 
begin 
rl:=ifr > 1thenr — 1| else 1; 
sl:=ifs < a+ v — 2Xmthen selses — 1; 
CV(C, X, rl, sl,n,m, v—1); 
ifs =n + v — 2Xm then C{s| := 0; 
for j := sstep —1 until r do 
Cla] := (Cl/J—CU—1)) /(XUj+-2 X m=] — Xj) 
end; 
exit: 
end CY; 
comment 20. The numbers pvi/ and qv/ give the range of c‘»)’s 
corresponding to nonzero basis elements in the expansion of 
Ss (arg); 
if v < m then 
begin 
pvl:= if! < mthen1else/ — m + 1; 
qi :=ifn <1 + mthenn — velse/ + m — vy; 


end 

else 

begin 
pvl:= if! < k then lelse/ — & + 1; 
qui := iff <n — kthen/elsen — k; 

end; 

C(O] := 0; 

CV(C, X, pri, qvl, n,m, v); 

s:= Q; 


if v < m then goto v/m; 
for 7 := pvi step 1 until gvi + k do 
T{j] := abs(arg—X{j)); 
MIDBASIS (k,1,n, T, XX, NIK); 
for j := pvil step | until gv/ do 
s:= 3s + Cli] X NIK{J); 


goto exit; 
vim: 
if/ < k then 
begin 
for ; := 1 step | until/ + kdo 
T[j] := abs(X[j]—arg); 
ENDBASIS (k, 1, n, T, XX, NIK); 
for j := pvi step 1 until gv/ do 
s:=s + Cli] X NIKU+m-—]]; 
end else 
if/ > n — k then 
begin 
for j:= 1 step 1 untiln —/ + k + 1do 
T[j] := abs(arg—X|n—j+})); 
i1:=ifarg > X{/| thenn — /elsen — / + 1; 
ENDBASIS (k, 11, n, T, XXR, NIK); 
for j := pvi step | until gv/ do 
s:=s + Cl] X NIK|n4+m—v—-J]; 
end 
else 
begin 


forj:=/ — k + istep 1 until/ + kdo 
T(j] := abs(X[j]—arg); 
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MIDBASIS (k,1,n, T, XX, NIK); 
for j := pvi step 1 until gv/ do 
s:=s + Cj] X NIK[i-k+m]; 
end; 
exil: 
for i := 1 step 1 until v do 
S:= 5X (m+m-i); 
splineder := s 
end inner block 
end splineder; 
procedure MIDBASIS (k, 1, n, T,XX, NIK); 
value k,/,n; integer k,/,n; array T, XX, NIK; 
comment 21. This procedure implements case I of [2]. It computes 
the value of certain normalized B-splines N%,.(arg) at an arg which 
enters indirectly through the array T via TU] = |x[/] — arg]. 
After execution N/K([/] contains N‘ (arg), j = max(1,/+1—k&), 
faye 
begin 
integer i, /, il, i2; 
NIK(I} := 1; NIK([+1] := 0; 


il:=i2:=1; 
for i := 2 step 1 until k do 
begin 
ifi < / then 
begin 
il:= ii — 1; NIK[/1] := 0; 
end; 


ifn — i < /then/2:= 72 — 1; 
for j := il step 1 until i2 do 
NIK{j] := T{j] X NIK{J]/XXT/, /—1] + Tl] X 
NIK{j+1)/XXU-+1, i-1]; 
end; 
end midbasis; 
procedure END BASIS (k,/,, T, XX, NIK); 
value k,/, n; integer k,/, 1; array T, XX, NIK; 
comment 22. This procedure implements case II of [2] to compute 
the quantities (7.4) of [2] atan argument arg which enters through 
the array T as in comment 21; 
begin 
integer i,j, k1,/1,/2; real temp, temp2; 
array Q[0:k, —1: k+/]; 
kl:=k-1; 
for i := O step 1 until & do 
for j/:= / — 2step 1 until / + ido 
Qli, /| := 0; 
Q[l,/] := 1/XX1/, 1]; Q(0, —1] := T121/XXU, 1]; 
for i := 2 step 1 until & do 
begin 
for } := / step | until i — 2 do 
begin 
temp| := Tij+1) 
Oli, | := Qti-2,j—2] + (empi+T{j)) x Qli-—2, 7-1] + 
temp| X temp\ X Qli—2, J]; 
end; 
ifi > / then 
begin 
fempl := T\i|; temp2 := temp\ X temp1/XX(|1, i—1];° 
Qli, i—1} := Qli—2, i-3] + (temp1+T{i—1)—temp2) X 
Q(i—2,i—2] + temp2 x Q[i—2, i-1]; 
end; 
l1:= ifi > / theni else/; 
12:=if/ +i-—-1>n — 1thenn — lelse/ + / — 1; 
for j := /1 step 1 until /2 do 
Oli, /) := (TY-i+1] x Qli-1,/-UW+7U+1] k Q—1,7))/ 
XX[f—i+1, O; 
end i; 
if/ > 1 then NIK{/—1] := 0; 
for j := / step 1 until k1 do 
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NIK{j] := Q{k, J]; 
2:=ifk +/—-—-1>n — lthenn — lelsek +/ — 1; 
for j:=k step 1 until /2 do 
NIK{j| := Q[k, 7] X XX[j-k+1, 4]; 
end ENDBASIS, 
procedure BANDET (A, B, INT, n, m); - 
value 7, m; integer 1, m; array A, B, integer array INT; 
comment 23. BANDET decomposes the 2m + 1 banded n XK n 
matrix A in an upper triangular matrix A and a lower triangular 
matrix B using Gaussian elimination with complete pivoting. De- 
tails of the interchanges are stored in the array INT. The arrays are 
dimensioned as follows A[l:n, —m:m], B[l:n, lim), INT{[1:n]. 
For further details see [5]; 


begin 
integer i, /, k,/; real x; 
Gam, 
for i := 1 step 1 until mm do 
begin 
for j:= 1 — istep 1 until m do 
Afi, j—-l) := Ali, /]; 
f:=/1 — 1; 
for j:= m — / step 1 until m do 
Ali, j] := 0 
end i; 
l:=m; 
for & := 1 step | until 7 do 
begin 


x:= Alk, —m];i:=k; 
if] < nthen/:=/ + 1; 
for /:= & + 1 step 1 until / do 
if abs(A[j, —m]) >abs(x) then. 
begin x := A[j, —m];i:= /end; 
INT[k] := i; 
if i + k then 
for /:= —”m step 1 until m do 
begin 
x i= Alk, /]; Alk, J) := Ali sl; Ali, / = 
end /; 
for i::= k + 1 step 1 until / do 
begin 
x i= Ali, —m]/A[A, —m]; BlA, i-k] : 
for j:= 1 — mstep | until m do 
Afi, j—1] := Afi, /] — x X Alk, J]; 
Ali, m] := 0 
end i 
end k 
end BANDET, 
procedure BANSOL (A, B, C, INT, 1,m); 
value n, m; integer 7, m; array A, B, C; integer array /NT; : 
comment 24. The parameters A, B, INT, n, and m come from 
BANDET. BANSOL solves the system decomposed by BANDET 
with right-hand side C. The solution is returned in {C[/|{i" (see 


| 
* 


i 
* 


fo); 
begin 
integer i, j,k, /; real x; 
l:=m; 
for k := 1 step 1 until 7 do 
begin 
i:= INT [k]; 
if i + k then 


begin x := C[k]; C{k] := C[é]; Cli] := x end; 
if] <nthen/ :-=/+ 1; 
for i := k + 1 step 1 until / do 
Cli) := Cli] — Blk, i — k] X Ck] 
end k; 
/:= —m; 
for i :=:n step —1 until 1 do 
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begin 
x:= Cli;j:=i+m; 
for k := 1 — mstep | until / do 

x:=x— Ali, k] X Clk + J]; 

Cli] := x/A[i, —m]; 
if/ < mthen/ :=/+ 1 

end / 

end BANSOL; 
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Arrow to Precedence Network 


Transformation [H] 
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fornia, Berkeley, CA 94704 


Key Words and Phrases: critical path, networks, precedence 
networks 

CR Categories: 3.23, 5.40 

Language: Fortran 


Description 

Purpose. Many of the recent application programs in the area 
of critical path scheduling and resource allocation are written for 
the precedence networking convention [1, 2, 3]. Since only a few of 
these programs accept networks defined by the arrow convention 
directly, a method of transforming arrow convention networks into 
precedence convention is required. This algorithm generates the re- 
quired transformation by producing a list of followers for each non- 
dummy arrow activity. New labels are produced for each trans- 
formed activity and replace the (i — /) labels associated with arrow 
networks. (The new label is actually the activity input sequence 
value, but this can easily be modified to any desired notation by 
using the input sequence value as a subscript to any array contain- 
ing the desired notation.) 

The logic used in the transformation can also be utilized to 
produce a list of precedecessors if they are desirable. (This order is 
required by IBM (3] but is performed internally.) The role of arrays 
(11 and JJ) would be reversed and the array (JLOC) would refer 
to (JJ) vice (II). - 

Method. The values of the arrow (i — /) labels are utilized to 
trace the followers of a particular activity. Activities which have an 
(i) label corresponding to the (/) label of the activity under evalua- 
tion are logical followers. The major problems rest with the arrow 
DUMMY activities. These activities are not really followers but 
indicate instead addition nodes that precede logical followers. The 
transformation routine recursively traces all possible following 
nodes and determines the input sequence number of all logic 
followers. 

To perform this search with the minimum storage required the 
following procedure is utilized. First the arrays (IJ, JJ, NLOC) are 
filled by scanning the description of the arrow network and storing 
in input order the converted value of the (/) label into array (JJ); 
the converted value of the (/) label into array (JJ); and finally the 
array (NLOC) contains the input sequence value. To aid in deter- 
mining which activities were dummies, the last two arrays (JJ, 
NLOC) have their values set negative when the corresponding ac- 
tivity was a dummy. Since the minimization of storage was a goal, 
all incoming (i — /) labels were converted into a numerical sequence 
starting with one. The algorithm indicates the required modification 
if this is undesirable. (The actual conversion. method is described 
in the routine HASH.) Once the arrays are filled, the transformation 
routine can be called. 

Routine (TRNFRM) first sorts the array (//) into ascending 
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order, maintaining the same correspondence of each element in 
array (NLOC). A sequential scan is then performed on the sorted 
array (JI), and the array is overlayed by an array, (LOC), contain- 
ing pointers to the beginning of each different (7) value in the sorted 
array. That is element (1) of the new array points to the start of 
the value (1) in the sorted array; element (10) to the start of (10), 
and so forth. Finally the array (JJ) is scanned sequentially and the 
nonnegative values become subscripts to the pointer array (ILOC). 
This yields the beginning location and number of activities that 
had an (7) label equal to the current (/) value. The values stored in 
(NLOC) are the input sequence numbers of the followers. If the 
follower was a DUMMY, (NLOC) negative, a recursive search is 
performed for additional followers. 

Finally for each nonnegative entry in (JJ), the description is 
retrieved from the scratch tape and the activity and its followers are 
output. 

_ Test Results. Testing was performed by two additional programs 
which are also included in the algorithm listing in case they are 
desired. Routine (TEST) reads the arrow network filling the arrays 
(I, JJ, NLOC) as described. Routine (HASH) performs the re- 
quired conversion to the (i — /) labels during this process. 

Tests include networks with sequential dummies and other 
unusual conditions. In each case tried, the transforrnation was cor- 
rect. The inefficiency of the bubble up sort could adversely affect 
very large networks and an alternative would be to pre-sort the 
arrow network and eliminate the sorting portion. The following 
table indicates execution time versus number of activities for tests 
run on a CDC 6400. 


Execution Times for Various Networks Tested 


Number of Execution 
activities time in sec. 
16 0.42 

44 1.68 

177 2.08 

461 5.81 

677 10.76 


The routine does not test for logical errors in the arrow net- 
work such as loops, so these would be transformed without change 
into the precedence notation. 
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Algorithm 
(Note: A sample driver is included to help clarify the use of 


this algorithm—L.D.F.) 


THIS IS THE TEST PROGRAM FOR THE TRANSFORMATION ALGORITHM. 
IT READS THE ARROW NETWORK DESCRIPTION AND ESTABLISHES 

THE INPUT ARRAYS FOR THE ROUTINE (TPNFRM). 

IT IS LIMITED TO 788 ACTIVITIES IN ARROW NOTATION. 

THE ROUTINE (HASH) IS UTILIZED TO CREATE A SEQUENTIAL 
NUMBERING. 

THE ROUTINE (TRNFRM) CREATES THE ACTUAL TRANSFORMATION. 
TAPE(2) -A BINARY SCRATCH TAPE (FILE) WITH ALL DATA TO 

BE INCLUDED WITH THE TRANSFORMED ACTIVITIES+«NOTE- CHANGE 
STMT 148 TO CORRESPOND WITH ACTUAL DATA STORED. 

TAPE(4) -A BINARY SCRATCH TAPE FOR TRANSFERING THE TRANS - 
FORMED DATA BACK TO THE MAIN PROGRAM FOR PRINT OUT, OR ANY 
OTHER USE. THE DATA IS IN THE FORM (1,.M,FOL) WHERE I 1S 


qgaanNgaAaaaagaaaaa 
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c 
Cc 
c 


THE NEW ACTIVITY LABEL AND M IS THE NUMBER OF FOLLOWERS 
AND FOL IS AN ARRAY CONTAINING THE LABELS OF THE M 
FOLLOWERS... 
INTEGER 11¢788), 
* HASH.s FOL(5®@) 
DATA DUMMY/SHDUMMY/, 


JJC7868), NLOC(780), ACTC(2), DUMMY, 


IBLNK/1H 7 


C READ IN ARROW ACTIVITIES ACCORDING TO CURRENT FORMAT. 

99999 FORMATCIHI], 13H INPUT ORDER, 6X. SHLABEL. 5X, 4HDESC, 
* THRIPTION, 12X%s 3HDUR)D 

99996 FORMAT(2A4, 2A18, 13, 3X, 16) 

99997 FORMATCII4, 4X, Ads LH-s Ads 3X, 2AI1Gs 16) 

99996 FORMATCIH], 19HTRANSFORMED NETWORK//14H LABEL DESCR, 


* 6HIPTIONs 10X, JHDUR, 3X, QHFOLLOWERS) 
99995 FORMATCIH » 17, 2X, 2A18, 14) 
99994 FORMATCIH+, 36X, 1515/¢€37X, 1S515)) 
WRITE (6.99999) 
NACT = @ 
NTAPE2 = @ 
10 READ (5.99996) I, Js ACT. IDUR 


a aa 


agqganna 


gQAgAgAANnA a 


aaan 


aoa 


aagagaana 


c 
Cc 
c 


FORMAT (99996) WILL 
THE TEST FOR END OF 
IF (I.EQ. IBLNK) 
NACT = NACT + 1 
LIST THE ARROW DATA 
WRITE €6,99997) 


VARY FOR INDIVIDUAL NEEDS. 
DATA 1S A BLANK CARD- 
GO TO 38 


FOR REFERENCE. 


NACT, I. Js ACT, IDUR 


CONVERT THE ALPHANUMERIC I-J LABELS INTO SEQUENTIAL 
NUMERIC. (ROUTINE HASH PERFORMS THIS TASK.) 
STORE THE CONVERTED LABELS IN THE ARRAYS (II AND JJ). 
NOTE. THE VALUE STORED IN ARRAY (JJ) IS ALSO SAVED AS 
VARIABLE J TO ALLOW IT TO BE USED AT STMT 28 WITHOUT AN 
ARRAY -REFERENCE. 

TICNACT) = HASHCI) 

J = HASH(dJ) 


JJCNACT) = J 
STORE THE INCOMING INPUT SEQUENCE 
NLOCCNACT) = NACT : 
EXAMPLE OF USER CREATED LABELING, SEE ALSO COMMENTS AFTER 
STMT [4@ IN ROUTINE TRNFRM. 
LABLS (NACT)=CONCATENATION OF INPUT (1-J) 
THE CONCATENATION IS PERFORMED IN ACCORDANCE VITH VALID 
FORTRAN FOR THE COMPILER IN USE 
TEST FOR A DUMMY ACTIVITY AS IT WILL NOT BE TRANSFORMED. 
IF cCACT(1)-EQ@-DUMMY) GO TO 28 


VALUE IN ARRAY (NLOC) 


SAVE GN TAPE (2) ALL INFORMATION RELATING TO THE ACTIVITY 
JUST READ THAT IS TO BE ASSOCIATED WITH THE TRANSFORMED 
ACTIVITY. (FOR THE EXAMPLES ONLY THE DESCRIPTION AND DUR 
ARE SAVED,ACTUAL USERS WILL HAVE INDIVIDUAL REQUIREMENTS) 

NTAPE2 = NTAPE2 + 1 

WRITE €2) ACT. IDUR 

GO TO 1@ 


IF AN ACTIVITY WAS A DUMMY. SO NOTE BY SETTING THE 
LOCATEON AND JJ LABEL VECTORS NEGATIVE. 
28 NLOCCNACT) = -NACT 
JJCNACT) = -d 
RETURN FOR NEXT INPUT ACTIVITY. 
STMT 308 WHEN LAST INPUT 
GO TO 18 
3@ REWIND 2 
CALL THE TRANSFORMATION ROUTINE.,DESCRIPTION OF INPUT 
ARRAYS IS FOUND IN THE (TRNFRM) ROUTINE. 
CALL TRNFRM(NACT, II. Jus NLOC) 
PRINT OUT THE TRANSFORMED NETUORK..- 
WRITE (€6,99996) 
DO 4@ N=l,NTAPE2 
RECOVER THE REQUIRED DATA RELATING TO THE TRANSFORMED 


TRANSFER WILL BE MADE TO. 
IS RECOGNIZED. 


ACTIVITY FROM TAPE(2) AND TAPE (4). 
READ (2) ACT. IDUR 
READ (4) I. Ms FOL 
WRITE (6.99995) I, ACT, IDUR 


IF (MeLE-@) GO TO 48 
WRITE (6.99994) (FOLCMM),.MM=1 4M) 
43 CONTINUE 
STOP 
END 


INTEGER FUNCTION HASH(N) 
THIS ROUTINE CONVERTS THE ALPHANUMERIC ARROW LABELS INTO A 
SEQUENTIAL NUMERIC EQUIVALENT. THE MAXIMUM NUMBER OF 
SEPARATE ACTIVITY LABELS IS 580 FOR THIS TEST PACKAGE. 
THE ACTUAL INCOMING LABEL 1S STORED IN ARRAY (HOLD) AND 
THE SEQUENTIAL NUMERIC EQUIVALENT IS STORED IN ARRAY 

(SAVE) 

VARIABLE (NUM) PROVIDES THE SEQUENTIAL NUMBERS - 

INTEGER HOLD(580), SAVE(S0@) 


DATA NUM/@/, HOLD/S@@*8/ 
USE A MODIFIED HASHING ROUTINE TO FIND AND STORE THE 
EQUIVALENT VALUES. 


NN IS A HASHED VALUE FOR THE INPUT VARIABLE N. 


99999 FORMAT(34H EXCEEDED THE EVENT TABLE CAPACITY) 


aqaanaaan 


NN = MODCIABS (N/68719476736),375) 

1@ DO 28 I=NN,588 
THE ARRAY (HOLD) IS EXAMINED STARTING WITH THE HASHED 
VALUE, IF THE ARRAY ELEMENT CONTAINS THE INPUT VARIABLE N, 
TRANSFER IS MADE TO STMT 4@ AND THE EQUIVALENT SEQUENTIAL 
NUMBER IS RECALLED FROM ARRAY (SAVE). IF THE ARRAY ELEMENT 
CONTAINS A ZERO,TRANSFER IS MADE TO STMT 38 AND A 
NUMERICAL 
EQUIVALENT IS ASSIGNED. 


THE SEARCH OF CHOLD) CONTINUES 


aan 


aagaagagaaanaanrananaangnaananan 


c 
c 
Cc 
c 
c 
c 
Cc 
c 
9 
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UNTIL AN OPEN ELEMENT IS FOUND... 
IF CHOLD(I)-EQ.N) GO TO 4d 
IF CHOLD(1).EQ@.8) GO TO 30 
28 CONTINUE 
IF NO OPEN ELEMENT WAS FOUND AND NN=1 
ELEMENTS IN THE ENTIRE ARRAY. 


THERE ARE NO OPEN 


IF NN IS NOT EQUAL TO 1. SET 


IT TO 1 AND SEARCH LOWER PART OF (HOLD)... 
IF (NN-EQ.1) GO TO 68 
NN = 1 
GO TO 18 
FOUND A NEW LABEL-GIVE IT AN EQUIVALENT SEQUENTIAL NUMBER 
3@ HOLD(1) = N 
NUM = NUM + 1 
IVAL = NUM 
SAVECI) = IVAL 
TRANSFER TO STMT 5@ AND SAVE A REDUNDANT RECALL FROM 
(SAVE) 
GO TO 58 
48 IVAL = SAVECI) 
5@ HASH = IVAL 
RETURN 


AN ERROR MESSAGE IS GENERATED IF THE NUMBER OF EVENTS 
EXCEEDS THE DIMENSION ALLOWED. 
68 WRITE (6,99999) 
STOP 
END 


SUBROUTINE TRNFRM(NACT, Il, Jds NLOC) 
ALL DATA WAS STORED IN THE ARRAYS CII-JJ-NLOC) BY THE 
CALLING ROUTINE AND COMFORMS TO THE FOLLOWING DESCFIPTION 
CNACT) -THE NUMBER OF ARROW ACTIVITIES INCLUDING DUMMIES. 
CII) -AN ARRAY OF CONVERTED -I- LABELS STORED IN THE ARROW 
NETYORK INPUT ORDER-REFER TO THE COMMENTS AFTER STMT 148 
IF USER GERERATED LABELS ARE DESIRED.SEE ALSO COMMENTS IN 
MAIN ROUTINE. 
(JJ) -AN ARRAY LIKE (II) FOR -J~ LABELS EXCEPT THAT THE 
VALUE IS NEGATIVE FOR ALL DUMMY ACTIVITIES. 
(NLOC) -AN ARRAY INDICATING INPUT ORDER.CA SEQUENTIAL LIST 
SUCH THAT THE ABSOLUTE VALUES WOULD RANGE FROM ONE TO NACT 
>) NOTE- THE VALUE STORED IN (NLOC) IS NEGATIVE WHEN THE 
CORRESPONDING ARROW ACTIVITY WAS A -DUMMY- . 
TAPE(4) -A BINARY SCRATCH TAPE FOR TRANSFERING THE TRANS - 
FORMED DATA BACK TO THE MAIN PROGRAM FOP PRINT OUT, OP ANY 
OTHER USE. THE DATA IS IN THE FORM (I,4M,sFOL) WHERE [| IS 
THE NEW ACTIVITY LABEL AND M IS THE NUMBER OF FOLLOVESS 
AND FOL IS AN ARRAY CONTAINING THE LABELS OF THE M 
FOLLOWERS... 
STORAGE FOR THE ARRAYS IS ALSO SPECIFIED 
PROGRAM. 

INTEGER II¢C1), JJC1)>, NLOCCI) 

INTEGER STACK(5@), FOL(5@) 
THE DIMENSION STAMENTS FOR (II-JJ-NLOC) 
FOR USE WITH SOME FORTRAN COMPILERS. 
DIMENSIONS ON STACK AND FOL LIMIT THE NUMBEP. 
ACTIVITIES TO S@. 
STATEMENT FUNCTION TO PROVIDE OVERLAYING ARRAY CII) 
ARRAY CILOC).REFER TO THE WARNING AFTER STMT 38,1F A 
SEPERATE ARRAY CILOC) IS UTILIZED THE STATEMENT FUNCTION 
WOULD BE DELETED. 


IN THE CALLING 


MUST BE MODIFIED 


OF FOLLOWING 


WITH 


9999 FORMAT(4I1H THE FOLLOWING ACTIVITY APPEARS TO HAVE M, 


* 22HORE THAN 5@ FOLLOWERS >) 


99998 FORMAT(4IH SUSPECT THE FOLLOWING ACTIVITY IS INVOLV, 


c 


Cc 
c 


aanaagagagagaaaaanaannga 


* 4IHED IN A NETWORK LOOP - CHECK INPUT DATA. /15) 
ILOCCI> = LIC) 
REWIND TAPE 4 FOR TRANSFER OF TRANSFORMED DATA. 
REWIND 4 


PLACE THE ARRAYS C(II-NLOC) IN ASENDING ORDER USING 
AS THE SORT VARIABLE. (THIS 15 A BUBBLE UP SORT.) 
LIMIT = NACT - I 
DO 2@ M=l,.LIMIT 
LL =M + 1 
DO !@ N=LL,NACT 
IF CIICM)-LE-IICN)) 
IHOLD II CN) 
TICN) T1¢M) 
T1¢M) IHOLD 
IHOLD NLOC CN) 
NLOC(N) = NLOC(M) 
NLOC(M) = IHOLD 
10 CONTINUE 
20 CONTINUE 
REPLACE THE ARRAY CII) WITH AN INTEGER POINTER SUCH THAT 
THE (K TH) ELEMENT OF THE POINTER POINTS TO THE FIRST 
LOCATION IN THE SORTED ARRAY (II) WHICH CONTAINS THE VALUE 
(K).-THE POINTER ARRAY WILL BE CALLED (ILOC) SINCE IT 
INDICATES TRE BEGINNING OF SORTED ARROW NODES (ARRAY I1) 
AND THESE NODES ARE NORMALLY REFERRED TO AS (I) NODES. 


THE VARIABLE (N) IS SET TO THE MINIMUM VALUE IN ARRAY (I1) 
N IS ALSO A VARIABLE THAT INDICATES THE CURRENT VALUE 
UNDER INVESTIGATION IN ARRAY (It). 
L IS A POINTER TO THE ARRAY CILOC), INDICATING THE LOCATION 
OF THE NEXT ELEMENT.IN ADDITION \ ALSO INDICATES THE NEXT 
SEQUENTIAL NUMBER,.AND IS USED TO FIND THE END NODES. (NODES 
WHERE THERE EXISTS NO -I- IN THE (I-J) PAIRS,AND THERE- 
FORE NO ENTRY IN THE SORTED (11) ARRAY..) 

N= 1 

Lee 

DO S5@ 1=2,.NACT 


CII) 


GO T9 18 
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IF CIICI)-EQ.N) GO TO 58 
N = TFI¢l) 
38 IF (N-EQ@-L) GO TO 46 


C THIS TEST FINDS THE REFERENCES TO THE END NODE WHICH WILL 
C NOT BE IN THE SORTED ARRAY OF C(I) NODES. 
C WARNING -- ALTHOUGH INPUT ORDER IS NOT NORMALLY IMPORTANT 
C REFERENCE TO END NODES.,THAT IS (I-J) PAIRS WITH -J- EQUAL 
C TO AN END NODE.sSHOULD BE POSITIONED IN THE LATER PORTION 
C OF THE INPUT DATA-THIS RESTRICTION CAN BE ELIMINATED BY 
C USING A SEPARATE ARRAY FOR (CILOC). 
C 11¢L) IS SET TO ZERO TO INDICATE THAT NODE -L- IS AN END 
C NODE IN THE ARROW INPUT NETWORK. 

IICL) = 8 

Leb * 1 

GO TO 38 


C STORE THE SUBSCRIPT VALUE OF THE ARRAY (11) IN TO THE 
C OVERLAYED ARRAY (ILOC). 
40 II¢L) = I 
L=Ll+i 
56 CONTINUE 
SET THE NEXT LOCATION OF THE POINTER TO CNE PAST THE LAST 
ACTIVITY NUMBER. 
MAXLST = L - 1 
II¢L) = NACT + 1 
FOR ALL NON DUMMY ACTIVITIESs TRANSFORM THE ARROW LOGIC 
CONSTRAINTS INTO THE PRECEDENCE NOTATION BY GIVING THE 
ACTIVITY A LABEL EQUAL TO ITS INPUT ORDER,THEN LIST ALL 
TRANSFORMED FOLLOWERS. 
DO 160 1=1,NACT 


ana 


aaana 


C L INDICATES THE LENGTH OF THE STACK AND M IS THE NUMBER OF 
C FOLLOWERS.THE STACK 1S USED TO RECURSIVELY TRACE ALL 
C DUMMIES TO FIND LOGICAL FOLLOWERS. 
N = JJCI) 
C IF N IS NEGATIVE THE ARROW ACTIVITY WAS A DUMMY. 
IF (N.-LE-8) GO TO 168 
68 LOC = N 
IF :LOC-GT+MAXLST) GO TO 118 
LOC HAS A VALUE EQUAL TO THE -J- LABEL OF ACTIVITY UNDER 
TRANSFORMATION. ILOCR POINTS TO THE BEGINNING OF THAT SAME 
VALUE IN THE SGRTED ARRAY (11).WHEN (LOC) EXCEEDS THE 
VALUE OF \MAXLST) THE -J- LABEL ON THE ARROW NETWORK WAS 
THE END NODE, THEREFORE THERE ARE NO FOLLOWERS. 
ILOCR = ILOC\LOC) . 
IF CI.OCR.LE.@) GO TO 118 
C IF ILOCR IS NEG CR ZERO THE ACTIVITY HAS NO FOLLOWERS. 
a) Loc = LOC + 1 
NN = ILOCCLOC) - ILOCR 
C NN INDICATES THE NUMBER OF ELEMENTS IN ARRAY CII) WITH THE 
C VALUE. 
IF (NN-LE.«@) GO TO 78 
DO 188 LOOP=1,NN 
LOCS = NLOCCILOCR) 
IF CLOCS-EQ.@) GO TO 98 
IF CLOCS.GT.@) GO TO &8@ 
C LOCS NEGATIVE INDICATES A DUMMY AND THESE ARE HELD IN THE 
C STACK FOR LATER CONTINUED SEARCH OF FOLLOWERS. 


aaaan 


L = Lb +1) 
UF (L-GT.5@) GO TO 136 
STACKC(L) = -LOCS 
GO TO 98 
Be M=M + I 


C A FOLLOWER HAS BEEN FOUND.STORE IT IN THE ARRAY (FOL). 
IF (M.GT.58) GO TO 120 
FOLC(M) = LOCS 
C INCREASE THE POINTER TO NEXT POTENTIAL FOLLOWER. 
908 TLOCR = ILOCR + I 
188 CONTINUE 
118 IF (L-LE.@) GO TO 146 


C IF ¢L) IS NON-ZERO,.THERE ARE DUMMY LINKAGES TO BE CONSIDER 
C ED. (N) WILL INDICATE FIRST OF THESE AND THE SEARCH FOR 
C FOLLOWERS WILL CONTINUE. 

K = STACK(L) 

N = IABSCJUCK)) 

Loeb et 

GO TO 68 


C ERROR MESSAGES IF DIMENSIONS EXCEEDED- LOOP ASSUMED. 
120 WRITE (6.99999) 
138 WRITE (6599998) I 
140 WRITE (4) I, Ms FOL 


C IF USER LABELS ARE USED THEY WOULD BE RETRIEVED THUSLY -- 
Cc I = LABLS(I) 
Cc DO 15@ LOOP=I1.4M 
Cc ISUB = FOLCLOOP) 
c FOLCLOOP) = LABLSCISUB) 
Cc 15@ CONTINUE 
C WHERE LABLS WOULD BE AN ARRAY PASSED IN THE ARGUMENT LIST 
168 CONTINUE 
REWIND 4 
RETURN 


END 
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Algorithm 482 
Transitivity Sets [G7] 


John McKay and E. Regener* [Recd. 21 May 1973] 
School of Computer Science, McGill University, Mon- 
treal, Quebec, Canada 


Key Words and Phrases: transitivity, sets 
CR Categories: 5.39 
Language: Algol 


Let P = [Pi, Po,..., Px} be a set of A permutations on the set 
9 = {1, 2,..., a}. The transitivity set containing i (or orbit of 7) 
under P is the set of images of 7 under the action of products of 
elements of P. This procedure computes these orbits. 

On entry, im{i,/] is assumed to contain the image of i under 
P;, fori = 1,2,...,nand/ = 1, 2,...,k. The procedure numbers 
the orbits consecutively starting at 1. On exit ind[i] contains the 
number of the orbit to which i belongs. The orbits appear in order 
in orb[{1in]. In orb the first element of each orbit is tagged negative. 
If only one permutation is input, the array orb contains it (tagged) 
in disjoint cycle form on exit. 

The algorithm, which involves no searching, is related to one 
for finding a spanning tree of a graph [1]. The set P need not, in 
general, generate a group—it is sufficient that it generate a semi- 
group on 2. 


References 
1. Cannon, J. Ph.D. Th., Sydney U., Sydney, N.S.W., Australia, 
1969. 


Algorithm 
procedure orbits (ind, orb, im, n, k); 
value 1, X; integer n, k; 
integer array ind, orb, im; 
begin 
integer g, r, 5, j, nt, ns, norb; 
for j := 1 step 1 until 7 do ind{j] := 0; 


norb := O; ns := 1; 
for r := 1 step 1 until 1 do if ind[r] = O then 
begin 
norb := norb + 1; ind[r] := norb; 
nt := ns; orb[ns] := —r;s:= 4, 
a: 
ns i= us +1; 
for j := 1 step 1 until k do 
begin 
q := im|s,j); 
if ind[q] = O then 
begin 
nt:= nt + 1; orb(nt] := q; ind{[q] := norb 
end 
end; 


if ns < nt then 
begin s := orb{ns]; go to a end 
end 
end 


* Now at Faculté de Musique, University de Montréal, Mon- 
tréal, P.Q., Canada. 
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Editors’ note: A/gorithm 483 described here is available on magnetic 
tape from the Department of Computer Science, University of 
Colorado, Boulder, CO 80302. The cost for the tape is $16.00 (U.S. 
and Canada) or $18.00 (elsewhere). If the user sends a small tape 
(wt, less than 1 1b.) the algorithm will be copied on it and returned to 
him ata charge of $10.00 (U.S. only). All orders are to be prepaid 
with checks payable to ACM Algorithms. The algorithm is recorded 
as one file of BCD 80 character card images at 556 B.P.I., even 
parity, on seven track tape. We will supply algorithms at a density of 
800 B.P.I. if requested. Cards for algorithms are sequenced starting 
at 10 and incremented by 10. The sequence member is right justified in 
column 80. Although we will make every attempt to insure that the 
algorithm conforms to the description printed here, we cannot guaran- 
tee it, nor can we guarantee that the algorithm is correct.— 
L.D.F. and A.K.C. 


Algorithm 483 


Masked Three-Dimensional 


Plot Program with Rotations 
[J6] 


Steven L. Watkins [Recd. 26 March 1973] Applied 
Research Laboratories, The University of Texas at 
Austin, Austin, TX 78712 


Key Words and Phrases: plotting, three-dimensional plotting 
CR Categories: 4.41 
Language: Fortran 


Description 

PLOT3D will accept three-dimensional data in various forms, 
rotate it in three-space, and plot the projection of the resulting 
figure onto the x-y plane. Those lines or portions of lines which 
should be hidden by previous lines are masked. 

Each call to PLOT3D causes one line to be plotted. A line con- 
sists of a sequence of points in three-space which will be connected 
using linear interpolation between adjacent points. This sequence 
of points is specified by three sequences of real numbers, the x, y, 
and z components of each point. Each of these sequences of real 
numbers can be specified either as being equally spaced, and there- 
fore denoted by an initial value and an increment, or as being con- 
tained in a real array. There is no restriction that any of the three 
component sequences be either increasing or decreasing, and the 
number of points may change between successive calls. 

The masking technique used by PLOT3D is based on two 
premises: (1) lines in the foreground (positive z direction) are 
plotted before lines in the background; and (2) a line or portion of 
a line is masked (hidden) if it lies within the region bounded by 
previously plotted lines. Masking is then achieved by maintaining 
a visible maximum function and a visible minimum function. Those 
portions of each line falling within the region bounded by these 
functions are considered to be hidden. Any line which exceeds user 
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Fig. 1. 


ROTATIONS ABOUT Tit 


VERTICAL XT 


PHI = 20.0 THETA = 


PHI = 65-0 THETA = -10.0 PHE = DOQ.Q THpTQ = 


PH] = 155-Q THETA = -10.0 PHI = 


specified limits is truncated without the loss of the plotter origin. 
A call to PLOT3D before initiating a new figure can be used to 
simulate a line drawn at the bottom of the paper; therefore, only 
those portions of each line lying above all previous lines will be 
drawn. 

The data are transformed by a three-dimensional rotation de- 
termined by two user specified angles. PLOT3D assumes a right- 
hand coordinate system with x running the length of the paper, y 
running across the width, and z coming out of it. The figure is 
first rotated by an angle of @ degrees clockwise about the x-axis. 
The resultant figure is then rotated by an angle of @ degrees about 
its y-axis. The plotted figure is the projection of this final figure 
onto the x-y plane. Figure 1 demonstrates rotations about the 
vertical or y-axis, and Figure 2 demonstrates rotations about the 
horizontal or x-axis. Warning: Some rotations will alter the fore- 
ground/background relationships between the lines, and thus the 
order in which they should be plotted to avoid violating the first 
masking premise. 

As an option, the coordinates of the vertices of the figure and 


COLLECTED ALGORITHMS (cont.) 


the projection of these vertices onto the y = 0 plane of the figure 
will be returned in a user supplied array. This information can then 
be used to put a frame on the figure, as is done in the example 
program, or to connect the endpoints of each line, or to plot axes, 
etc. 

Crosshatched figures are easily obtained as is demonstrated by 
the example program which generated Figure 3. Some perspective 
can be achieved by modifying the data scaling parameters between 
successive calls. PLOT3D attempts to minimize plotter movement 
by beginning at the alternate end of successive lines. A more de- 
tailed description of the parameters is contained in the comments 
at the beginning of the program listing. 

This: routine was developed at the Applied Research Labora- 
tories on their Control Data Corporation 3200 computer system. 
The following system routines were utilized: 


TROUND(X) returns the rounded integer value of its floating 
point argument. 
IPLOTUX, LY, J) moves the pen to the point (1X, 7Y) where: 
IX is the number of plotter increments along the length of the 
paper from the origin 
IY is the number of plotter increments across the width of 
the paper from the origin 
J is the pen status 
2--lower pen before moving 
3 --raise pen before moving 
If J is negative, the origin will be reset at (1X, /Y). 


15.0 THETA 


RATS. S20" THETA. 195-0 PHI] = 15-0 THETA = 200.0 
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Fig. 3. 
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Algorithm 
(A sample driver has been included to illustrate the use of this 


algorithm —-L.D.F. and A.K.C.) 


C THIS PROGRAM GENERATES AN EXAMPLE OF A CFROSSHATCHED 
C FIGURE, THAT IS, ONE FIGURE WHOSE LINES RUN PAPALLEL TO 
© THE X-AXIS OVERLAYED BY ANOTHER FIGURE WHOSE LINES FUN 
C PARALLEL TO THE Z-AXIS. THE FUNCTION IS A PRODUCT To TWO 
C SINC CI.LE. SINFCX)/X) FUNCTIONS. 
DIMENSION MASK(20@62@), VERTEX(16), OUTBUF( 61), 2¢61) 
C FIRST FIGURE 
C GENERATE DATA RUNNING PARALLEL TO X-AXIS 
DO 28 NLINE=1,6€1 
BEAMV = SINCC15.@*SINFCC3*NLINE-93)*2.217453293)) 
DO 1@ NPOINT=1,61 


OUTBUFC(NPOINT) = 
* BEAMU*SINC( 7. S&SINFCC3*NP 2 3) *0.017453293)) + 
* @.25 


16 CONTINUE 
C PLOT EACH LINE AS IT IS COMPUTED 
CALL PLOT3D(1@, @.8, OUTBUF, @.@, Gel, 4-@, -@B. 1, 
* NLINE, 61, -45e, -45e4 568, 348, 1828, MASK, 0) 
20 CONTINUE 
C€ SECOND FIGURE 
C GENERATE ARRAY OF Z2-COMPONENTS 
DO 3@ NLINE=1,61 
ZCNLINE) = -@.1*(NLINE-1) 
3@ CONTINUE 
C GENERATE DATA RUNNING PARALLEL TO Z-AXIS 
DO 5@ NLINE=1,61 
X = @.1*¥(NLINE=1) 
BEAMV = SINCC 7. 5*SINFCC3*NLINE-93)*@. 017453293) ) 
DO 48 NPOINT=1,61 
OUTBUFCNPOINT) = 
* BEAMV*SINCC15.0*SINFCC3*NPOINT-93)*@.017453293)) + 
* @.25 
42 CONTINUE 
C PLOT EACH LINE AS IT IS COMPUTED 
CALL PLOT3DCI@11, Xs OUTBUF, Z, 8.0, 4.8, 1-2, 
* NLINE, 615 -45e, -45., 5.0.5 3.0, 10.6, MASK, VEPTEX >) 
5@ CONTINUE 
C DRAW A FRAME ON THE FIGUPE 
CALL FRAMER(3, VERTEX, MASK: 
STOP 
END 


SUBROUTINE PLOT3DC(IVXYZ. XDATAs YDATA, ZDATA,s XSCALE, 

* YSCALE, ZSCALEs NLINE, NPNTS,s PHI. THETA, XREF, 

* YREF, XLENTHs MASKs VERTEX) 
C MASKED 3-DIMENSIONAL PLOT PROGRAM WITH ROTATIONS 
C THIS ROUTINE WILL ACCEPT 3-DIMENSIONAL DATA IN VARIOUS 
C FORMS AS INPUT, ROTATE IT IN 3-SPACE TO ANY ANGLE. 
C AND PLOT THE PPOJECTION OF THE FESULTING FIGUPE ONTO THE 
C XY PLANE. LINEAR INTERPOLATION IS USED BETWEEN DATA 
C POINTS. THOSE LINES OF A FIGURE WHICH SHOULD EE HIDDEN EY 
C A PREVIOUS LINE ARE MASKED. 
C THE MASKING TECHNIQUE USED BY THIS ROUTINE IS EASED ON 
C TWO PREMISES - 
LINES IN THE FOREGROUND (POSITIVE Z CIPRECTION) 
ARE PLOTTED BEFORE LINES IN THE BACKGROUND. 
A LINE OF PORTION OF A LINE IS MASKELP CHIDDEN) IF 
IT LIES WITHIN THE REGION BOUNDED BY PREVIOUSLY 
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PLOTTED LINES. 

EACH CALL TO PLOT3D CAUSES ONE LINE OF A FIGUPE TO PE 
PLOTTED- 
TWO PARAMETERS OF THE PLOTTER ARE SET ON THE INITIAL CALL 
FOR EACH FIGURE - 
CPIPI) IS THE NUMBER OF PLOTTER INCREMENTS PER INCH. 
CNYPI) IS THE NUMPER OF INCREMENTS AVAILABLE ACROSS THE 
WIDTH OF THE PAPER (Y-DIRECTION). 
WHEN A NEW FIGURE IS INITIATED, THE PLOTTEP ORIGIN IS SET 
AT THE BOTTOM OF THE PAPER BY PLOT3D AND SHOULD NOT BE 
MOVED UNTIL THE FIGURE IS COMPLETED. 
INPUT PARAMETERS - 
CIVXYZ) 1S A FOUR DIGIT DECIMAL INTEGER WHICH IS USEP TO 
SELECT VARIOUS INPUT/OUTPUT OPTIONS. THESE DIGITS, IN 
DECREASING ORDER OF MAGNITUDE, WILL BE REFERFED TO AS Vs 
X, Ys AND Z. 
IF V .NE. @, THE VERTICES OF THE CURRENT FIGURE ANC THEIP. 
PROJECTION ONTO THE Y=® PLANE, WILL BE STORED IN A 16 
ENTRY REAL ARRAY (VERTEX), AND WILL BE UPDATED AS EACH 
LINE IS PLOTTED. THESE COORDINATES ARE IN INCHES AND 
RELATIVE TO THE CURRENT PLOTTER ORIGIN. THE X Y PAIPS 
ARE ORDERED SO THAT THE FIRST PAIR COORESPONDS TO THE 
FIRST POINT OF THE FIGURE. THE SECOND PAIR COORESPONDS 
TO THE LAST POINT OF THE FIRST LINE» AND THE FOLLOWING 
PAIRS ARE ORDERED IN A CIRCULAR FASHION. THE PAIRS ON THE 
Y=@ PLANE OF THE FIGURE, THEN FOLLOW IN THE SAME ORDER. 
IF V=@, THE VERTEX PARAMETER IS IGNORED, EUT SHOULD NOT 
BE DELETED 
IF X=@, THE X-COMPONENTS OF THIS LINE ARE ASSUMED TO BE 
EQUALLY SPACED, AND ARE COMPUTED BY 

XCI)=XDATA+ C1 -1)*XSCALE 
WHERE (XDATA) IS THE INITIAL VALUE IN INCHES AND (XSCALE) 
IS THE SPACING BETVEEN POINTS IN INCHESe IF X «NE. ©, THE 
X-COMPONENTS OF THIS LINE APE READ FROM AN ARRAY AND 
MODIFIED BY 

XCI)=XDATACI) ®XSCALE 
WHERE (XSCALE) IS A SCALE FACTOR. 
THE SAME RELATIONS HOLD FOR THE Y-COMPONENTS, 
Y=@ 


THAT IS, IF 


YCI)2YDATAt CI-~1)*YSCALE 
AND IF Y .NE. @ 

YCI)=YDATACLI) *YSCALE 
IF Z=@, THE Z-COMPONENTS OF THIS LINE ARE ALL: ASSUMED TO 
BE EQUAL, AND ARE COMPUTED BY 

ZCL)SZDATA+(NLINE-1)*ZSCALE 
WHERE (NLINE) IS SOME INTEGER ASSOCIATED WITH THIS LINE. 
IF Z2 .NE.s @, AGAIN WE HAVE , 

ZCI) =ZDATACI) *ZSCALE 
WHEN (NLINE) IS EQUAL TO ONE, IT INDICATES THE BEGINNING 
OF A NEW FIGURE. A CALL TO PLOT3D WITH (NLINE) EQUAL TO 
ZERO BEFORE INITIATING A NEW FIGURE SIMULATES A LINE DRAWN 
AT THE BOTTOM OF THE PAGE. THEREFORE ONLY THOSE PORTIONS 
OF A LINE LYING ABOVE ALL PREVIOUS LINES WILL BE PLOTTED. 
ALL OTHER PARAMETERS ARE IGNORED ON SUCH A CALL. 
(NPNTS) IS THE NUMBER OF POINTS ON THIS LINE, AND MAY BE 
ALTERED FROM LINE TO LINE. ; 
(PHI) AND (THETA) APE THE TWO ANGLES CIN DEGREES) SED TO 
SPECIFY THE DESIRED 3-DIMENSIONAL ROTATION. THE FOLLOWING 
TWO DEFINIATIONS OF THESE ROTATIONS ARE EQUIVALENT - 
IN TERMS OF ROTATIONS OF AXES,» THE INITIAL SYSTEM OF AXES, 
XYZ. 1S ROTATED BY AN ANGLE (PHI) COUNTERCLOCKWISE ABOUT 
THE Y-AXIS, AND THE RESULTANT SYSTEM IS LABELED THE TUV 
AXES. THE TUV AXES ARE THEN ROTATED BY AN ANGLE (THETA) 
COUNTERCLOCKWISE ABOUT THE T-AXIS.s AND THIS FINAL SYSTEM 
IS LABELED THE PQR AXES. THE PLOTTED FIGURE .IS THE 
PROJECTION OF THE ORIGINAL FIGURE ONTO THE PQ-PLANE. 
IN TERMS OF ROTATIONS OF COORDINATES, THE FIGURE IS FIPST 
ROTATED BY AN ANGLE (THETA) CLOCKVISE ABOUT THE X~-AXIS. 
THE RESULTANT FIGURE IS THEN POTATED BY AN ANGLE (PHI) 
CLOCKWISE ABOUT ITS Y-AXIS. THE PLOTTED FIGURE IS THE 
PROJECTION OF THIS FINAL FIGURE ONTO THE XY-PLANE. 
WARNING. SOME ROTATIONS WILL ALTER THE FOREGROUND/ 
BACKGROUND RELATIONSHIPS BETWEEN THE LINES, AND 
THUS THE ORDER IN WHICH THEY SHOULD BE PLOTTED. 
(XREF) AND (YREF) ARE THE COORDINATES, IN INCHES, 
RELATIVE TO THE PLOTTER ORIGIN, TO EE USED AS THE OPIGIN 
OF THE FIGURE. 
(XLENTH) IS THE LENGTH, IN INCHES, TO WHICH THE PLOT IS 
RESTRICTED. ANY POINT WHICH EXCEEDS THIS LIMIT, OP THE 
LIMITS OF THE PAPER IN THE Y DIRECTION (NYPI), WILL BE 
SET TO THAT LIMIT. 
(MASK) IS AN INTEGER ARRAY OF @*XLENTH*PIPI ENTSIES WHICH 
1S USED TO STORE THE MASK. THE CONTENTS OF THIS ARRAY 
SHOULD NOT BE ALTERED DURING THE PLOTING OF ANY GIVEN 
FIGURE. 
ALL PARAMETERS EXCEPT (MASK) 
UNCHANGED. 
BETWEEN ANY TWO CALLS FOR THE SAME FIGUFE, 


AND (VERTEX) ARE PETUFNED 


ANY PAPAMETEP 


CAN BE MEANINGFULLY CHANGED EXCEPT (XLENTH), (MASK), AND 
(VERTEX). 
INTEGER HIGH, OLDHI, OLDLOV 
DIMENSION XDATAC1), YDATAC!), ZDATACI), MASK(C1), 
* VERTEXC1) 
DATA INIT, JVXYZ, SPHI, STHETA/-1s4 -1lse -1+@E99, 


* -1,0E99/ 
INITIALIZATION PROCEDURES 
INITIALIZATION PPOCEDURE FOR A NEW FIGURE 
TEST FOR SPECIAL MASK MODIFYING CALL 
IF (NLINE-EQ.@) GO TO 558 
DETERMINE IF INITIALIZATION IS FEQUIFED 
IF (NLINE.NE-1)2 GO TO 28 
SET PLOTTER PARAMETERS 
PIPI 10@.@ 
NYPI 1296 


c 
c 


c 


c 


c 


agaangaaaa 
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RESET PLOTTER ORIGIN TO BOTTOM OF PLOT PAGE 
I = NYPL + 188 
CALL IPLOT(@, -I, -3) 
COMPUTE LENGTH OF PLOT PAGE IN INCREMENTS 
LIMITX = XLENTH*PIPI + @.5 
I = LIMITX # LIMITX. 
INITIALIZE MASKING ARRAY OVER THE LENGTH OF THE 
DO 18 K=ls1 
MASK(K) = 
18 CONTINUE 
INIT = -1 
SET THE NECESSARY INDICATORS FOR THE FIRST LINE OF A NEW 
FIGURE 
INCI = =1 
I= @ 
INPUT TYPE AND VERTEX INITIALIZATION 
DETERMINE IF INITIALIZATION IS REQUIRED 
20 IF (JVXYZ.EQ.IUXYZ) GO TO 78 
SET INDICATORS FOR TYPES OF INPUT DATA AND SAVING VERTICES 


PLOT PAGE 


INIT 


JVXYZ = IVXYZ 
INDZ = 1 
INDY = 1 
INDX = 1 
INDV = 1 
IF (JVXYZ.LT~.1@08) GO TO 30 
INDV = 2 
JVXYZ = JVXYZ - 18868 

30 IF (JVXYZ~-LT.-108) GO TO 4@ 
INDX = 2 
JVKXYZ = JVXYZ - 100 

4@ IF (JVXYZ»LT-1@) GO TO 5S@ 
INDY = 2 
JVXYZ = JVXYZ - 18 

50 IF (JVXYZ.LT.1) GO TO 68 
INDZ = 2 

6@ JVXYZ = [VXYZ 


ROTATION INITIALIZATION 
DETERMINE IF INITIALIZATION IS REQUIRED 

70 IF (PHI-EQ.+SPHI «AND. THETA-EQ@+STHETA) GO TO 8@ 
COMPUTE ROTATION FACTORS 


SPHI = SINF(@.@174532925*PHI) 
CPHI = COSF(®.@174532925*PHI) 
STHETA = SINF(@.0174532925*THETA) 
CTHETA = COSF(@-.8174532925*THETA) 
All = CPHI 

Al3 = -SPHI 

A2l = STHETA*SPHI 

A2e = CTHETA 

A23 = STHETA*CPHI 

SPHI = PHI 

STHETA = THETA 


PROCESSING PROCEDURES 
SET FLAG TO MOVE THROUGH THE DATA ARRAYS IN THE OPPOSITE 
DIRECTION 
8@ INCI = -INCI 
SET INDICATOR TO THE FIRST POINT TO BE PROCESSED 
IF (I-NE.@) I = NPNTS + 1 
LOOP TO PROCESS EACH POINT IN THE DATA ARRAYS 
DO 53@ K=!,NPNTS 
DATA CALCULATION 


I= 1 + INCI 
GO TO (96,180), INDX 

98 X = XDATA + (1-1)*XSCALE 
GO TO 118 

18 X = XDATACI)*XSCALE 


110 GO TO (128.138). INDY 
122 Y = YDATA + (1-1) *YSCALE 
GO TO 148 
138 Y = YDATACI)D*YSCALE 
148 GO TO (158,160). INDZ 
158 Z = ZDATA + (NLINE~1)*ZSCALE 
GO TO 178 
168 Z = ZDATACI) *ZSCALE 
DATA ROTATION 
172 XXX = Al1*X + AL3*Z + XREF 
XX = XXX 
IX = ITROUND(XX*PIPI) 
YYY = AQl*X + AC3*Z + YREF 
YY = YYY + A22*Y 
TY = IROUNDCYY*PIPI) 
RESTRICT FIGURE TO PLOT PAGE 
IF (IX.LE-@) IX = 1 
IF (1X.GTeLIMITX) IX = LIMITX 
IF CIY-LT-180) IY = 108 
IF (1Y.GTeNYPI) IY = NYPI 
IF (KeNE.-1) GO TO 250 
(LOC) IS THE POSITION OF THE PREVIOUS POINT WITH RESPECT 


TO THE MASK 
+! ABOVE THE MASK 
@ WITHIN THE LIMITS OF THE MASK 
-1 BELOW THE MASK 
PROCEDURE FOR INITIAL POINT OF EACH LINE 
LOCATE INITIAL POINT WITH RESPECT TO THE MASK THEN 
UPDATE THE MASK 


LOW = IX + IX 

HIGH = LOW - 1 

MLOW = MASKCLOW) 

MHIGH = MASKCHI GH) 

IF (MHIGH-1Y) 200, 218, 182 
188 IF (MLOV-IY) 190, 238, 228 
198 LOCOLD = @ 

GO TO 240 
208 MASKCHIGH) = 1Y 


IF (MLOVW.EQ.-1) MASKC(LOW) = IY 
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Cc 


212 LOCOLD = +1 

GO TO 248 
220 , MASKCLOW) = IY 
238 ©=LOCOLD = -! 


MOVE THE RAISED PEN TO THIS INITIAL POINT 
2406 CALL IPLOTCIX, IY, 3) 

JX = IX 

JY = IY 

IYREF = IY 
STORE: VERTICES IF REGUESTED 

IF CINDV.EG-.1) GO TO 538 


INDEX = INCI + 6 
VERTEXCINDEX) = XX 
VERTEXCINDEX+1) = YY 
VERTEXCINDEX+&) = XXX 
VERTEXCINDEX+9) = YYY 
IF (NLINE.NE.1) GO TO 538 


/VERTEXC1) = XX 
-VEFRTEX(2) = YY 


VERTEX(9) 3 XXX 
VERTEX(18) = YYY 
GO TO 536 


SPECIAL CASE WHERE CHANGE IN X COORDINATE IS ZERO 
A SPECIAL PROVISION IS MADE AT THIS POINT SO THAT A LINE 
WILL NOT MASK ITSELF AS LONG AS THE X COORDINATE REMAINS 
CONSTANT 
258 IF (IX.NE«JX) 
JY = 1Y 
GO TO 288 
COMPUTE CONSTANTS FOR LINEAR INTERPOLATION 


GO TO 262 


266 YINC = FLOATCIY-JY)/ABSCFLOATCIX-JX)) 
INCX = CIX-JX)/TABSCIX-Jx) 
YJ = JY 
PREFORM LINEAR INTERPOLATION AT EACH INCREMENTAL STEP ON 
THE X AXIS 
270 .JX = JX + INCX 
“Yd = Yd + YINC 
JY = IROUNDCYJ) 


LOCATE THE CURRENT POINT VITH RESPECT TO THE MASK AT THAT 
POINT THEN PLOT THE INCREMENT AS A FUNCTION OF THE 
LOCATION OF THE PREVIOUS POINT WITH RESPECT TO ITS MASK 


LOW = JX + JX 

‘HIGH = LOW - 1 

MLOW = MASKCLOW) 

MHIGH = MASKCHIGH) 
286 ‘IF (MHIGH-JY) 380, 388, 29¢ 
2908 IF (MLOW-JY) 318, 328, 328 
THE CURRENT POINT IS ABOVE THE MASK 
328 LOC = +1 

IF CLOCOLD) 368, 378, 438 
THE CURRENT POINT IS WITHIN THE MASK 
3168 LOC = @ 

IF (LOCOLD) 348, 358, 332 
THE CURRENT POINT IS BELOW THE MASK 
328 LOC = -1 

IF CLOCOLD) 518, 458, 446 


PLOT FROM ABOVE THE MASK TO WITHIN THE MASK 

338 IF (MHIGHeLE-@YREF) CALL IPLOTCUX, MHIGHs 2) 
GO TO 358 

PLOT FROM BELOW THE MASK TO WITHIN THE MASK 

348 IF (MLOW.GE.«IYREF) CALL IPLOTCJUXs, MLOWs 2) 

PLOT FROM WITHIN THE MASK TO WITHIN THE MASK 

3508 CALL IPLOT(CUXs JY» 3) 
GO TO 528 


PLOT FROM BELOW THE MASK TO ABOVE THE MASK 
368 IF (MLOW-IYREF) 370, 386, 388 
PLOT FROM WITHIN THE MASK TO ABOVE THE MASK 
3708 IF (MHIGH-IYREF) 468, 398, 398 
388 CALL IPLOTCJX, MLOW, 2) 
398 CALL IPLOTCJXs MHIGH, 3) 

GO TO 430 
408 IF (MHIGH.-EQ.~1) GO TO 43@ 

OLDHI = HIGH - 2*INCX 

IF CMASKCOLDHID-JY) 420, 428, 418 
410 CALL IPLOT(UX, JYs 3) 

GO TO 438 
428 CALL IPLOTCJUX-INCX, MASKCOLDHI), 3) 
PLOT FROM ABOVE THE MASK TO ABOVE THE MASK 
438 MASKC(HIGH) = JY 


IF (MLOW.EQ--1) MASKCLOW) = JY 
CALL IPLOT(JUXs JYs 2) 
GO TO 520 
PLOT FROM ABOVE THE MASK TO BELOW THE MASK 


448 IF (MHIGH-IYREF) 460, 460, 458 
PLOT FROM WITHIN THE MASK TO BELOW THE MASK 
450 IF (MLOW-IYREF) 478, 478, 4680 
460 CALL [PLOTCJX, MHIGHs 2) 
478 CALL IPLOTCJX, MLOW, 3) 
GO TO 518 
480 OLDLOW = LOW - 2*INCX 
IF (MASKCOLDLOW)-JY) 498, S88, 588 
496 CALL IPLOT(UXs, JY¥s 3) 
GO TO 510 
588 CALL IPLOTC(CUX-INCX, MASKCOLDLOV), 3) 


PLOT FROM BELOW THE MASK TO BELOW THE MASK 


51 MASK(LOW) = JY 

CALL IPLOTCJAs SYs 2) 
See IYREF = JY 

LOCOLD = LOC 

IF CJXeNE.IX) GO TO 278 
538 CONTINUE 
RAISE PEN 

CALL IPLOT(CJUXs JY, 3) 


c 


aaa 


aqgaraanannaaanannaa 


c 
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STORE VERTICES IF REQUESTED 
IF CINDV.EQ.1) GO TO 546 
INDEX = -INCI + 6 
VERTEXCINDEX) = XX 
VERTEXCINDEX+1) = YY 
VERTEXCINDEX+&) = XXX 
VERTEXCINDEX+9) = YYY 
IF (NLINE-NE.!) GO TO 54@ 
VERTEX(3) = XX 
VERTEX(4) = YY 
VERTEXC11) = XXX 
VERTEX(12) = YYY 

54@ 1 = 1- 1 

RETURN TO CALLING PFEOGRAM 
RETURN 


OPTION TO MODIFY THE MASKING TECHNIGUE TO BE USED ON THE 
FOLLOWING FIGURE SO AS TO PLOT INLY ABOVE ALL PREVIOUS 
LINES. 
550 INIT = @ 

RETURN 

END 


SUBROUTINE FRAMERCIHCORs, VERTEXs MASK) 

ROUTINE TO PLOT A FRAME ON THE PROJECTION OF A 
3-DIMENSIONAL FIGURE AS DRAWN BY PLOTSD. 
INPUT PARAMETERS - 

THCOR NUMBER OF THE VERTEX OF THE FIGURE WHICH 
APPEARS TO BE FURTHEST IN THE BACKGROUND 
(MINUS Z DIRECTION). 

ARRAY CONTAINING THE COORDINATES OF THE 
VERTICES OF THIS FIGURE AS RETURNED FROM 
PLOT3D ON THE LAST CALL. 

ARRAY CONTAINING THE MASK FOR THIS FIGURE 
AS RETURNED BY PLOT3D ON THE LAST CALL. 

THE VERTICES OF THE FRAME ARE NUMBERED (1-4) IN THE SAME 
ORDER AS THEIR COORDINATES APPEAR IN VERTEX. 

THE MASK ARRAY IS ALTERED BY THIS ROUTINE, 

BUT THE PLOTTER ORIGIN IS NOT MOVED. 

DIMENSION VERTEX( 1), MASKCI)s ARRAYC!4) 

I = 2*IHCOR 

IF (I.-LT.2) I = 2 

IF (1.GT.8) I = 8 

VERTICES WHICH MAY BE HIDDEN 

DRAWN BY CALL TO PLOTS3D. 

ARRAY (1) VERTEXCI-1) 

ARRAY (8) VERTEXCI) 

ARRAY (2) VERTEX(1+7) 

ARRAY (9) VERTEX(1+8) 

ARRAY (4) ARRAY (2) 

ARRAY(11) = ARRAY(9) 

ARRAY(6) = ARRAY(C2) 

ARRAYC13) = ARRAY(9) 

ARRAY(7) = ARRAY(C1) 

ARRAY(14) = ARRAY(8&) 

IT=z=te-e2 

IF ¢(1.EQ. 8) 

ARRAY(3) = 

ARRAYCI@) = 

I=z=Jtl+44 

IF (1.GT.8) 

ARRAY(5) = VERTEX(1+7) 

ARRAYC12) = VERTEXC(1+6) 

CALL PLOT3D¢(!1@, ARRAYs ARRAY(8), 8.8, 1.8, 
* 25 Ts O68, GeB, O-Os O.B, BeBs, MASK, @) 
THE REMAINING VERTICES ARE DRAWN BY CALLS TO PLOT. 

CALL PLOTCVERTEX(I-1), VERTEX(I)s 3) 

T=tIt-e 

DO 18 J=els3 

I=I1I+e2 
IF ¢€I-EQ.10) I = 2 
CALL PLOTCVERTEXC1+7) 5 

10 CONTINUE 

CALL. PLOTCVERTEX(I-1), 

I=It-e2 

IF CI-EQ.-@) I = & 

CALL PLOTCVERTEXCI-1), 

CALL PLOTCVERTEXC(I+7), 

RETURN 

END 


VERTEX 


MASK 


THE 
ARE 


hut now D> 


I= & 
VERTEXCI+7) 
VERTEX( I +8) 


1 =f - 8 


1.0, @.@, 


VERTEXC(1+8)4 2) 


VERTEX(C1)s 2) 


VERTEXC1)», 3) 
VERTEX(I +8), 2) 
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ACM Transactions on Mathematical Software, Vol. 1, No. 3, September 1975. 
REMARK ON ALGORITHM 483 

Masked Three-Dimensional Plot Program with Rotations [J6 ] 
[S. L. Watkins, Comm. ACM 17,9 (Sept. 1974), 520-523 | 


Robert Feinstein [Reed 28 April 1975 ] 
The Marine Biomedical Institute, The University of Texas Medical Branch at 
Galveston, 200 University Boulevard, Galveston, TX 77550 


In the sample main program of Algorithm 483, line 13 should read: 
* BEAMV*SINC(7.5*SINF ((3*NPOINT—93) *0.017453293) ) + 


Further, the algorithm docs not define subroutine PLOT. which is called by 
FRAMER. Whereas IPLOT accepts coordinates in increments, PLOT accepts 
coordinates in inches. 

I have modified this algorithm to run on a PDP 11/45-GOULD 5000 and would 
be happy to supply a listing to anyone who desires it. | 


COLLECTED ALGORITHMS FROM CACM 


Algorithm 484 


Evaluation of the Modified 
Bessel Functions K,(z) and 


K.,(z) for Complex Arguments 
[S17] 


Keith H. Burrell [Recd. 30 Mar. 1972] California Insti- 
tute of Technology, Pasadena, CA 91109 


Work supported by the Atomic Energy Commission under 
contract AT(04-3)767. 


Key Words and Phrases: Bessel functions, Hankel functions, 
modified Bessel functions, Gauss- Hermite quadrature 

CR Categories: 5,12 

Language: Fortran 


Description 

Introduction. This procedure evaluates the real and imaginary 
parts of the modified Bessel functions Ko(z) and Ki(z) for values 
of the complex argument z = x + /y in the half plane x > 0. (The 
notation K,(z) is fairly standard; the exact definition of the func- 
tion is given in [1]). 

Methods for the published algorithm. Many previous methods 
of calculating these functions have simply used the series expansion 
for arguments z of small magnitude (i.e. | z| < 11) and the asymp- 
totic expansion for larger arguments. Rewriting eqs. 9.6.11 and 
9.7.2 of {1] in a form more suitable for recursive computation, the 
series expansion may be expressed as 


Ki(z) = ps to; (z) (z/2)*7/(j!)? (1) 
Ky(z) = 1/z> 2/22) hhj(z)(2/2)7/G!), (2) 


too = — (In(z/2) + y); 
toy at togi-1) + 1/j, J > 0, 
hy = [ta + 1/@i + 2)1//G + 1), #2 9, 


where + = 0.577... 


li 


is Euler’s constant; the asymptotic expansion 


Editors’ note: Algorithm 484 described here is available on magnetic 
tape from the Department of Computer Science, University of 
Colorado, Boulder, CO 80302. The cost for the tape is $16.00 (U.S. 
and Canada) or $18.00 (elsewhere). If the user sends a small tape 
(wr. less than 1 1b.) the algorithm will be copied on it and returned to 
him ata charge of $10.00 (U.S. only). All orders are to be prepaid 
with checks payable to ACM Algorithms. The algorithm is recorded 
as one file of BCD 80 character card images at 556 B.P.1., even 
parity, on seven track tape. We will supply algorithms at a density of 
800 B.P.1. if requested. Cards for algorithms are sequenced starting 
at 10 and incremented by 10. The sequence number is right justified in 
column 80. Although we will make every attempt to insure that the 
algorithm conforms to the description printed here, we cannot guaran- 
tee it, nor can we guarantee that the algorithm is correct.—L.D.F. 
and A.K.C. 
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may be written as 


K,(z) a, (=) 


[ano + Ant/11(8z) + a@ne/21(8z)® + @nz/3(8z)? + +++] 
ko = -1, kj = ky; —8, 

ao = 1, ao; = (kj — 4)anj-1), 

a=1, ay = kya. 


Methods based solely on these expansions tend to be ineffi- 
cient because of the large number of terms in the series that must 
be evaluated when | z| gets as large as 10. Further, they are of 
limited accuracy due to the loss of significant digits in summing 
the series when y < x and x 2 5. To overcome these difficulties, 
the integral representation developed by Hunter [2] can be used. 


00 
K,(z) = vres/rine (ae) f en Ppn(2z+p2)—3 dt, (3) 


jargz| <7. 


Hunter suggests evaluation of this integral by means of the trape- 
zoidal rule, which is well suited to integrands of this type, but one 
can achieve equivalent accuracy with fewer evaluations of the 
integrand by using Gauss-Hermite quadrature [3]. 

To have a fast, accurate algorithm, the functions must be 
evaluated by different methods in different regions of the complex 
plane. Owing to the singularity at the origin for K,(z), only the 
series expansions will be useful near z = 0. For moderate values of 
| z|, the integral representations will be the most useful, while for 
| z| large, calculation of the asymptotic expansions will be faster 
than that of the integral. To decide exactly where each method 
should be used and how good the resulting algorithm is, one must 
be able to assess the speed and accuracy of each method. This 
could be done from first principles; but since close estimates of the 
error tend to involve considerable mathematical labor, I thought 
it easier to write a test algorithm which, although very slow, would 
evaluate Ky(z) and Ki(z) quite accurately. 

Test algorithm. For | z| < 3, the test algorithm uses the series 
expansions; otherwise, the integral representation in eq. (3) is 
evaluated using the trapezoidal rule. To find the error in this 
algorithm, consider first the truncation error caused by stopping 
after 1 terms of the series in eqs. (1) and (2). 

Using the integral representation (eq. 9.6.17 in [1]) 


Kz) = — Val doez»>8{~ + In (2z sin? 6)] 
0 


and the identities 
Ky' (z) _ — Ki(z) and 


n=l 1 
e = >> 2"/m! + Kn—1! dt(1 -- 1)! 
m=0 0 
it is easy to show that 
n—1 


Ko(z) = DE toj(z)(z/2)”/(i)? + Ton(z) and 


nal 


K,(z) = 1/z — 2/2 2) tj(z)(2/2)//(i)? + Tin(z) where 


v 


Ton(z) aia =2/(r(2n—1)) f ai—oe [ dOe'2o38 
0 


0 


(y + In (2z sin? @)) 
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1 r 
Tin(z) = 22°71/(r(2n+-1) 1) | dti—pwn i doe!zoox8 
0 0 


“(1 + (Qn4+2+21 cos 6)(y+In (2z sin? @)). 


At least four terms in each sum are taken by the test algorithm, 
thus 


Vs, (2/2) | | y + In(2z) | + In4 
NS Se ai [ton(2) | 

vt (2/2)e 
| Tin(z) | < ze tin(z) ar 


(| z |+2n+2)(| y+In(2z) |4+1n4)+1 
2(n-+1) | fon(z)+1/(2s) | 


Evaluation continues in the test program until | fon(z)(z/2)?"(n!)-?/ 
Ko(z) | < 10-7 and | tn(z)(z/2)?*1(n!)-2/Ki(z) | < 1077. Thus, 
defining & (z) and &,(z) to be the absolute values of the relative 
errors in the computation of Ky(z) and Ki(z), we obtain the limits 
&o(z) < 1.115 X 10- and & (z) < 1.278 & 10°. 

The errors in evaluating eq. (3) by the trapezoidal rule have 
been analyzed by Hunter [2]. Expressing the trapezoidal rule as 


[ Fo dt =h E F(O) + yy Fon | — Eth) (4) 


he obtains bounds for £,(z, 4) and £,(z, #4), the errors in Ky(z) and 
Ki(z), respectively. The test algorithm uses 4 = 0.25. For this, 
Hunter’s formulas yield | Eo(z, A) | < 3.047 & 10-8 and | E;(z, A) | 
< 4.008 x 107%. 

By taking 32 terms in the sum in eq. (4), the truncation error 
can be made much smaller than the E,,(z, /1), so that &0(z) < 4.236 x 
10-8 and &)(z) < 5.435 & 10-. (Round-off error is not a problem 
for the test algorithm. The series is not subject to it for | z| < 3.0, 
and all the terms in the sum in eq. (4) have the same sign.) 

Results of testing. The goal was to make the published algorithm 
accurate to a few parts in 10!°. On this scale, the test algorithm 
can be viewed as exact, at least for purposes of computing the 
modulus of the relative errors. Using the test algorithm, the pub- 
lished algorithm was found to be most efficient if the series are 
used for | z| < 4.3; the integrals in eq. (3), evaluated with 15 point 
Gauss-Hermite quadrature, are used for 4.3 < |z| < 14.0; and 
the asymptotic expansions are used otherwise. 

During the check runs to find these points of division, it was 
noticed that the number of terms needed in the series could be 
predicted approximately by two simple functions of | z |. With this 
in mind, the error expression for the asymptotic expansions (eq. 
9.7.2 in [1]) was used to generate a similar function for these ex- 
pansions. By predicting the number of terms needed, instead of 
making convergence tests in the loops that sum the expansions, an 
appreciable reduction in the number of computations can be 
achieved. This amounts to a 30 percent saving, for example, for 
the series expansions. 

The most extensive test runs were done for z = pe’? having 
the values ¢ = 0°(5°)90° and p = 0.1, 0.5(0.5) 120.0. Another 
test run with p = 0.1, 0.5(0.5)30.0 verified that 


K,.(pe'*) = K*,, (pe~'*) 


by checking the values ¢ = —90°(5°)90°. All tests were made 
using double precision arithmetic on an IBM 370/155. They showed 
that 


&0(z) < 3.55 & 10-” and 
&i(z) < 3.93 & 107". 


Finally, it should be noted that the algorithm actually returns 
the values of e*Ky(z) and e*K,(z). For |z| large, | K.(z)| ~ 
e-*(1/2 | z |)! so that such a return expands the range of | z | over 
which this procedure may be used. 
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Algorithm 


SUBROUTINE KZEONE(X, Ys RE@s IMO. RE]. IMI) 
THE VARIABLES X AND Y ARE THE REAL AND IMAGINARY PARTS OF 
THE ARGUMENT OF THE FIRST TWO MODIFIED BESSEL FUNCTIONS 
OF THE SECOND KIND,K@ AND K1.- RE@,IM@.RE] AND IM}! GIVE 
THE REAL AND IMAGINARY PARTS OF EXP(X)*K@ AND EXPC(X)*KI, 
RESPECTIVELY. ALTHOUGH THE REAL NOTATION USED IN THIS 
SUBROUTINE, MAY SEEM INELEGANT WHEN COMPARED WITH THE 
COMPLEX NOTATION THAT FORTRAN ALLOWS, THIS VERSION RUNS 
ABOUT 38 PERCENT FASTER THAN ONE WRITTEN USING COMPLEX 
VARIABLES - 
DOUBLE PRECISION X, Ys X2,s Y2s RE@s IMB, REL, IMI 
* Ri, R2, Tle T2, Pls P2s RTERM. ITERM, EXSQ(8), TSQCE&) 
DATA TSQC¢1) /G@.3DB@/, TSQC2]) £/3-19383633920635D-1/, 
TSQ(3) /1.29875862295915D0/7, TSACA) 
12 +95637445869665D8/, TSQC5) /5-48983159724444D9/, 
TSQ(6) /8-.88407957B805676D0/, TSQAC7) 
/1.34685357432515D1/7, TSO(8) 42-82499163658789DI/, 
EXSQ(1) /@-56410@3067264D6/, EXSQC2) 
/@-41 202 8687T4969D8/s, EXSQ(3) /G.1584889157959DB/, 
EXSQ(4) /8.3807868@03387255D-1/7.5 EXSQ(5) 
4062776068 842913D-2/, EXSQ(6) /86-10060044412325D-3/, 
EXSQ¢(7) /@.-105911S5S5S47711D-5S/7, EXSQ(6) 
/@01522475884254D-6/ 
THE ARRAYS TSQ@ AND EXSQ@ CONTAIN THE SQUARE OF THE 
ABSCISSAS AND THE WEIGHT FACTORS USED IN THE GAUSS- 
HERMITE QUADRATURE. 
R@ = X*X + Y*Y 
IF (X-GT.@.@D@ .OR. R2.-NE-@.-8D@) GO TO 18 
WRITE (6.499999) 
RETURN 
1@ IF (R2.GE.1-96D2) GO TO 58 
IF (R2.GE.!-.849D1) GO TO 38 
C THIS SECTION CALCULATES THE FUNCTIONS USING THE SERIES 
C EXPANSIONS 


aqgaaaaaanaaaa 


HeRRRK HH HHT 


aaa 


X2 = X/2.8D8 

Y2 = Y/2.8D0 

Pil = K2*X2 

Pe = Y2x*ye 

Tl = -(CDLOG(P14+P2)/2.0D8+8.5772156649815329D0) 


C THE CONSTANT IN THE PRECEDING STATEMENT IS EULEP*S 
C CONSTANT 


T2 = -DATAN2 CY, X) 
x2 = Pl - Pe 

Y2 = X*xY2 

RTERM = 1.8D0 
ITERM = @.@D@ 

RE@ = TI 

IM@ = T2 

Ti = Tl + 8.SD@ 
RE! = Tl 

IMl = Te 


P2 = DSQRTCR2) 

L = 2.106D0*P2 + 4.4D0 

IF (P2.LT.8-8D-1) L = 2.129D0*P2 + 4.0D8 
DO 28 N=l.L 


Pl =N 
P2 = N*N 
Ril = RTERM 


RTERM = (Ri*X2-ITERM*Y2)/P2 
ITERM = (RI*YS2+I TERM*X2)/P2 

Tl = Tl + @.SDO/P1 

RE@ = RE@ + Ti*RTERM - T2*I TERM 


IMo IM@ + TI*ITERM + T2*RTERM 
Pl = Pl + 1.0D@ 
Tl = Th + @.5D@/P1 
RE! + (TI*RTERM-Te*ITERM)/P1I 


RE1 = 
IML = IM! + (TI*ITERM*¢T2*RTERM)/P1 
28 CONTINUE 


Rl = X/R2Q - O@-SDO*CX*REL-Y*IMI) 
R2 = -Y/R2 - @-5SDO*CX*IMI+Y*RE] ) 
Pi = DEXP(X) 

RE@ = P1*REO 

IM@ = PI*IM@ 

RE] = P1l*R1 

IM! = Pl]*R2 

RETURN 


C THIS SECTION CALCULATES THE FUNCTIONS USING THE INTEGRAL 
C REPRESENTATION, EQN 3. EVALUATED WITH 15 POINT GAUSS- 
C HERMITE QUADRATURE 


30 X2 = 2.0@D6*xX 
Y2 = 2.0D0*Y 
Rl = Ye*ye2 
Pl = DSQRT(X2*«Xe2+Rl ) 
Pe = DSQRTC(P1+xX2) 
Tl = EXSQ¢1)9/(2.8D@*P1) 
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RE® = T1l*P2 
IM@ = T1/P2 
REI = 8.8D@ 
IM] = @.0D0 


DO: 48 N=2,8 


Te = X2 + TSQ(N) 

Pl = DSORT(Te*T2+R1 ) 
Pe = DSQRTC(P1+T2) 

Til = EXSQ(N)/P1 


RE@ = RE@ + Ti*P2 
IM@ = IM@ + TI/Pe 
Tl = EXSQ(N)*TSQ(N) 
REL = REL + Ti*Pe2 
IM] = IMI + TI/Pe2 
4@ CONTINUE 
T2.=2 -Y2* IMS 
REI = REI/R2 
R2 = Y2*IMI/R2 
RTERM = 1-41421356237309D8*DCOS(Y) 
UTERM = -1.41421356237309D@8*DSINCY) 


aa 


SQRT (2,0). 
IM@6 = RE@*ITERM + T2*RTERM 
RE@® = RE®@*RTERM - T2*ITERM 


_Tl:= REI*RTERM - R2*I TERM 
T2:= REI*ITERM + R2*xRTERM 
REL = TI*X + T2*yY 
IML = -TI#*¥Y + T2*Xx 
RETURN 


o 


DQ 


ASYMPTOTIC EXPANSIONS 


5@ RTZRM = 1.0D0 
ITERM = 8.8DB 
RE@ = 1.@D@ 
‘IM@ = 0.090 
RE] = 1.@D@ 
IMI = 8.8D9 
Pl’ = &.@DO*R2 
P2 = DSQRT(R2) 
L = 3.91D0+8.12D1/P2 
Rl. = 1.8De 
R2:= 1.8D0 
M= -8 
K = 3 
DO: 6@ Nelo 
M=M + & 
K = K - M 
Rl = FLOATCK-4)*R1 


R2 = FLOATCK)*Re 

= FLOAT(N) *P1 
T2 = RTERM 
CT2*"X+ITERM*Y)/TI 
C-+T2*Y+ I TERM*X)/T1 
RE® = REO + RI*RTERM 


a 
4 
™ 
Be) 
<4 
not 


IMG = IM@ + RI*ITERM 
RE] = RE] + RO*RTERM 
IM] = IMI + R2*ITERM 


6@ CONTINUE 


Tl = DSORTCP2+x) 
T2.= -Y/TI 
Pl = 8.86226925452758D-1/P2 


C THIS CONSTANT IS SQRTCPI)/2-0, WITH PI=3.1415966. 


RTERM = Pl*DCOSCY) 


ITERM = -PI*DSINCY) 
Rl = RE@*RTERM - IM@*ITERM 
R2 = RE@*ITERM + IM@*RTERM 
RE@ = T1*R1l - Te*R2 
IM@ = T1*RQ + T2*Rl 


= REL*RTERM - IMI*ITERM 
R2 = REL*ITERM + IML *RTERM 


REF = T1*R1l - T2*R2 
IME = Tl*R2 + Te*Rl 
RETURN 


99999 FORMAT (42H ARGUMENT OF THE BESSEL FUNCTIONS 


* 35H OR LIES IN LEFT HALF COMPLEX PLANE) 
END 


THIS SECTION CALCULATES THE FUNCTIONS USING THE 


THE CONSTANT IN THE PREVIOUS STATEMENTS IS,0F COURSE, 


1S ZES0, 
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Algorithm 485 


Computation of g-Splines via a 
Factorization Method [E2] 


Harold D. Eidson and Larry L. Schumaker [Recd. 19 
Oct. 1972] Department of Mathematics and Center for 
Numerical Analysis, University of Texas, Austin, TX 
78712 


This work was supported in part by AFOSR-69-1812-C. 


Key Words and Phrases: approximation, spline approximation 
CR Categories: 5.13 
Language: Fortran 


Abstract 

Fortran subroutines are presented for the purpose of computing 
and evaluating g-splines interpolating Hermite-Birkoff data. The 
subroutines are based on a factorization method for computing 
g-splines discussed by Munteanu and Schumaker (Math. Comp. 27 
(1973), 317-325). 


Description 

1. Introduction. In the following we present subroutines for 
calculating polynomial spline functions solving Hermite-Birkhoff 
(HB) interpolation problems. The Subrounties are based on ane 
rithms described in [9]. 

We begin by reviewing the definition of an HB. swieesolauen 
problem. Let N > 2 and x1 < x2 < +++ < xy be prescribed. 
Suppose for each/, 1 <j < N, that z; is a positive integer, 7M; < 
IMz,j < +++ < IM.,,; are positive integers, and yi. j, ¥2j, 6-3) ej. 
are prescribed real numbers. The HB-interpolation problem is to 
determine s such that 


sUM IID (>, ) = _ Yi, ys =], 2; eR it = 1, 2 x oe N, (1) 


We see that z; describes the number of derivatives prescribed at x; 


while the vector ([M,,;, ..., 1M:., jj) describes which derivatives. 
fz; =1,/ =1, ; N, we 2 have. a simple interpolation problem: 

We are concerned with solving HB-interpolation problems 
with elie splines. Let M bean integer, M > IM.,.3, J = 


1, 2, , N. Then (cf. [4]) there exists a function s satisfying (1) 
and 
sQ(t) = 0, x <t < xp, J = 4, 2,....NM=— 1; © (2) 


Editors’ note: Algorithm 485 described here is available on magnetic 
tape from the Department of Computer Science, University of 
Colorado, Boulder, CO 80302. The cost for the tape is $16.00 (U.S. 
and Canada) or $18.00 (elsewhere). If the user sends a small tape 
(wr. less that I 1b.) the algorithm will be copied on it and returned to 
him at a charge of $10.00 (U.S. only). All orders are to be prepaid 
with checks payable to ACM Algorithms. The algorithm is recorded 
as one file of BCD 80 character card images‘at 556 B.P.1., even 
parity, on seven track tape. We will supply algorithms at a density of 
800 B.P.1, if requested. Cards for algorithms are sequenced starting 
at 10 and incremented by 10. The sequence number is right justified in 
column 80. Although we will make every attempt to insure that the 
algorithm conforms to the description printed here, we cannot guaran- 
tee it, nor can we guarantee that the algorithm is correct.—L.D.F. 
and A.K.C, : 
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sO) =O 6 <x tf > XN; (3) 
sé CUD (— iy 20); . (4) 
s@M-D (x j++) = meee (5) 
: cM, »MV\ Mis, ©.) IMs, 3} 


eA 2. pl 


The function s is called a g- eee It is a polynomial spline of 
degree 2M — 1; i.e. it is piecewise a polynomial of degree 2M — 1. 
The way in which the pieces tie together is described by (4) and 
(5). 

If the only polynomial of degree MM — 1 which solves the 
homogeneous HB-interpolation problem (i.e. satisfies (1) with zero 
right-hand side) is the identically zero polynomial, then we ‘say 
the HB-problem is M-poised. In this case there is a unique g-spline 
of degree 2M-I solving the HB-problem (1). ‘We consider con- 
structing g-splines only for M-poised HB-problems. 

Given an M-poised HB-interpolation problem, the unique 
g-spline interpolant s satisfying (1)-(5) can be represented as 


Plt), . t = x1 


8) = spit), Merc PS Ap 2, Sy ekg, (6) 
Puywld, i> xn, 
where ee 1.2, , N, p;(t) is a polynomial of the form 
cab = os Cit -- xj)" and (7) 
l=1 
M 
Pwalt) = 2) Cw @ — x). (8) 
1 
For later use we introduce the notation C; = (Ci,;, ..., Cow.;)7. 


Several algorithms were discussed in [9] for computing the coeffi- 
cients{C,,;|7 %. of s. We give a subroutine GSF below which 
implements Method 3 of [9]. We also include a function GVAL for 
evaluating s or its various derivatives (For a sketch of the organiza- 
tion of these algorithms, see Section 2 below.) 

2. Organization of the algorithms. GSF consists of: (i) a forward 
march during which certain matrices Ux, Vx, and Ax are set up for 
K = 2,3, ..., N — 1; (it) the solution of a 2M-system for Cy; and 
(iii) a backward march in which the Cy_1, Cy—2, ..., C; are com- 
puted recursively. This proceeds as follows. With appropriate 2M x 
2M Taylor matrices TBx and Zx X 2M matrices INTCON«, the 
interpolating conditions (1) at X(K) can be written as INTCON x 
TBxeviCxi = Br. Similarly with 2M@ — ZR X 2M matrices 
SMOCON « the smoothing conditions (5) at X(K) can be written 
as SMOCONRCK = SMOCON,TB xyiCr4. Finally, the end 
conditions (3) at X(1) and X(N) can be written as ENDCON, 


TB,C, = D,; and ENDCONwCw = Dy. To compute Us, V2, and 
Ay the matrix 
SMOCON, —SMOCON, A 07 
INTCON,TB, 0 co Mises: igh 
ENDCON,TB, 0 Sate D: 


is triangularized by TRISYS to the form 


U, V2 Cr Ay 
G | ee 7 |: 
To get Us, V,, A; we triangularize 
SMOCON; © —SMOCON; G 0 


INTCON, TB: 0 =| B 
WTB, 0 D, 
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to the form 


U; V3 C3 A; 
0 WsJLTBC.| LDs 
Continuing yields Ux, Vx, Ax for K = 2,3, ..., N — 1. Then the 


system 
| INTCONNATBy-1 By- | 
WyiTBn-1 Dy-t 
Cy |= 
INTCONN By 
ENDCONw Dy 


is solved for Cy using TRISYS and back substitution. In the back- 
ward march Cy-1, Cy-2, ..., C2 are obtained successively from the 
stored arrays Ux, Vx, Ax via the recursion UrtCx = —VxTBr: 
Cry + Ax, Where TB, is another 2M-Taylor matrix. Since Ux 
is upper triangular, to determine Cx we performa matrix multiplica- 
tion and a back substitution. Finally we set the first 47 components 
of C, equal to the first Mf components of T7B:C2, and the last M 
components to zero. 

The organization of GVAL is very simple. First a simple search 
is performed to determine the integer KNOT such that Xxnor-1 < 
T < Xxwor. Then Horner’s scheme is used to evaluate the (/D-1)-th 
derivative of the polynomial Pxwor. 

3. Numerical Experience. Table 1 below shows the results of 
using GSF to compute a cubic spline interpolating simple data and 
of using GVAL to evaluate it (and its derivatives) at various points. 
The table ‘should be of use in verifying that the subroutines are 
operating correctly on the reader’s machine. The data in Table | 
is taken from Greville [3, p. 20]. 

Table II below shows the results of using GSF and GVAL on 
simple, Hermite, and Hermite-Birkhoff interpolation problems. 
For comparison, we give the maximum interpolation error, 

Wax . max. [Ya so Cy |, 
1<j<n l<i<s; 
the root mean square error 


N23 N 2 
(= Yep Pre NO a: :i) 
j=. i=l j= 

and the relative central processing times for each interpolation 
problem, : 

Tables I and II were computed on the CDC 6600 at The Uni- 
versity of Texas, Austin. In addition to these examples, we tested 
the subroutines on a wide variety of simple, Hermite, and HB 
interpolation problems for 1 <M < 10,2 < N < 100. We tested 
data from standard functions as well as random data with equally 
spaced and unequally spaced knots with knot mesh ratios (¢ = 
max (x;4. — x;)/min (x;41 — x,;)) up too = 104. The results were 
comparable in accuracy with the procedures in (8] for computing 
simple interpolating splines and the subroutines in |2] for computing 
g-splines. For small M(M = 2,3) GSF and GVAL are as fast or 
faster than these other algorithms; for larger M the reverse is usually 
true. 

4. Discussion. The subroutines presented below can be applied 
to compute g-splines interpolating HB-data whenever the HB- 
interpolation problem is M-poised. The question of when an HB 
problem is M-poised is a difficult one, and has been the subject of 
intensive research recently. For a survey of results, see Karlin/ 
Karon [6]. An obvious necessary condition for M-poisedness is 
that >>“, z; > M. For Hermite interpolation problems ((M,,; = 
Ve ses IM; ; ; =z; — 1), thisisalsosufficient. For simple interpola- 
tion (z, = 1, all /), this reducesto N > M. For nonpoised HB- 
interpolation problems, the subroutines may or may not produce 
g-Splines interpolating the data. Thus the algorithm cannot be used 
as a test for M-poisedness. 

There are a large variety of algorithms in the literature for com- 
puting splines interpolating simple data (cf. {7, 10] and references 
therein). In this special case the subroutines given here can be 
simplified (see Eidson [1]). There are few practical schemes for 
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Table I. Cubic Spline Interpolating Simple Data 
DATA 


(M = 2,N = 9) Values of the spline 
Xj y; t s(t) s'(t) s” (ft) 
266.8 1250 273.16 1346.2 15.076 — .02575 
283.5 1500 303.16 1782.9 14.614 .03996 
300.9 1750 323.16 2073.1 13.992 — .06902 
318.0 2000 373.16 2706.4 11.665 — .03218 
355.9 2500 423.16 3254.7 10.357 — .02148 
399.2 3000 473.16 3749.0 9.481 — .01359 
500. | 4000 523.16 4209.3 8.998 — .00574 
555.7 4500 573.16 4655.2 8.884 — .00047 
612.0 5000 
Vable If. G-spline Interpolants for Various Types of Data 
Input data Results 
Type of N xX; z, IM,, Y,, M |Maximum RMS Time of 
data A= TIN Jf = LON Ff = I(I)DN inter- Inter- compu- 
polation polation tation 
error error” (sec) 
| ma 2 17.1(-15) = 2.7(-15) 20) 
Simple 0 H/22 1 lial sinx,,f = 1 3 47.1(-45)  4.5(-15) 608 
4 .t(-14) 6.0(—15) 1.198 
— eet sing d= 1 20-14) 66-15) 040 
Hermite 10 exp(//5) 2 ,i=2 cos xj,f=2 3 jl. -14)  5.4(-15) 095 
4 {l.4(-— 14) 1.4(—09) . 200 
Homie; os ek. pe itis ok ee: 
Birkholf 40 i110 3 3,4 = 2 e”, 5 |8.8(-09) 1.4(—09) 1.543 
$6 S35 9 15123 


computing g-splines (see {2, 5] and references therein). The only 
other subroutines we know of for g-splines are those in |2] based on 
local support bases. The algorithms underlying the subroutines 
given here are valid also for Ly-splines, see |9], and for EHB-data 
(see [4, 9]). We hope to prepare subroutines for the more general 
case. 

Acknowledgments. We wish to thank the referees for. their 
extremely thorough consideration of our paper, and for several 
helpful suggestions. 
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SUBROUTINE GSFC(Ns Ms Xs Ys Zs IM, Cs IDET) 
INPUT NoMsX,Z,°IM,Y-- 
N IS A POSITIVE INTEGER GIVING THE NUMBER OF KNOTS 
M IS A POSITIVE INTEGER DETERMINING THE DEGREE 
2*M-1 OF THE SPLINE 
X 1S AN ARRAY OF REAL NUMBERS WITH 
XCLILT XC] LT eo oe oe LT eo X(N) 
Z 1S AN ARRAY OF INTEGERS SUCH THAT 
Oe LT-ZCI) «LEM, Ll s2,0000N : 
IM IS AN INTEGER ARRAY WITH 
LeLE eMC lsd) oLTo ee ee LT IMC 26S) 5U)e LEM, 
J21,25e0+e.Ne Y IS AN ARRAY OF REAL NUMBERS 
OUTPUT C.,IDET-- 
THE COLUMN VECTOR CC1lsJ),0+6+e2C(2*MsJ) CONTAINS THE 
COEFFICIENTS OF THE SPLINE IN THE INTERVAL X(J-1) 
TO XCdu). IDET IS SET TO ZERO IF A SING!IILAR SYSTEM 
1S ENCOUNTERED OTHERWISE, IDET IS 1. 
THE SUBROUTINE GSF(NsMsXsYsZs,I1M,Cs IDET) COMPUTES 
THE COEFFICIENTS OF THE INTERPOLATING G-SPLINE. THE 
PARAMETERS N»MsX,Y,2, AND IM ARE INFUT. N AND M 
ARE THE POSITIVE INTEGERS OF SECTION 1! WHICH GIVE 
THE NUMBER OF X S AND DETERMINE THE DEGREE OF THE 
SPLINE, RESPECTIVELY. X MUST BE AN ARRAY OF REAL 
NUMBERS WITH XC1)eLT.XC2)-LTeeeeeLTeX(N) AND Z I5 
AN ARRAY OF POSITIVE INTEGERS NONE OF WHICH SHOULD 
EXCEED M. X CONTAINS THE POINTS WHERE HB-DATA IS 
PRESCRIBED AND Z DESCRIBES THE NUMBER OF PIECES OF 
DATA AT EACH SUCH POINT. IM IS AN INTEGER ARRAY 
WITH 
I6LEcIMC 1s J) eLT« IMC2 eS) + LT ee oe LTO IMCZ0J),L)eLEWM, 
Jzls2QsreeesNe 
THE J TH COLUMN OF IM IS A LIST OF WHICH 
DERIVATIVES (SHIFTED UP BY 1) ARE SPECIFIED AT 
XCJ)» THE DATA FOR THE HB-INTERPOLATION PROBLEM 15S 
ENTERED IN THE ARRAY Ye YCIs4J) SHOULD CONTAIN THE 
VALUE ASSIGNED TO THE IMCI,JU)-1 ST DERIVATIVE OF 
THE SPLINE EVALUATED AT X(J)+ THE PARAMETERS C 
AND IDET ARE OUTPUT OF GSF. AFTER EXECUTION, THE 
ARRAY C WILL CONTAIN THE COEFFICIENTS OF THE 
SPLINE. IN PARTICULAR, THE COLUMN VECTOR 
COlsJoe004002eM,J) CONTAINS THE COEFFICIENTS OF 
THE POLYNOMIALS P(J) DESCRIBED BY EQUATIONS (6), 
J=ls2se0+6.N+]. SUBROUTINE GSF CALLS ON SUBROUTINE 
INTCON,SMOCON., AND TRISYS WHICH MUST BE LOADED WITH 
THE MAIN PROGRAM. 
INTEGER Z, ZK 
DIMENSION X(188)4 YC4,188@), 26198), 1MC4,180), 
* CC8,100) : 
DIMENSION D¢(12,17), UVC(6.17,1808) 
DOUBLE PRECISION SUM 
INITIALIZE CONSTANTS 


IDET = 1 
M2 = 2M 
M2Pl = M2 + 1 
MeML = M2 - 1 
M4 = 4M 
M4P1 = M4 + 1 


NMI = N - 1 
GENERATE FACTORIALS FOR TAYLOR MATRIX 
C(lsl) = 1.8 
DO 18 J=2.M 
JMI = J = 1 
CClsJ) = FLOATCUMI) *CC1L.JUMI) 
18 CONTINUE 
BEGIN FORWARD MARCH 
ZK = ZC¢1) 
MMZ = M + ZK 
MeMZ = M2 - ZK 
SET UP INTERPOLATION MATRIX AT X(1) 
CALL INTCON(1, ZK, M2, IM, D) 
SET END CONDITIONS AT X(1) 
IF (MMZ»NE-@) CALL SMOCON(-!I,. ZK, M, M2, MAPIs 
* IMs D) 
BEGIN K LOOP 
DO 258 K=2,N 


KM! = K = I 
LZK = ZK 
ZK = Z¢K) 


LMMZ = MMZ 
MMZ = M - ZK 
MeMZ = M2 - 2K 
M3MZ = M2MZ + M 
KH = XCKMI)D = XCK) 
TAYLOR MATRIX RIGHT MULTIPLICATION 
TROW = MeMZz 
IF (Ke-EQ.N) IROW = M 
DO 20 I=1.4M 
MU = IROW + I 
DC(MU,1) = DCI,1) 
2@ CONTINUE 
DClsM2P1) = 1.0 
DO 98 1=2,M2 
IM! = 1 - 1 
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DO 48 J=1,IMl 
DO 38 II=1.4M 
DCII,J) = DCIIs4J)*H 


3¢ CONTINUE 
4 CONTINUE 
DCL,MeP1) = 1.8 


IF (2.GT.-IM1) GO TO 68 
T = D¢lLsM2P1) 
DO 53 11=2,1M1 
Vos DCIILM2P1) + T 
T = DCIILM2P1) 
DCIIl,MePl) = V 


5@ CONTINUE 
68 DO 8@ J=1.™ 
SUM = 8.8 


DO 78 I1=1,1 
SUM = SUM + D(J,ITI)*DCII,M2P1) 
76 CONTINUE 
MU = IROW + J 
DCMU,.I) = SUM 
8 CONTINUE 
98 CONTINUE 
ON LAST STEP JUMP TO SET INTERPOLATION CONDITIONS 
IF (KeEQ.N) GO TO 240 
SET UP SMOOTHING MATRIX AT XCK) 
CALL SMOCON(K, ZK,» Ms M2, M4Pls IMs Dd 
DO 118 I=!.M 
DO 100 J=M2P1,M4 
MU = M2MZ + I 
D(MUsJ) = 8G 
162 CONTINUE 
118 CONTINUE 
ADJUST RHS OF SYSTEM TO CORRESPOND VITHK DIFFERENT 
ZCK) 
IF (LMMZ.EQ.@) GO TO 162 
IF (LZK-ZK) 138, 130, 120 
128 Il = M2 + LMMZ + 1 
JJ = -1 


III] = M3MZ + 1 
GO TO 148 

138 Ir = M2 
JJ = +1 


Ill = M2MZ + LZK 
148 DO 15@ 1=1,LMMZ 
MU = IIL + Ixdd 
NU = II + I*JJ 
D(MUsM4P1) = DCNU.M4PI) 
156 CONTINUE 
FILL IN INTERPOLATION DATA 
168 DO 170 I=1,L2K 
J = IMCI.KMI1) 
MU = M2MZ + 1 
DCMULM4P1) = YCI,ZKMIDI/CC lsd) 
178 CONTINUE 
TRIANGULARIZE SYSTEM AT Z(K) 
CALL TRISYS(D, M4P1, M3MZ» M2, IDET) 
IF CIDET) 198, 188, 198 
18d RETURN 
FILL UV MATRIX 
196 DO 218 L=!1.,Me2 
DO 286@ J=1,M4P1 
UVCI,J+K) = DCI.J) 
200 CONT INUE 
218 CONTINUE 
COUPLE M-ZC(K) ROWS WITH INTERP CONDITIONS AT NEXT 
STEP 
IF (MMZ.EQ.@) GO TO 248 
DO 23@ 1=1.MMZ 
DO 226 J=1,M2 
LAMDA = ZK + I 
MU = Me + 1 
NU = M2 + J 
DCLAMDA,J) = DCMU,NU) 
228 CONTINUE 
230 CONTINUE 
SET UP INTERPOLATION MATRIX AT XC(K) 
248 CALL INTCON(Ks ZK, M2, IMs D) 
258 CONTINUE 
END OF K LOOP 
SET END CONDITIONS AT X(N) 
IF (MMZ.NE.2) CALL SMOCON(-N, ZK, Ms M2, M2PI1, 
* IM, D) 
FILL IN INTERPOLATION DATA AT X(N-1) 
DO 26@ I=1,LZ2K 
J = IMCILNMI) 
MU = M+HtI 
DCMU,M2PI> = YCILNMI)O/C C1) 
26@ CONTINUE 
ADJUST RHS TO CORRESPOND WITH Z(N) DATA 
IF (LMMZ.EQ-@) GO TO 283 
DO 270 1=1,LMMZ 
MU = M + LZK + I 
NU = M2 + I 
DCMULM2P1) = DCNULM4P1) 
278 CONTINUE 
FILL INTERPOLATION DATA AT X(I}) 
288 DO 298 1=1,2K 
J = IMCI.LN) 
DCI,LM2P1) = YCILNI/CC1lsJ) 
29@ CONTINUE 
TRIANGULARIZE MATRIX SYSTEM AT X(N) 


0 
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CALL TRISYS(D, 


MePls, M2, M25 
CIDET.EQ@.8) RETURN 


IDET) 


C BACK SOLVE FOR CCN) 


388 
312 


320 
Cc END 


M221 


DO 328 II=1,M2 
IP] = I 


is >=] 


SUM = 8.8 
[IF CIP1.GT.M2) 
DO 300 JsIPl.Me2 


GO TO 319 


SUM = SUM + DCIl.J)*CCUsN) 


CONTINUE 


= *SUM + DCILM2P1) 


CCILN) = V/DCILZI) 
CONTINUE 
FORWARD MARCH 


C BEGIN BACKWARD MARCH 


'N 


C BEGIN KB LOOP 


DO 430 KB=2.N 
KP! = K 


= K - 1 


ZK = 2¢K) 


°= XCK) - XCKPI) 


C TAYLOR MATRIX LEFT MULTIPLICATION 


333 


343 


358 
362 
372 


C IF K 


IF (KB.E@.N) 
DO 398 I=1.M2 


DO 330 I=1.M2 

D¢l,I) = 1.8 
CONTINUE 
DéM2,M4>1) = CCM2.KP1) 
DO 373 1=1,M2Ml1 

‘iPh= +1 


T = CCM2,KP1)*D(1.M2) 
‘M2MI = M2 - I 
DO 348 I1=1,M2M1 
J = M2 - YI 
T = T¥H + C(JsKPI)*DC1,d) 
CONTINUE 
DCIsM4?1) = T 
T = 1-@ ‘ 


IF ¢€IP1.GT.M2M1) GO TO 366 


DO 35@ IIT=IP1sMeM1 
Dct.l) = DClsLtId 
DCl,I1) = T 
T = D¢ts1l) + DCls II) 

:CONT INUE 


D¢lsMe) = T 


CONTINUE 


f JUMP ONT TO DETERMINE C(1) 
GO TO 44@ 


C SET UP RHS OF SYSTEM FOR CCK) 


‘SUM = @. 
DO 360 J=1.,M2 
MU = M2 + J 
SUM = SUM + UVCILMU,LK)*DCJsM4P1) 


362 - CONTINUE 
UVCIZM4P1,K) = -SUM * UVCILM4PIL.K) 
393 CONTINUE 
C BACK SOLVE FO8 COEFFS CCK) USING TRIANGULA® PART OF 
C UVCK) 
= M2P1 
DO 420 Il=1.M2 
PIPL =] 
l=I-t 
SUM = 8.0 
IF CIP1L.GT.M2) GO TO 418 
DO 488 J=IPtsM2 
SUM = SUM * UVCIs ds K) *o Cds K) 
400 CONTINUE 
410 Vos =SUM + UVCIJM4P1,K) 
CCLJK) = V/UVCI41,K) 
420 CONTINUE 
430 CONTINUE 
C END K3 Loo? 
C SET COEFFICIENTS CC1) 
44@ DO 4523 1=1.M 
MYo=s M+ i 
C(MULK) = 3.2 
CCI,K) = DCL,M4P1) 
45@ CONTINUE 
C END BACKWARD MARCH 
RETURN 
SUBROUTINE INTCON(K, ZK, M2, IM, D) 
C FILLS INTERPOLATION MATRIX AT X¢K) NISING 
C INFORMATION OBTAINED FROM ARRAYS Z(K) AND IMCI,K) 


12 


20 


SUBROUTINE SMOCON(KK, ZK» Ms M2, 


INTEGER ZK 

DIMENSION D¢te2,!7), 
DO 2@ 
© DO 10 J=1.M2 


IMC4.1008) 
T=1,2K 


DCI,J) = 8.8 


CONTINUE 

TI] = IMCI.K) 

D«I,I1) = 1.8 
CONTINUE 
RETURN 


ICOL, IM, D) 


C FILLS SMOOTHING MATRIX AT KNOTS 2 THROUGH N-1 
C AND THE END CONDITIONS AT K = 1.,N 


INTEGER ZK 
DIMENSION D(12,17), 


IMC4, 188) 
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C IF KK IS NEGATIVE THEN SET END CONDITIONS 


K = IABSCKK) 
IF CKK«LT.®) GO TO 14 
C SMOOTHING FIRST M DERIVAT 
DO 28 I=15M 
DO 18 J=!,Me2 
DUM = @.@ 
IF (1.EQ.J) DUM = 
D¢I,d) = DUM 
10 CONTINUE 
28 CONTINUE 
IROW = M 
IDUP = 1 


a 
IVES 


1.8 


C SMOOTHING HIGHER DERIVATIVES 


30 IF (€ZK.GEe«M) GO TO 8@ 


J 
I K 
40 IF CIMCI,LK)-J) 60, 58 
52 J 
1 
IF (I.LT.1) I = 1 
IF (J) 68, 88, 48 
6@ IROW = IROW + 1? 
DO 78 IIl=1,M2 
DCIROW,II) = 8-6 
78 CONTINUE 


JzJs-il 
MU = M2 - J 
DCIROWsMU) = 1.8 


IF (J) 68, 88, 40 
66 GO TO (98, 120), 
92 M2MZ = M2 - 2K 
DO 118 I=1,M2mMz 
DCI,ZICOL) = 8.8 
DO 100 J=lsMe2 
MU = Me + J 
DCI4MU) = -DCI,Jd) 
102 CONTINUE 
118 CONTINUE 
RETURN 
120 MMZ = M - ZK 
DO 138 I=i,MMZ 
MU = MM + I 
DC(MU.ZICOL) = 6.8 
138 CONTINUE 
RETURN 
C SET END CONDITIONS 
148 IROW = ZK 
IDUP = 2 
MM = ZK 
IF (K«EQ.1) MM = M2 
GO TO 38 
END 
SUBROUTINE TRISYS(D, 


IDUP 


2 68 


Ns Lo 


M2, 


IDET) 


C TRIANGULARIZATION OF NON-SQUARE MATRIX USING LU 


C DECOMPOSITION WITH PIVOTI 
DIMENSION D(12,17) 
DOUBLE PRECISION SUM 


NG 


TO 20 


SUM = SUM + DCI,d)*DCUsK) 


IDET = 1 
DO 158 K=1,Me2 
KPI] = K + I 
KMI = K - 1 
PIVOT = 0.8 
DO 4@ 1=KsL 
IF CKMIL-EQ-@) GO 
SUM = 8.8 
DO 18 J=l.,KMI 
1a CONTINUE 
DCI,K) = -SUM + D 
28 T = ABSC(DC(I.K)) 
IF (T-PIVOT) 46, 
308 PIVOT = T 
IPIV = 1 


40 CONTINUE 
IF (PIVOT) 60, 56, 
5@ IDET = @ 
RETURN 
66 IF CIPIV-K) 70, 99, 
78 DO &@ J=1,4N 
T = D(KsJ) 


C1.K) 


40, 30 


68 


78 


D(KsJ) = DCIPIV.J) 


DCIPIVsJ) = T 
88 CONTINUE 
98 T = DCKsK) 


IF CKPI-L) 108, 100 
100 DO 118 I=KPILL 
DCI,K) = DCIZK)/T 


110 CONTINUE 
120 IF CKMI-EQe8 -OR. 
DO 148 J=KPI.N 
SUM = 8.8 
DO 138 I=1,KMI 


2 128 


KP1.GT-N) 


GO TO 


SUM = SUM + DCK,1)*DCI.J) 


130 CONT INUE 
D¢(K,J) = -SUM + D 

148 CONTINUE 
15@ CONTINUE 

LAST = L - M2 

IF (LAST-EQ@-8) GO TO 

K = M2 

MePl = M2 + 1 


(KJ) 


19@ 


15@ 


0 
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agnganann 


aa 


DO 188 I=1,LAST 


K = K + 1 
DO 1780 J=M2P1.,N 
SUM = 8.8 


DO 168 I1=!,M2 


SUM = SUM + DCK, ITI) *DCII,u) 


168 CONTINUE 
D(KsJ) = -SUM + D(Ksd) 

178 CONTINUE 
186 CONTINUE 
198 RETURN 

END 

FUNCTION GVAL(T. ID, Ne Mp 
INPUT Ts IDsNoMsXsC 


THE PARAMETERS N.sM,XsC ARE AS IN GSF 
COMPLETELY DESCRIBE THE G-SPLINE. 

T IS A REAL NUMBER AND ID A POSITIVE 
GVAL PRODUCES THE ID-1 ST DERIVATIVE 


Xs 


AND 


INTEGER. 
OF THE SPLINE 


AT Te GVAL AUTOMATICALLY PRODUCES @ IF ID.GT.M*2 


DIMENSION XC1@G), CCG6s188), SCB) 


IORD = 24M 
IF (ID-GT-IORD) GO TO 138 


BINARY SEARCH FOR KNOT SUCH THAT 


XCKNOT-1)+LT.TCKNOT) 
KNOT = 1 
IF (T-XCKNOT)) 78, 78, 18 
16 KNOT = N : 
IF (T-XCKNOT)) 20, 60, 60 
26 KUP = N 
KLO = 1 
30 IF (CKUP-KLO).EQ-1) GO TO 
KNOT = (KUP#KLO)/2 
IF (T-XCKNOT)) 58, 78, 4@ 
4@ KLO = KNOT 
KNOT = KUP 
GO TO 3@ 
50 KUP = KNOT 
GO TO 3@ 
EVALUATION OF THE SPLINE 
6@ IORD = M : 
IF (ID-GT.-IORD) GO TO 138 
70 Y = T - XCKNOT) 
IORD!I = IORD + } 
SET UP SPLINE COEFFICIENTS 
DO 68 I=l.IORD 
MU = IORDI - 1 
SCI) = CCMU.KNOT) 
6@ CONTINUE 
HORNERS SCHEME 
DO 18@ K=t,ID 
IORD = IORDI = K 
DO 98 1=2,I10R8D 
SQ) = SCI-1)*"Y + SCT) 
92 CONTINUE 
18@ CONTINUE 


FACT = 1.8 
IF CID.-EQ.1) GO TO 128 
IDM! = ID - 1 


DO 11@ f=1,1DM) 
FACT = FACT*FLOAT(CI) 

118 CONTINUE 
126 GVAL = SCIORD)*FACT 

RETURN 
13@ GVAL = 8.8 

RETURN 

END 


76 
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Algorithm 486 


Numerical Inversion of 
Laplace Transform [D5] 


Francoise Veillon [Recd. 26 Sept. 1972] 
Mathematiques Appliquees Informatique, Universite de 
Grenoble, B.P. 53, Cedex 53, 38 Grenoble-Gare, France 


Key Words and Phrases: Laplace transform inversion 
CR Categories: 5.16 
Language: Algol 


Description 

This work forms part of a thesis presented in Grenoble in 
March 1972. Improvements made to the Dubner and Abate algo- 
rithm for numerical inversion of the Laplace transform [1] have led 
to results which compare favorably with theirs and those of Bell- 
mann [2], and Stehfest [3]. The Dubner method leads to the ap- 
proximation formula: 


f(t) = 2et'/T[}Re{F(a) } 
+ >> Re{F(a + ikn/T)}cos(krt/T)], (1) 
k=1- 


where F(s) is the Laplace transform of /(‘) and a is positive and 
greater than the real parts of the singularities of f(r). 

Definition of the calling parameters. Assume that f(t) is a func- 
tion which has real values and that F(s) is its Laplace transform. The 
procedure /aplaceinverse calculates, for a programmer-chosen set of 
values of r, the corresponding values of f(t). The parameters are as 
follows: 

rf1fis a real procedure with two parameters which are, respec- 
tively, real part of s and imaginary part of s. Its value is the real part 
of F(s). 

ntf is the number of values of ¢ for which we want to calculate 
f(t). 

tf is a one-dimensional array, the bounds of which are 1 and 
ntf. It contains the values of t. 

naf is the number of values taken by the parameter a (see eq. 
(1)). In the following examples, naf is equal to 5. 

af is a one-dimensional array, the bounds of which are 0 and 
naf — 1. At the time of the call this array must contain the values of 
a. In the following examples, these values are, in order: 1.15, 1.20, 
1.25, 1.30, 1.35. These values have been experimentally chosen as the 
best over the whole set of functions that have been calculated (ap- 
proximately 30, as different as possible), but they are not the best for 
each particular function. 

iterf is 1/8 of the number of terms considered in the infinite 
sum of the approximation formula, eq. (1). In the example, iterf is 
equal to 8. 

resultatf is a one-dimensional array, the bounds of which are 1 
and ntf. At the end of the procedure it contains the tf values of f(1). 

ecri is a procedure with one real parameter (time). It must print 
the value of the parameter, an error message (see later) and be 
written with local conventions. 
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A few examples of functions which have been calculated by 
means of this procedure, and then compared with other methods are 
given in Tables I and II. 

Outline of the method. The program first evaluates f(t) using eq. 
(1) for naf values of a. The sum in eq. (1) is evaluated in iterf groups 
of eight terms by the e-algorithm (procedure epsalgor) which cor- 
responds to an iteration of the Aitken A? process. This accelerates 
the convergence of the sum. The grouping of terms by eight results in 
either using fewer calculations for the same results or, for the same 
volume of calculations, using more terms in the sequence of the 
partial sum, and consequently obtaining better precision. It also 
smooths this sequence. If iterfis equal to eight, this leads to the use 
of 64 terms in the sum. That is satisfactory to proceed with the e-al- 
gorithm. 

If naf is different from one (and greater than three, otherwise the 
spline approximation is meaningless), then the program fits to the 
naf estimates of f(1) a cubic spline S(a) whose second derivatives 
vanish at the endpoints. The spline representation employs second 
derivatives, and the system of linear equations satisfied by these 
derivatives is solved using the double-sweep method. 

We want an a such that*(r) is the least dependent on a. The pro- 
gram then attempts to find an a for which S’(a) = 0. If no such a 
exists, then the program attempts to find one for which S’(a) is 


Table I. 

Racha coy f(t) 

IO = Tig ft) s(n) Laplace- 
t exact Stehfest Dubner* inverse 
1 0.56419 0.56555 0.73172 0.56419 
2 0.39894 0.39912 0.40035 0.39894 
3 0.32574 0.32655 0.26343 0.32573 
4 0.28209 0.28278 0.28286 0.28209 
5 0.25231 0.25174 0.29365 0.25231 
6 0.23033 0.22989 0.22501 0.23033 
7 0.21324 0.21322 0.18062 0.21324 
8 0.19947 0.19956 0.20112 0.19947 
9 0.18806 0.18814 0.21609 0.18806 
10 0.17841 0.17796 0.17650 0.17841 


* The Dubner method has been performed with aT = 10 and 
500 terms for the sum. 


Table IT. 
f(t) 

S(t) = e-#/2 f(y) Laplace- 
t exact Bellmann inverse 
4.140186 0.126174 0.120527 0.126174 
2.501126 0.286329 0.288195 0.286329 
1.643438 0.439675 0.439084 0.439675 
1.085084 0.581269 0.581308 0.581269 
0.693147 0.707107 0.707318 0.707107 
0.412298 0.813712 0.813401 0.813712 
0.214821 0.898157 0.898482 0.898158 
0.085541 0.958131 0.957847 0.958135 
0.016048 0.991008 0.992205 0.992015 
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minimum. Using this a, the program evaluates f(r) from eq. (1) (un- 
less the chosen a is among the original set of values of a) to obtain 
the final approximation. As it is not possible to calculate the best a 
for an unknown function, the values of a have been experimentally 
chosen so as to give the best global result over a set of about thirty 
known functions, as different as possible. 

Although it is very rare, a zero divide may occur in procedure 
epsalgor because of the division between two terms which may be- 
come equal. Then the program calls the procedure ecri and jumps 
to the next value of +. The value of f(t), which has not been evaluated 
because of this, will be zero. 

It must be said that the algorithm can be applied only to func- 
tions whose inverses are expected to be reasonably smooth. 

Implementation. This program has been run on an IBM/360 
computer, using compiler F under Operating System, version 18.6. 
The computing time per f-value, irrespective of the time needed to 
evaluate Re(F(s)), is 0.7 sec. The number of calls of procedure rflf 
is less than or equal to ntf(naf — 1) (8 X iterf + 1). The object 
module size is about 15K bytes. The effective memory occupied dur- 
ing the execution step is 66K bytes. 
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Algorithm 
procedure /aplaceinverse (rf lf,tf\ntfaf,naf,iterf,resultatf,ecri); — 
real procedure rf1/; real array ¢/,a/,resultatf; procedure ecri; 
integer iterf,ntf,naf; 
begin 
procedure epsalgor (eps, neps, resuleps, teps) ; 
array eps, resuleps; integer neps; real teps; boolean bool; 
begin 
array epstamp[1: neps — 1]; integer i, j,k; 
for i:= 1 step 1 until neps + 2 do resuleps{i] := 0.0; 
for i := 1 step 1 until neps — 1do 
begin 
ifeps{i + 1] = eps{i] then 
begin ecri(teps) ; bool := true; go to fin; end; 
epstamp{i] := 1.0/(eps{i + 1] — eps{i)) 


end; 
resuleps{1] := eps|neps}; 
:= 2: 
for 7 := neps — 2step —1 until 1 do 
begin 
for i := 1 step 1 until j do 
begin 
eps|i] := epstamp{i); 
if epstamp{i + 1] = epstamp{i] then 
begin ecri(tps); bool := true; go to fin; end; 
epsiamp{i] := eps[i+1] + 1.0/(epstamp{i+1] — 
epstamp|i)); 
end; 
if (k+2) X 2 = k then resuleps[(k-+2)+1] := epstamp{/]; 
k:=k+1 
end; 
fin: 


end epsalgor; 

procedure /apliny (rflg,tg, iterg,ag,resultatg); 
real procedure rf1g; real tg,resultatg,ag; 
integer iferg; 

begin 
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real somme; integer i,/; 
real array ftab[0:8 Xiterg], ep[1 : iterg), resulep{\ : iterg+2}; 
for i := Ostep I until8 xX itergdo 
ftab{i] := rflg(ag,iX3.1415926536/(8.0X tg)); 
somme := 0.0; 
for i := 1 step 1 until ‘terg do 
begin 
for j := 1 step 1 until 8 do 
somme := somme + ftabli +8 X G-1)] & cos(i + 8 
(i—1)) X 3.1415926536/8.0); 
epi] := somme 
end; 
epsalgor (ep, iterg,resulep,tg) ; 
if — bool then 
resultatg := 2.0 * exp(agXtg)/(8.0Xtg) X* (resulep 
[iferg+2] + 0.5 X ftab[0}); 


end /aplinv; 

procedure coefsplinetrois (n,x,y,m) ; 
value n,x,y; integer 1; array x,y,m; 
begin 


integer i; array d[1 :1—1]; real a,b,c,e; 
for i := n—1 step —1 until 1 do 


begin 
a:=xfit+1] — x[f];6:= x[i]) — x[i-—1); 
e:= ylitl] — yfi];e:= yl] — yli-1); 


ifi = n — 1 then 
begin 
di] := (x{i+1] — x[i—1))/3.0; m[f]:= c/a — e/b 
end 
else 
begin 
d{i} := (12Xd+1) X (x«[i+1]—x[i-—1]) —aXa)/36X 
dli+1)); 
mli]:= c/a — e/6 — a X mf{i+1]/(6.0 X dfit+1)) 
end 
end; 
m[0] := m[n] := 0.0; 
for i:= 1 step 1 until — Ido 
ifi = 1 then m{i] := mi[i]/d{i] 
else 
m[i] := (6Xm{i]— (x[i]—x[i—1]) Xmfi-—1)/(6 X d[i)) 


end coefsplinetrois; 

boolean bool, boo/1, bool2; 

real de/ta, al, 61, c1, zero, x1, x2, dzero, v, u; 
integer i, /; 

real array x, m, z[O:naf—1); 

real array y(O:naf]; 

for i := 1 step 1 until ntfdo 

begin 


bool := false; resultatf[i] := 0.0; 
for 7 := O step 1 until naf — 1 do 
begin 
xf) = aff//ita); 
laplinv (rf Vf, tf(i], iterf, x[/), yLi); 
if — bool then resultatf(i] := y{j] 
else 
go to e; 
end; 
if naf ~ 1 then 
begin 
coefsplinetrois (naf—1, x, y, m);u:= 0.0; 
for j := O step 1 until naf—2 do 
begin 
al := (m[j+1) — mf[j))/6.0/(x+1] — x); 
b1.:= (m{Uj] — 6.0 X al X x{[j])/2.0; 
cl := (yt+1) — yi)/GU+1) — xl) - al x 
(x[7] X x] + xi+1] & x[j+1] + xfj] x 
x[i+1)}) — 41 X (L/] + xf+1)); 
delta:= b1 X b1 — 3.0 X al X cl; 
booll := false; bool2 := false; 


0 


x 
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if delta > 0.0 then begin 
begin . if z[j] < z[j+1] then 
ifal = begin u := x[j]; v := z[/] end 
begin else 
ifb1 ~ 0.0 then begin uv := x[j+1]; v := z[j+1] end; 
begin x2 := —c1/2.0/61; bool2 := true end; end 
end : else 
else ify > z[j+1] then 
begin begin v := z[j+1];u:= x[j+1] end; 
x1 := (—b1 + sqrt(6l * 61 — 3.0 & al X cl))/ zero := —61/3.0/al; — 
a1/3.0; dzero := abs ((3.0 X al X zero + 2.0 X bl) X 
x2:= (—6b1 — sgrt(bl * 61 — 3.0 * al X cl))/ zero + cl); 
al/3.0; if (x[j] < zero A zero < x[y-+1] A dzero < z{j]) then 
booll := true; bool2 := true; begin u := zero; v := dzero end 
end end; 
end; j:= 0; 


if booll then 


begin ifu = x[j] then resultatf{i] := y{y] 
if (x[]] < x1 A xl < x{j+1)) thenu := x1 else 
end ifu < x{j+1] then 
else begin 
if bool2 then lapliny (rf If, tf[i], iterf, u, y[naf)); 
begin if — bool then resultatf[i] := y[naf] 
if (x[/)) < x2 A x2 < x[j+1)) thenu := x2 else 
end resultatf(i] := yl]; 
end; end 
if = booll /\ vw bool2 V u = 0.0 then else 
for j := O step 1 until naf— 2 do ifj < naf—2 then 
begin beginj := 7 + 1; goto/ end 
if7 = Othenz[j] := abs ((3.0 X al X x[j] + 2.0 X else 
51) X xfj] + cl); ifu = x[j+1] then resultatf{i] <= y4]] 
zU+1] := abs (3.0 KX al & xfj+1] + 2.0 * 51) X end; 
x[j+1] + el); e: 
ifj = Othen end 


end /aplaceinverse; 
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REMARK ON ALGORITHM 486 
Numerical Inversion of Laplace Transform [D5] 
[F. Veillon, Comm. ACM 17, 10 (Oct. 1974) , 587-589] 


Henk Koppelaar and Peter Molenaar [Recd 12 Feb. 1976 and 11 May 1976] 
Department of Psychology, Division MPS, State University of Utrecht, Oudenoord 
6, Utrecht, The Netherlands. 


The following changes were made in the algorithm: 

(1) Within the body of the procedure epsalgor the last call of ecri was changed 
to read: ecri(teps). 

(2) Within the body of the procedure coefsplinetrois the assignment to d[z] was 
changed to read: d{i] := (12Xd[i+1]xX, ete. 

(3) Tests show the increasing inaccuracy of the approximation by Laplace- 
inverse if t gets in the vicinity of zero. In fact if t = 0, overflow occurs at various 
places. The first spot where it occurs is after declaration of coefsplinetrois in the inner 
do-loop: 


for j := 0 step 1 until naf—1 do begin 
x[j] := af(g)/ifle] 


if £ = tf[t] is zero for some 7. In order to avoid this overflow, one may compute 
Laplaceinverse at t * O or insert in the algorithm the precaution: if {f[z] = 0 
then begin ecri (tf[7]);_ go to e end: 

for j := 0 step 1 until naf—1 do begin 

if (f{{i] = 0 then begin ecri(if{i]); 

go to ¢ end; 2[j] := af{j]/if{2] 
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Though this precaution prevents overflow, it is appropriate to add a comment in 
the heading of Laplaceinverse concerning problems if ¢ = 0. Also, in the description 
of the algorithm a warning against t = 0 is necessary. 

(4) In the heading of the procedure epsalgor the declaration boolean bool; was 
erased. 7 

With these modifications the algorithm Laplaceinverse was translated for the 
CDC-6500 using the Control Data Algol 3 compiler. 

The program was used on the following five tests, computing the inverse of 
F(s), s = a@ + 1b, which is f(t), while the program is supplied with Re {F(s)}: 


Test F(s) f() Re {F(s)} 

a 1//s 1// (ir) '  VS(@+p)/2]/p, p = V[(a?-+b?)] 

b 1/(s+0.5) | exp (—#/2) (0.5-+a)/((0.5-+a)?+62) 

c s/(s?+1)? —(t/2) sin (t) a (a?+-4b2(1—b2))/(z?-+4b2(a2))2, 
x= a—b*+1 

d 1/(s?-+s+1) . (2//3)exp(—t/2)sin(t/(2/V/3))  2/(2?+y?), = a®—b?+a+1, 
y = b(2a+1) 

e sexp(—25s) U(t—25) exp (— 25a) (a cos(25b) —b X sin (25d) )/ 
(a?+b?) 


Except for tests b and e the results were accurate to about four decimal places. 
For t & 0.01 the results for test b were accurate to about two decimal places, while 
test e showed accuracy to only one decimal place at t % 25. 
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Numerical Inversion of Laplace Transform [D5] 
[Francoise Veillon, Comm. ACM 17,10 (Oct. 1974) , 587-589] 


Francoise Veillon [Reed 21 April and 30 July 1976] 
Mathématiques Appliquées Informatique, U.S.M.G. B.P. 58, 38041 Grenoble, 
France 


A significant improvement in efficiency can be obtained by using call by value 
rather than call by name where appropriate. Thus the following three changes are 
suggested: 


(1) value tf, nif, af, naf, tterf; 
inserted between the heading of the procedure laplaceinverse and its specifica- 
tions. 
(2) value eps, neps, teps; 
inserted between the heading of the procedure epsalgor and its specifications. 
(3) value tg, iterg, ag; 
inserted between the heading of the procedure laplinv and its specifications. 


As the procedures necded to evaluate Re(F(s)) and the true values of the re- 
sults are the responsibility of the user, two kinds of tests have been performed: 

(a) The modifications (1), (2), and (3) arc included in the procedure laplacein- 
verse. 

(b) Calls by value are also used in the user supplied function rf1f. (Call by value 
is used in only three of the seven truc value functions because it is not worthwhile 
using it when the parameter is referred to only once.) 

The computing times (in seconds) are given in Table I. They concern the calcu- 
lation of ten ¢-values for seven functions; the last column, to the right, concerns 
the mean time for one t-value. 
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The programs were run on an IBM/360/67 computer, using an F compiler, 
under Operating System MVT, version 20.1/asp 2.6. 


Table I 
tl: Computing time needed to evaluate Re(F(s)). 
!2: Computing time irrespective of the time needed to evaluate Re(F(s)). 
13: Computing time per t-value irrespective of the time needed to evaluate Re(F'(s)). 


Full Computing Computing Computing 
computing time time #1 time {2 time 23 
No call by value 61.99 28.18 33.81 0.48 
Call by value only in laplace- 48.42 28.18 20.24 0.29 
inverse 
Call by value in laplacein- 41.16 20.92 20.24 0.29 


verse and user supplied func- 
tion rflf 


rn nn RN Ge 
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Exact Cumulative Distribution 
of the Kolmogorov-Smirnov 


Statistic for Small Samples 
[S14] 


John Pomeranz [Recd. 13 Mar. 1973] | 
Computer Sciences Department, Mathematical Sciences 
Building, Purdue University, West Lafayette, IN 47907* 


Key Words and Phrases: Kolmogorov-Smirnoy test, K-S statistic, 
goodness-of-fit testing 

CR Categories: 8.1, 5.5 

Language: Fortran 


Description 

The algorithm calculates the exact cumulative distribution of 
the two-sided Kolmogorov-Smirnov statistic for samples with few 
observations. The general problem for: which the formula is needed 
is to assess the probability that a particular sample comes from a 
proposed distribution. The problem arises specifically in data 
sampling and in discrete system simulation. Typically, some finite 
number of observations are available, and some underlying dis- 
tribution is being considered as characterizing the source of the ob- 
servations. 

The statistic used here simply measures the maximum deviation 
between the proposed distribution. and the empirical distribution 
derived from the sample. Elementary rules for calculating this devia- 
tion can be found in, e.g. Knuth (4, p. 41], Brunk (2, -p. 267], or 
Miller and Freund (5, p. 222]. Simply put, let Syv(x) be the fraction 
of the N observations which are less than x. Let F(x) be the proposed 
cumulative distribution of the source. Let 


Ky = V/N X max | Sy(x) = F(x) |. 


Usually Ky is called a two-sided Kolmogorov-Smirnov statistic. 
Omitting the absolute value signs gives a one-sided statistic. For 
computational ease we let Dy = Ky//N be the observed deviation, 
unweighted by /N. 

The inputs to the function are the sample size N and a critical 
value D. The function value is the exact probability Pr{ Dy < D} = 
Pr{Ky < DVN}. 

The formulas used in the function are obtained directly from 
Durbin [3, formulas (23) and (24)]. To validate the function, an- 
other was coded using matrices determined by Pomeranz [7], and. 
the two were identical to eight decimal places. Then the function was 
used to generate Birnbaum’s Table 1 [1, pp. 428-30] for D = 1/N, 
2/N,..-,4/N, J =min {N, 15},1 <M < 100. Eightentries differed 
by 10-5, apparently from roundoff error [1, p. 440]. The final test 
was of Miller’s Table 1 [6, pp. 113-15] of critical values in the ex- 
treme tail for 1 < N < 100. (Miller’s approximation is based on the 
one-sided statistic with doubled tail probabilities, which is accurate 


* Present address: A.T. Kearney, Inc., 100 South Wacker Drive, 
Chicago, IL 60606. 
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in the extreme tail.) Newton’s method was used to determine the 
values of D, which yield cumulative probabilities of .8, .9, .95, .98 
and .99, for each N. Miller’s entries agreed within one in the fifth 
decimal place for probabilities other than .8 and within four in the 
fifth decimal place for the .8 probability. This supports Miller’s 
claim (6, p. 120] and further allows the use of the column a = .10 
(P = .80) in his Table 1 when an error in D of 4 x 10-5 is acceptable. 
However, the two-sided statistic and the one-sided statistic [4, p. 44] 
are significantly different outside the tail. For example, with a 
sample size of 10, Pr{Kio < .54} is approximately .12, but at the 
same critical value for the one-sided statistic, the cumulative proba- 
bility is .50. 

Finally, using a CDC 6500, values were computed up to N = 
140. The major limitation is the magnitude of the exponent re- 
quired to represent WV". Rearranging sums produced no changes. 
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REAL FUNCTION PKS2@C(N» D) 
INTEGES 
N IS THE SAMPLE SIZE USED. 
REAL D 
D IS THE MAXIMUM MAGNITUDE (OF THE DISCREPANCY 
BETWEEN THE EMPIRICAL AND PROPOSED DISTRIBUTIONS) 
IN EITHER THE POSITIVE OR NEGATIVE DIRECTION. 
PKS2@ IS THE EXACT PROBABILITY OF OBTAINING A 
DEVIATION NO LARGER THAN D.- 
THESE FORMULAS APPEAR AS (23) AND (24) IW 
Je DURBINe THE PROBABILITY THAT THE SAMPLE 
DISTRIBUTION FUNCTION LIES BETWEEN TWO PARALLEL 
STRAIGHT LINES- ANNALS OF MATHEMATICAL STATISTICS 
394 2CAPRIL 1968)4 398-411. 
DOUBLE PRECISION Q¢141). FACTC141). SUM, Cl, 
* FT, FUs FV 
IF (N+EQ-1) GO TO 96 
FN = FLOAT(N) 
FND = FN*D 
NDT © IFIX(C2-*FND) 
IF (NDT-LT-J) GO TO 186 
ND = IFIXCFND) 
NDD MIN@C2*NDsN) 
NDP ND + 1] 
NDDP = NDD + J 
FACTC1) = Ie 
CI = }e 
DO 18 I1=]4N 
FACTCI+1) = FACTCI)*CI 
Cr = CI + Je 
1@ CONTINUE 
Q¢I> = 1. 
IF (NDD.~EQ.-6) GO TO 58 
CI = }. 
DO 28 I1=)sNDD 
QCI+}) = Cl*¥*I/FACTCI+1) 
CI = CI + Ie 
2@ CONTINUE 
IF (NDP.GT-N) GO TO 88 


a 
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FV = FLOATCNDP) - FND FUNCTION PKS(Ns EPS) 
JUMAX = IDINTCFV) + 1 


C CALCULATE THL CUMULATIVE DISTRIBUTION OF THE 
DO 48 1*NDP,NDD C KOLMOGCROV-SMIRNOV STATISTIC USING THE FORMULAS OF 
SUM = @- C JOHN POMERANZ. EXACT VALUES OF THE TWO-SIDED 
FT = FND C KOLMOGOROV- SMIRNOV CUMULATIVE DISTRIBUTION FOR 
ee C FINITE SAMPLE SIZE. TECHNICAL REPORT NUMBER 88. 
FU = FV C COMPUTER SCIENCES DEPARTMENT, PURDUL UNIVERSITY, 
DO 30 J=1sJMAX C FEBRUARY 1973. 
SUM = SUM + FTe*(S-2)/FACTC I) #FURKK/ DOUBLE PRECISION PF(4,4G), UC46), VC4B) 
* FACTCK+} ) DOUBLE PRECISION SUm 
PTs PT +. Js DIMENSION DXAC4) 
iy oe ; 1. COMMCN DX, DXA» PFs L 


DATA MNP /4@/ 
38 = CONTINUE 


QACI+#)) = ACI+1)9 - Bee FND& SUM 
JMAX = JMAX + | 
FV = FV + J. 


FN = N 
RN = }-/FN 
K = EPS*FR + «8880206! 


FK = K 
40 SONTINUE IF CABSCFK-EPS*FN)+GT.-6@420021) GO TO 18 
IF (NDD.-£0-iv) GO TO B89 K 2 K - } 
$@ DO 78 I=NDDP.N FK = EK 
SUM = @. 1@ CONTINUE 
SIGN’ = |e DEL = EPS - FK«*RN 
FT = 2.eFND XUP = RN - DEL 
DO 6@ J=!,NDT XLO = DEL 
FT = FT = Je IF CABS(XUP-XLO)-LT..888@8008!) XUP = XLO 
Ke Pe gd +} XPREV = @. 
SU4 = SUM + SIGN#FTe*J/FACTC Ut!) *QCK) DC 20 1=1,MNP 
SIGN = ~SIGN UCI) = @- 
68 CONTINUE 28 CONTINUE 
Q¢i+1) = SUM UCKel) © Le 
7% CONTINUE IMIN = -K 
BW PKS2 =: 9CN+])*FACTCN+12/FN&eN 30 X = AMIN! CXUP.,XLO) 
RETURN IF (X-GT.e99S669) X = Je 
98 PKS@ = 2.eD - |. DX = X - XPREV 
RETURN JMIN = CEILCCX-EPS)*FN--@@000001) 
108 PKS2 =.@. IF CABSCFLOATCJUMIN) -(X-EPS)*FN)-«LT- «-@8000001 ) 
RETURN * JMIN = JMIN + J 
es UMAX © (X+#EPS)#FN + «9900081 
IF CABSCFLOATCJMAX)- (X+EPS)#FN)-LT«-@2U@88081) 
* JMAXY = JMAX - 1 i 
JMAX = JMAX - JMIN + } 
SUBPOUTINE PRFAC CALL PRFAC 
DOUBLE: PRECISION PF(4.40) DO 6@ J=1.MNP 
DIMENSION DXAC4) SUM = @. 
COMMON: DX, DXAs PFs J IF (J«GT-JMAX) GO TO 5@ 
DATA I. /17 I=] 
DO 38 J=144 4G IP =©§ J - I + 1 + JMIN - IMIN 
IF (DXACJ).EQ.DX) RETURN 
1a CONTINUE SUM = SUM. + UCI) *PFCLZIP) 
J = 1 Ie y « J 
rs r+! IF CCEMINt+I>-LE-CUMIN+J))> GO TO 48 
IF C1e/EG.5) 1 = J 5a V¢J) = SUM 
DX¥ACI) = DX : 68 CONTINUE 
PFCJ.1) = 1. DO 78 1=}.MNP 
DO 2@ K=2,38 UCI) = VET) 
PFCdsK) = CPF CJsK~-1)«DX)/FLOATCK-1) 78 CONTINUE 
26 CONTINUE IMIN = JMIN 
RETURN XPREV = X 
END IF (X.EQ.XUP) XUP = XUP + RN 
IF (X.EQ-XLO) XLO = XLO + RN 
IF (XeLTe1-) GO TO 38 
DO 8@ 1=1,N 
FUNUTION CEIL(X) UCK+1) = UCK+1) *FLOATCI) 
IF (X.GE.8.) GO TO 16 8B CONTINUE 
I= -x: PKS = UCK+1) 
CEIL = -1 RETURN 
RETURN END 
10 1 =X + 196999999 
CEIL = I 
RETURN 


ENG 
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REMARK ON ALGORITHM 487 


Exact Cumulative Distribution of the Kolmogorov-Smirnov Statistic for Small 
Samples [S14] 
[J. Pomeranz, Comm. ACM 17, 12 (Dec. 1974), 703-704 ] 


Subroutine PRFAC, function subprogram CEIL, and function subprogram 
PKS, which were published as a part of Algorithm 487, were test routines that 
were inadvertently printed along with the main algorithm. 
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A Gaussian Pseudo-Random 
Number Generator [G5] 


Richard P. Brent [Recd. 9 Nov. 1973, and 19 Dec.1973] 
Computer Centre, Australian National University, 
Canberra, Australia 


Key Words and Phrases: random numbers, pseudo-random num- 
bers, Gaussian distribution, normal distribution 

CR Categories: 5.39, 5.5 

Language: Fortran 


Description 

Introduction. Successive calls to the Fortran function GRAND 
return independent, normally distributed pseudo-random numbers 
with zero mean and unic standard deviation. It is assumed that a 
Fortran function RAND is available to generate pseudo-random 
numbers which are independent and uniformly distributed on 
(0, 1). Thus, GRAND may be regarded as a function which converts 
uniformly distributed numbers to normally distributed numbers. 

Outline of the method. GRAND is based on the following algo- 
rithm (Algorithm A) for sampling from a distribution with density 
function f(x) = K exp (—G(x)) on (a, 6), where 


0< G(x) <1 (A) 
on [a, b), and the function G(x) is easy to compute: 


Step 1. If the first call, then take a sample u from the uniform dis- 
tribution on [0, 1); otherwise u has been saved from a previous 
call. 

Step 2. Set x —a + (6 — a)uand uy — G(x). 

Step 3. Take independent samples m, wo, ... from the uniform 
distribution on [0, 1) until, for some k > 1, ura < m. 

Step 4, Set u<— (ux _ Un_-1)/(1 = Ur_1). 

Step 5. If k is even go to Step 2, otherwise return x. 


The reason why Algorithm A is correct is explained in Ahrens 
and Dieter [2], Forsythe [4], and Von Neumann [6]. The only 
point which needs explanation here is that, at Step 4, we can form 
a new uniform variate u from uz. and u, , thus avoiding an extra 
call to the unifofm random number generator. This is permissible 
since at Step 4 it is clear (from Step 3) that (uw — m1)/(1 — 
ux_1) is distributed uniformly and independent of x and k. (The 
fact that it is dependent on 1, is irrelevant.) 

Let a; be defined by (2/r)} fo; exp (— 40)dt = 2-* for 
i= 9,1,.... To sample from the positive normal distribution 
(Algorithm B), we may choose i > 1 with probability 2-* (easily 
done by inspecting the leading bits in a uniformly distributed 
number) and then use Algorithm A to generate a sample from 
[a:-1, ai), with G(x) = }(x? — ais). It is easy to verify that 
condition (1) is satisfied, in fact 


L(a;2 — aja) < log (2). (2) 


Finally, to sample from the normal distribution (Algorithm 
C), we generate a sample from the positive normal distribution 
and then attach a random sign. 

Comments on the method. The algorithm is exact, apart from 
the inevitable effect of computing with floating-point numbers 
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with a finite word-length. Thus, the method is preferable to meth- 
ods: which depend on the central limit theorem or use approxima- 
tions to the inverse distribution function. 

Let N be the expected number of calls to a uniform random 
number generator when Algorithm A is executed. If the expected 
value of k at Step 3 is E, and the probability that & is even is P, 
then N =.E +N P,soN = E/(1 — P). From Forsythe (4, eq. 
(11)], E = (6 — a) f,> exp (G(x))dx and 


1 b 
1-p=~— | exp (—G(x)) dx, so 
b= ad, 


b b 
n= f exp (G(x)) ax / exp (— G(x)) dx. (3) 


From (3) and the choice of a; , the expected number of calls to a 
uniform random number generator when Algorithm C is executed is 


0 a; a; 
» al exp (f(x? — ai_s)) ax / | exp (—$(x" — aj) dx 
tl Qi-) aj-1 


= 1.37446. (4) 


This is lower than 4.03585 for the algorithm given in Forsythe [4]; 
or 2.53947 for the improved version (FT) given in Ahrens and 
Dieter (2]. It is even stightly lower than 1.38009 for the algorithm 
FI, of [2], and FL, requires a larger table than Algorithm C. 
Thus, Algorithm C should be quite fast, and comparable to the best 
algorithms described by Ahrens and Dieter [1]. The number (4) 
could be reduced by increasing the table size (as in the algorithms 
FL,, FL;, and FL, of [2]}), but this hardly seems worthwhile. 
Exact timing comparisons depend on the machine and uniform 
random number generator used. (If a very fast uniform generator 
is used, then Step 4 of Algorithm A may take longer than generat- 
ing a new uniform deviate.) 

The loss of accuracy caused by Step 4 of Algorithm A is not 
serious. We may say that loge (1 — ux_1)7! “bits of accuracy” are 
lost, and in our application we have, from (2) and Step 3 of Algo- 
rithm A, log (2) > uo > --+ > ux_1, so the number of bits lost is 
less than log, (1 — log (2))7! < 2. 

Test results. \f x is normally distributed then uo = 
(2r)-+ ft, exp (— 32) dt is uniformly distributed on (0, 1). 
Hence, standard tests for uniformity may be applied to the trans- 
formed variate u. Several statistical tests were performed, using a 
Univac 1108 with both single-precision (27-bit fraction) and double- 
precision (60-bit fraction). For example, we tested two-dimensional 
uniformity by taking 10® pairs (u, u’), plotting them in the unit 
square, and performing the Chi-squared test on the observed num- 
bers falling within each of 100 by 100 ‘smaller squares. This test 
should show up any lack of independence in pairs of successive 
uniform deviates. We tested one-dimensional uniformity similarly, 
taking 10 trials and subdividing (0, 1) into 1,000 smaller intervals. 
The values of x? obtained were not significant at the 5 percent Jevel. 
It is worth noting that the method of summing 12 numbers dis- 
tributed uniformly on (—1/2, 1/2) failed the latter test, giving 
x29 = 1351. (The probability of such a value being exceeded by 
chance is less than 10711.) , 

Naturally, test results depend on the particular uniform gen- 
erator RAND which is used. GRAND will not produce independent 
normally distributed deviates unless RAND supplies it with inde- 
pendent uniformly distributed deviates! For our tests we used an 
additive uniform generator of the form up = uni + Un_in (mod 2”) 
with w = 27 or 60 (see Brent [3] and Knuth [5]), but a good linear 
congruential generator should also be adequate for most applica- 
tions. 
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Comparison with Algorithm 334. The fastest exact method 


previously published in Communications is Algorithm 334 [7]. 
We timed function GRAND, subroutine NORM (a Fortran transla- 
tion of Algorithm 334), and function RAND (the uniform random 
number generator called by GRAND and NORM). The mean 
execution : times obtained from 500,000 trials on a Univac 1108 
were 172, 376 and 59 ysec respectively. Since NORM returns two 
normally distributed numbers, GRAND was effectively 9 percent 
faster than NORM. Based on comparisons in {2], we estimate 
that the saving would be greater if both routines were coded in 
assembly language, for much of the execution time of NORM is 
taken up in evaluating a square-root and logarithm which are 
already coded in assembly language. 


GRAND requires about 1.38 uniform deviates per normal de- 


viate, and NORM requires 4/7 + 1/2 > 1.77. Thus, we may es- 
timate that if a uniform generator taking U usec per call were used, 
the time per normal deviate would be (91 + 1.38U) usec for 
GRAND and (83 + 1.77U) ysec for NORM. Hence, GRAND 
should be faster for U > 20. 
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FUNCTION GRANDCN) 
EXCEPT ON THE FIRST CALL GRAND RETURNS A 
PSEUDO~-RANDOM NUMBER HAVING A GAUSSIAN (I-E.- 
NCPMAL) DISTRIBUTION YITH ZERO MEAN AND UNIT 
STANDARD DEVIATION. THUS,» THE DENSITY IS) FCX) = 
EXP (-3.5eX**2)/SQRT(C2-G*PI). THE FIRST CALL 
INITIALIZES GRAND AND RETURNS ZERO. 
THE PARAMETER N IS DUMMY. 
CRAND CALLS A FUNCTION RAND» AND IT IS ASSUMED THAT 
SUCCESSIVE CALLS TO RAND(@) GIVE INDEPENDENT 
PSEUDO- RANDOM NUMBERS DISTRIBUTED UNIFGRWiLY ON (Ss 
})s POSSISLY INCLUDING @ (BUT NOT 1). 
THE METECD USED WAS SUGGESTED BY VON NEUMANN, AND 
IMPROVED BY FCRSYTHEs AHRENS,» DIETER AND BRENT. 
ON THE AVEPAGE THERE ARE 1-37746 CALLS OF RAND FOR 
EACH CALL:.CF GRAND. 
WARNING =: DIMENSION AND DATA STATEMENTS BELOW ARE 

MACHINE-DEPENDENT. 

DIMENSION: OF D MUST BE AT LEAST THE NUMBER OF BITS 
IN THE FRACTION OF A FLOATING-POINT NUMBER. 
THUS» ON MOST MACHINES THE DATA STATEMENT BELOW 
CAN BE TRUNCATED. 
IF THE INTEGRAL OF SQRT(2-@/P1)*EXP(-G-eSaX**2) FROM 
ACI) TO INFINITY IS @*«x¢-I), THEN DCI) = ACID - 
ACI-1)-+ 
DIMENSION D(68) 
DATA D¢})» DC2)5 
D(8)s DCI)» DCIB)s 
D(14)s DCIS)» DCI6)s 
DC2G), OC2I)s DC22), DC23)4 DC24)5 DCE5)~ 
D(26)5 DC27)s DC28)Is DC29)s DEC3O)s DC3I)4 
DO32) /8+ 6744897505 8-47585963658- 3837711644 
G9. 32661) 32340-2911 42827,G0+ 263684322, 
G+ 24256845 256+ 225567444, 6+ 211634) 665 
Ge 199924267,80.189918758,8-18I 2251814 
G17 3601400... 1€6841969,6-16879672S95 
@+155349717.0-1S5@48S 3844 6+ 145962577, 
@-141772233,6-13796€317 459.1 34441762, 
G+131172158.6-1281 25965,8- E5279 696, 
@-122616883,86.120183568,0.117741767. 
@-115511892,8-.1134823465G-1114827285 
@-189563852,0-1876S7617/ 


D¢3)s DC4)s DE5)s 
D¢Is)ds DC12).4 
DO17)» 


D(C 6), 
DCI3)5 
DCI8)s DCI9)s 


DC7).» 


Hee KKHRAEH HHH HEH 
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D(36)2 DC37)s DC38)s5 
D¢42), DC43)2 D444 
D(48)5 DC4S)s DC5O)s 
D¢51)s DCS2)s DC53)4 DCS4). BCS5)s DC(56), 
D¢S7), DE¢S8)s DC89)s DC6B) 
40618597 677220 -1 843348 4)50.15E76681 2, 
G.ISIEESGS2Ss Be BPS 827 234s O- O98 A448 2825 
O-89712432920-8956517784 0+ 894E2746)1 4 
O.GS344E 40756-85231 190522891 21548 2, 
@.0501568 3846-0891 33867+0-8881 44615, 
O@+88718729350-88626621558-885361 834, 
Ze C844907S 6s S-283645 487. 2-88 28 24524, 
@-982827 847, 5eB812531€2s0-LB8S4SS 8445 
B 2797 669325 G- 6790535274 G-07835878 15 
9.877681 8997 
END OF MACHINE=DEPENDENT STATEMENTS 
U MUST BE PRESERVED BETWEEN CALLS. 

DATA U /3.8/ 
INITIALIZE DISPLACEMENT A AND COUNTER I. 

A = @-08 

I= @ 
INCREMENT COUNTER AND DISPLACEMENT IF LEADING BIT 
OF U IS ONE. 


DATA D(33). DC34). DC35), 
D(O39)5 DC4B8)s DC4I)s 
D¢45), DC46)5 DC47)s 


AHRHERH HEHEHE HE HN 


1@ US U+ U 
IF (UeLTeJ]+G) GO TO 20 
U= Ur 1-8 
Le de 
A= A - DCI) 
GO TO 18 

FORM W UNIFORM ON @ eLE« W -LTe DCI+!]) FROM U. 

26 W = DCI+))*U 

» FORM V = @eSeCCWrAD##*2 =- A**2)- NOTE THAT @ «LE. V 
LT. LOG(2). 


Vo= WeCO.S*eY-A) 
GENERATE NEW UNIFORM U- 
38 U = RANDCE) 
ACCEPT Y AS A RANDOM SAMPLE IF V 
IF (VeLE-U) GO TO 4@ 
GENERATE RANDOM V- 
V = RAND(G) 
LOOP IF U «GT. V-» 
IF €UeGT.-V) GO TO 36 
REJECT W AND FORM A NEW UNIFORM U FROM V AND U. 
Uo = (UU) 761 -8-UD 
GO TO 2@ 
FORM NEW U (TO BE USED ON NEXT CALL) FROM U AND V. 


eLJee Ue 


4@ U = €U-V)/C1.8-V) 
USE FIRST BIT OCF U FOR SIGNs RETURN NORMAL VARIATE. 
U=U+tU 
IF CUeLT.1-@) GO TO 56 
U = U - 1-8 
GRAND = W- A 
RETURN 
5S® GRAND = A - W 
RETURN 
END 
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The Algorithm SELECT—for 
Finding the ith Smallest of n 
Elements [M1] 


Robert W. Floyd [Recd 26 Sept. 1974] 

Computer Science Department, Stanford University, 
Stanford, CA 94305 

and 

Ronald L. Rivest, M.I.T. Project MAC, 

545 Technology Square, Cambridge, MA 02139 


Key Words and Phrases: selection, medians, quantiles 
CR Categories: 5.30, 5.39 


Language: Algol (not strictly Algol 60) 


Description 

SELECT will rearrange the values of array segment X|L : R| 
so that X[K] (for some given K; L < K < R) will contain the 
(K—L-+-1)-th smallest value, L < I < K will imply X[/| < X[K], 


and K <7 < Rwillimply X[/) > X[K]. While SELECT is thus | 


functionally equivalent to Hoare’s algorithm FIND [1], it is sig- 
nificantly faster on the average due to the effective use of sampling 


to determine the element T about which to partition X. The average ~ 


time over 25 trials required by SELECT and FIND to determine the 
median of 7 elements was found experimentally to be: 


a 500 1000 5000 7 10000 Poe 
SELECT 89 ms. 141 ms. 493 ms. 877 ms. 
FIND 104 ms. 197 ms. 1029 ms. 1964 ms. 


The arbitrary constants 600, .5, and .5 appearing in the algorithm 

minimize execution time on the particular machine used. SELECT 

has been shown to run in time asymptotically proportional to 

N+ min (,N—J), where N = L—~ R+ 1and/= K—L +1. 

A lower bound on the running time within 9 percent of this value 

has also been proved [2]. Sites {3} has proved SELECT terminates. 
The neater Algol 68 construct: 

while (boolean expression) do (statement) 

is used here instead of the Algol 60 equivalent: 

for dummy := 1 while (boolean expression) do (statement) 


References 
1. Hoare, C.A.R. Algorithm 63 (PARTITION) and Algorithm 65 
(FIND), Comm. ACM 4 (July 1961), 321. 
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2. Floyd, Robert W., and Ronald L. Rivest. Expected time 
bounds for selection. Stanford CSD Rep. No. 349, Apr., 1973). 

3. Sites, Richard. Some thoughts on proving clean termination of 
programs. Stanford CSD Rep. 417, May 1974. 


Algorithm 
procedure SELECT (X,L,R,K); 


value L,R,K; integer L,R,K; array X; 
begin 
integer N,/J,J,S,SD,LL,RR,; real Z, T; 


while R > L do 
begin 
if R — L > 600 then 
begin 
comment Use SELECT recursively on a sample of size S 
to get an estimate for the (K—L+1)-th smallest element 
into X{K], biased slightly so that the (K-—L+1)-th 
element is expected to lie in the smaller set after partition- 


ing; 
N:=R-—-L+1; 
l:=K~—~L+1; 
Z := InN); 


S:= .5 X exp(2xZ/3); 
SD := 5 X sqrt(ZXSX(N—S)/N) X signd—N/2); 
LL := max(L,K—!1XS/N+SD); 
RR := min(R,K+(N—D X S/N+SD); 
SELECT(X,LL,RR,K) 

end; 

T := X(K]; 

comment The following code partitions X|LZ : R] about T. It 
is similar to PARTITION but will run faster on most ma- 
chines since subscript range checking on / and J has been 


eliminated. ; 
I:=L; ; 
J:=R; 


exchange(X|L\,X{K]); 
if X{R] > T then excharge(X|R],X[L]); 
while / < J do 
begin 
exchange(X|I),X|J}); 
P:=14+1;J:=J/-—1; 
while X{/} < Tdo/:=/+ 1; 
while X[J] > Tdo J := J — 1; 
end; 
if X{L]) = T then exchange(X|L|,X{J]) 
else begin J := J + 1; exchange(X{|J|,X|R}) end, 
comment. Now adjust L, R so they surround the subset con- 
taining the (K—L+1)-th smallest element, 
if/ < KthnL:=J/+1; 
if K < Jthen R:= J — 1; 
end 
end SELECT 
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REMARK ON ALGORITHM 489 


The Algorithm SELECT—for Finding the ith Smallest of n Elements [M1] 
[R.W. Floyd and R.L. Rivest, Comm. ACM 18, 3 (March 1975), 1731] 


Theodore Brown [Recd 2, Oct. 1975| 
Department of Computer Science, Queens College of the City of New York, 
Flushing, NY 11367 


Algorithm 489, SELECT, is an cffective algorithm for finding the kth smallest of 
elements. The authors, Floyd and Rivest, have analyzed its properties in a com- 
panion paper [1]. 

The description of the algorithm given here is different from that given by Floyd 
and Rivest [1] and is truer to the actual implementation. The description, further- 
more, leads to a simple modification of the algorithm that, as is shown, improves its 
performance for finding values near the median. It is also shown that a small con- 
stant multiplying the standard deviation term is beneficial. Finally, a basic error in 
Floyd and Rivest’s analysis is pointed out. 

SELECT can be viewed as a desecndant of FIND [3], an earlier algorithm for 
finding the kth smallest clement. A major component of SELECT is au improved 
coding of the partitioning algorithm PARTITION [8] used by FIND. This also is 
the partitioning algorithm used by the familiar QUICKSORT [3]. The partitioning 
works by dividing the n elements into two parts: those greater than a chosen element 
and those less than it. (Equality is ignored here. The analyses are based on uniquely 
valued elements.) In FIND (and in SELECT) the partitioning is reapplied re- 
peatedly to the partition that contains the required kth smallest clement until this 
value is determined. 

FIND chooses the partitioning element randomly from the available candidates. 
The improved performance of SELECT is based on the use of a sample of the avail- 
able candidates to determine the partitioning element. As described in the following 
paragraphs, the jth smallest of the sample, say S,;) (found by recursively calling 
SELECT), is chosen so as to reduce the subsequent size of the required partition. 

Writing the kth smallest of the original n clements as x(4) , for a sample of size 
s(n) = s the probability that the jth smallest sample value is the 7th smallest of the 
original 7 is 


PriSy) = tw} = GayC)/@), (1) 


as j — 1 elements of the sample must be less than x(;) and s — 7 greater. The'mean 
and variance for this distribution arc, respectively, 


kg) = J (n+ 1)/(s + 1) (2) 
ou = (fs — jt (n+ 1)(n— s)/(s + 1)(s + 2) (3a) 
< i(n4+ 1)(n - s)/s. (3b) 


Kquation (2) can be interpreted as the mean sizc of the partition of the n elements 
which contains S,;) and the values smaller than it. 

Floyd and Rivest [1] suggest a value for 7 of u = way — 2d(n)ou) if k > n/2 or 
v= way + 2d(nou) if k < n/2, d(n) a slowly inercasing function of n ((In n)"” 
is used). They suggest this value for 7 to make sure that the kth smallest falls in the 
partition cither greater than u (if uw is used) or less than v (if v is used). A better 
criterion is to keep the partition that will contain the k-th as small as possible. Their 
stated criterion is contrary to this for very small k, values of k near n, and for values 
of k near the median; for intermediate values of k, their criterion is consistent with 
this one. 

Notice that from eq. (1), Pr {S.;) >taq} = 1 fork < 7. So for very small values of 
k, it does not pay to choose 7 > &k (or for k near n, 7 < k). The coded version of 
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Table I. Times (in msec) To Find 
Median SELECT 


; Our 
N Algorithm Fortran Difference 
489 ‘ 
version 
500 89 44 45 
1,000 141 89 52 
5,000 493 363 130 


SELECT takes care of these conditions in the MIN and MAX functions. Notice too 
that when finding a median it pays to choose 7 = wa) = s/2. Any other choice will 
cause k to be most likely in the larger partition. 

In fact, for any 7 it never pays to choose a value of u less than s/2 or a value of v 
greater than s/2. It is proposed that the calculation of u and v be modified to pay + 
2d(n)f(n)o~) and wy — 2d(n)f(n)o «%) , respectively, with f(n) a function that mono- 
tonically goes to zero from each side of the median. We uscd a linear function, re- 
placing the SIGN function in the coded calculation of SD by the factor (2 X I[/N — 
1). 

A Fortran version of SELECT was written for an X DS Sigma 7. Table I compares 
the times published by Floyd and Rivest in Algorithm 489 with those obtained here. 
Unfortunately, Floyd and Rivest only give times for finding a median. Notice, 
however, that not only is our version faster but that it gives proportionately better 
results for larger n. Our Fortran program was run first with no modification, then 
with the proposed modification. Figure 1 shows the timing of our Fortran version of 
SELECT without the modification (labeled 1) and with the proposed modification 
(labeled 2). As expected, the most substantial improvement occurs at the median. 

Additional improvement was obtained by reducing the size of d(n). This is true 
for several reasons. For n = 5000, d(n) = 2.9. With a normal approximation, the 
probability that k is more than 5.8 standard deviations away from the mean is less 
than 10-°. This is a much stricter bound than required, and can be substantially 
reduced without adverse effects. One does not need to be so careful that the kth 
smallest element does not end up in the smaller partition. Even if the kth smallest 
ends up in the larger partition but near the boundary, the reduced problem can be 
done efficiently. This can be scen in Figure 1. Furthermore, the algorithm’s use of 
the bound (8b) in place of the true deviation overestimates the true standard devia- 
tion. Floyd and Rivest recognized this and used a 0.5 multiplier for the standard 
deviation in the coded version. It was found that a multiplier of 0.1 produced even 
better results. The modified standard deviation with a 0.1 multiplicr gave the results 
labeled 3 in Figure 1. 

Floyd and Rivest [1] assert that their choices of s(n), wu, and v make the probabil- 
ity of o(1/n) that k will fall in the partition less than u if wis used or in the partition 
greater than v if v is used. This is incorrect. It is not possible for any u or v for 
their choice of s(n). Even the choice of v = Sq) or u = Sq) is not adequate, for 
from eq. (1), 


Pr{Sq) > tw} = (")/G) = (n — s8)e/(m)e, 


where (n), = n(n — 1)...(n — k 4+ 1), isclearly not o(1/n). The best choice of 
s(n) is an open question. The sorting mcthod of Frazier and McKellar [2] has simi- 
larities to SELECT—it uscs sampling and the partitioning of PARTITION [2]. 
Frazier and McKellar suggest a sample of 0.1n for their procedure. No appreciable 
change in the times resulted from using this sample size. The values differed by less 
than 10 percent. Further experiments showed that the modifications made here 
made the running time of SELECT rather insensitive to changes in the parameters 
that Floyd and Rivest [1] suggest tuning for the particular computer: the sample 
size and the cutoff point below which the algorithm does not do sampling. 
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Algorithm 490 


The Dilogarithm Function 
of a Real Argument [S22] 


Edward S. Ginsberg* [Recd 22 June 1973] 

Department of Physics, University of Massachusetts 
at Boston, Boston, MA 02125 

and 

Dorothy Zaborowskit 

Information Processing Center, Massachusetts Institute 
of Technology, Cambridge, MA 02139 


Key Words and Phrases: dilogarithm 
CR Categories: 5.12 
Language: Fortran 


Description 
The dilogarithm function [1-3], defined by 


Linx) = — f (1/z) In (1 — z) dz, (1) 


occurs in several different applications in physics and engineering, 
ranging from quantum electrodynamics, to network analysis, to the 
thermodynamics of ideal ferromagnets, to the structure of polymers. 
A new function subroutine is developed which computes the diloga- 
rithm function of a real argument to an accuracy of a few parts in 
1045. This program was designed to be included in the usual package 
of library subprograms relied upon by most users. It employs an 
alternative computational approach to a previously published 
algorithm [4]. 

The dilogarithm function is real for real argument x < 1 and 
complex for x > 1. However, the imaginary part of the dilogarithm 
is just an ordinary logarithm, —imln(x), when x > 1, which does 
not require special means for computation. Therefore, the following 
algorithm and comments are concerned only with the computation 
of the real part of the dilogarithm function for real argument. 

Briefly, the method consists of transforming the usual series 
definition 


Lisle) = 2 "nt, [x1 <1, (2) 


into a more highly convergent power series by means of partial 
fractions. The identity 


eae ee, S 
natilia+2) 2\n n+l n+2 


* This work was supported by NSF grant GP 29705. t Present 
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leads immediately to the relation 
(1 + 4x + x?) Lie(x) = 4x2 3 (xr/[n(n + 1)(n + 2)2P) 
1 


+ 4x + ax 30) =e) n(x), a 


|x| <1. 


This equation permits the evaluation of Li.(x) for | x | < 1 using 
a series which converges like x"/n® instead of x"/n?. Of course, more 
partial fractions can be employed to increase the rate of convergence 
even further, but then the resulting equation for Li(x) is not so 
simple. The “optimal” number of partial fractions is a question 


requiring further study. 


By the use of well-known functional identities, it is possible to 
relate the real part of Li.(x), for any real argument, to values of the 
function in the restricted range 0 < x < }. With x = 4, the maxi- 
mum relative error in Li,(4) after only 25 terms from eq. (4) is 


' roughly 


DEN OL ENG. x, 
3\2 25) ; 


In many cases, far fewer terms are actually needed to achieve this 
relative accuracy. The various ranges of argument and the corre- 


. sponding identities used in the Fortran program listing below are: 
.forx > 2 


Re(Lin(x)] = 7?/3 — $(in x)? — Lin(1/x), 
for2>x>1 
Re[Zi.(x)] = 22/6 — (In x)(In(x — 1) — 31n x) 
+ Li(l — 1/x), 


for! > x >} 


2 
Li(x) = 2/6 — (In x)In(1 — x) — La(l — x), 


for0O >x>—-1 
Liz(x) = —3[In(l — x)P — Li(x/(x — 1)) 


for -1>x 
Lig(x) = 2?/6 — 3In(1 — x)[2 & In(—x) — Indl - x)] 
+ Li(t/(l — x)). 


The inherent limitations of floating point arithmetic forced 
certain modifications and are the only serious sources of error. For 
example, when | x | is small, the argument of the natural logarithm 
in eq. (4) is close to unity. The error in DLOG (the library subpro- 
gram) then determines the accuracy of DILOG. It was found that 
for 0 < |x| < 10~*, the original series, eq. (2), with eight terms, 
provided 16-place accuracy. Also excluded is a small region around 
Xo ce 12.595 . . .,1 which is a zero for the real part of the dilogarithm. 


Here, a Taylor series is used for the calculation. The relative accu- 


racy of DILOG suffers accordingly, because the closer x is to Xo, 
the more significant figures are lost in computing the difference 
(x/xo) — 1 used in the expansion. (In addition, the value of xo 
probably cannot be expressed exactly in floating point or hexa- 
decimal form.) It is possible to recoup some relative accuracy by 
computing (x/xo) — 1 to higher than machine precision [5]. How- 
ever, this would require calculating x» to more significant figures 
than presently known. 

The most accurate tables [2] (nine decimal places) published 
thus far are not adequate to check the values computed by D/LOG. 


1The best value for xo obtained by the authors so far is 
12.5951703698450184. ... 
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Instead, the program was tested at a selection of arguments for 
separate ranges of x as follows: 

(a) For certain special arguments, the dilogarithm function 
can be expressed entirely in terms of elementary functions. These 
are: 1, —1,2,3,2 +4,1+ 4,49, 1—-— 4, —q, and —1—gq, where 
q = $((5)! — 1). For example, Li.(1) = 12/6, and L(g) = —In’g + 
w/10. 

(b) For values of | x | close to unity, DILOG can be checked 
against a Taylor series expansion. Most of the discrepancy for this 


Algorithm 
DOUBLE PRECISION FUNCTION DILOGCX) 


C REAL PART OF THE DILOGARITHM FUNCTION FOR A REAL 
class of argument is associated with the computation of 1 — x C ARGUMENT. REF+ NO- i=Le LEWIN, *DILOGARITHMS + 
. C ASSOCIATED FUNCTIONS* 
when x is near unity. C (MACDONALD, LONDON, 1958). 
(c) For very small values of x, an exact calculation by hand is C NUMERICAL CONSTANTS USED ARE C(N)=(N(N+1) (N+2) dine? 
; : C FOR N=1 TO 385 ¢PI#e2)/3=%.28986800+8, 
practical with eq. (2). C (PI#*2)/6=1-644394e5., AND) ZERO OF DILOG ON THE 
(d) For very large values of x, an exact hand calculation for C POSITIVE REAL AXIS» XO=12.595 17606 
; ; ; ; ; DOUBLE PRECISION As Bs BY, Ce Cl, C2, C3s Cds 
the difference Li(x) — Liz(— x) is possible. In this case, of course, * DXs DYs TEST, Wo X» X@s Ys Z 
there is cancellation between the two terms so that fewer than 16 DIMENSION 6¢30) 
: 3 DATA C(1)s CC2)s C63). ClAds CCS)s ClO)» CCT), 
places of accuracy are to be expected in evaluating the difference. * C68)s COO. COIB)s COl1d, CC12), CC13). 
(Since Lin(x) ae = In?| x |, for | x | > 0, the values shown in the * COI4)s CCO1I5)s CO16)5 COITIs CC1IBIs CCID), 

; ; : * C(20)s C21). CC22), C623). C624), C25), 
table below for Li(x) — Li(—x) are consistent with 16-place * C(26)4 C275 C28), C629). CC3B) 
acenbacy for DILOG. * /36+DBs576+DGs 36-D2s 144.024 441-D2s 112896-DO, 

y G.) ; * 254016+D8,5184.D2,9801-D2, 17424.D2, 2944656. DG, 
It can be seen that the worst case in the table represents a rela- * 4769856 «D8s74529.D2, | 12896.D2, 166464-D2, 
: ‘ g ae * 239708 16.D8s33802596«DO,467856+D2, 636804.D25 
tive error of only 2.4 parts in 10'*. Thus, 15 to 16 significant figures * 853776-D2, 112911876 Ds 1474767 36.DB, 19644.DA, 
are correct, representing a slight gain over K6lbig’s algorithm * 24336.D4s 3080625. D2, 386358336 .DGs 480661776-DB, 
[4]. Moreover, a test on an IBM 370/165 of the time required for ee ae fe 
1,000 calls to DILOG, for randomly generated arguments of abso- IF (XeGE-12.59DB) GO TO 186 
. . IF (X.GE.2.D8) GO TO 12 
lute value less than 100, revealed that the present algorithm is lk CET EA Dd GO 7660 
twice as fast as Kolbig’s (0.21 vs. 0.43 sec). IF (XeEQe1+DB) GO TO 38 
IF (X.GT..5D) GO TO 49 


SELECTED VALUES OF CILOG FCR VARICUS ARGUMENTS (X.GT-{[-D-2) GO TO 58 


SPECIAL VALUES EXPRESSIBLE IN ELEMENTARY TERMS IF (XeLT«-1+D8) GO TO 68 
x DILCG(X) CHECK IF (XeLTe-16D-2) GO TO 78 
0+10000000000000000401 0.6 16449340668482260401 04 16449.140668482260401 C DILOG COMPUTED FROM.REFe NO+ ly P.244s EQ(1). 


-0-10000000000000000+01 
0-20000000000000000+01 
0.-50000000000000000+00 
0.2618033988749895D0+01 
0. 16180339 88749895001 
0.6180339867498948D+00 
0-38196601125010520+00 

—-0.61803398874989480+00 

-0.16180339887498950+01 

ARGUMENTS CLOSE TO UNITY 
x 
0-10000100000000000+01 
0.99999000000000000+00 
0.1000000000 1000000+01 
0.99999999990000000+00 
0.10000000000000010+01 
0.99999999999999900+00 
~0.99999000000000000+00 

-0.10000100000000000+01 

-0. 9999999999 0000000+00 

-0.10000000001000000+01 

~06 9999599 99999999900+00 

-0.10000000000000010+01 


0. 82246703342411300400 
00 24674011002723370401 
0 5822405 2646501230400 
0.24003296863795660+01 
0624186901038761120+01 
0.75539561953174130+00 
004264088061620961040C 

—0 5421912 1645069340+00 

-0.121865252606861280401 


CILCG(X) 
0. 16450591555022320401 
0.16448089369925260401 
0. 16449340652508070+01 
0-1644934064445641D+01 
0.16449340668482590+01 
00 16449340668481910401 
—0282246010194265020400 
0. 8224739648 8626 100+00 
~0 82246 703335479830400 
— 0.82246 703349342 140400 
O60 82246 7033424112404+00 
0282246 703342411350+00 


~0.682246703 342411320400 
0224674C1100272339D+01 
0.58224052646501250+00 
0.24003456863759670+01 
Co24186501036761140401 
0.75539561953174140+00 
04264C8806 16209610400 

-0.54219121645065340+00 

-0612185252606661300+01 


CrECK 
Cob 6450591955022340t01 
0.16448089369929260+01 
02 1644934069250 8110401 
0. 1644934064445641001 
0616449 34066848 2620401 
04 16449340668481910401 
-0.822460101942€5020+00 
0 68224739648862E150400 
-0.82246703335479850+00 
—€ 682246 7103349342790400 
0682246 703342411250400 
-0.8224617103342411390+00 


DILOG = X*(1-eDO+X*C-25DB+X*C1-DG/9 «DO+XK* 
* (625 .D-44+X*(4.D-24Xal 1 eDI/36.DI+X#C1-DG/ 


* 49-D0+X 
RETURN 

C DILOG COMPUT 

C AND DESCRIPT 

18 Y = 1.D@ 

BY = -le 


764eD8I)>>)> 


ED FROM REF. NOs ls 
ION OF THIS ALGORITHM, 
7% 

D@ - Y*(4.D8+4) 


P.244s 


EQ(6)» 
EQC4)« 


DILOG = 3-28986813369645287D8 - 
* ¢SDO@*DLOG(X)*%*2 + CY RC 4eDO45~675DG*Y) +3-D0% 
* C1LeDOtY)*(1-D8-Y) *DLOGC 1 -DO-Y))/BY 

IF (DILOG+4.D0*Y.EQ-DILOG) RETURN 


GO TO 8 


C DILOG COMPUTED FROM REF. 


C X#1/X + EQC6 
G EQ¢4). 


20 Y = 1.D6 


NOe Is 


Peedds 


EQ¢(7) WITH 


2» AND DESCRIPTION OF THIS ALGORITHM, - 


- 16D8/X 


DX = DLOG(X) 


BY = 
DILOG = 


1-DB + Y*(4-DO+Y) 


1-64493406684322643D0 + 
* DX*(«5D@*DX-DLOG(X-1-D8)) +. 


VERY SMALL ARGUMENTS * CY*KC4oDO45-75DO*Y) -3eDOkC1eDO+Y) KDX/X) / BY 
x DILCGIX) ChECK Go To Ba is 
0.1000000000000001D-01  0410025111740139110-01  G.10025111740135100-01 GID TOE COHEGT ERO FROM EE Sos pabe sei. oes 


0299999999999999900-02 
-0. 99999999999999900-02 
-0.100000000000000 10-01 

0.10000000000000000-04 
-0.10000000000000000-04 


Oe LOOZS5L1174013908D-O01 
—0-99751104900835260-02 
-0.99751104900835450-02 

0-10000025000111110-04 
-0.99999750001111100-05 


0. L0025111740135080-01 
-0299751104900835260-02 
—0.99751104900835460-02 

0.10000025000111L10-04 
-0.9595S 750001111100-05 


38 DILOG = 
RETURN 


C DILOG COMPUTED FROM REF. 


C AND DESCRIPT 


1.6449 34866848 22643D8 


NOe lo 
ION OF THIS ALGORITHM, 


P.244, 


EQ(7). 
EQC(4). 


0 10000000000000000D-09 0.1000000000025C000-09  0.10000000000250000-09 48 Y = 1.D@ - X 
-0.1000000000000000D-09 -0.99999999997500G0D-10 -0.9599995959575C0000-10 DX = DLOG(X) 

0.10000000000000000-14 0.10000000000000000-14 0.10000000000000000-14 BY = -1eDQ - Y*(4eDg+¥) 
-021000000000000000D-14 -0.9599999995999997D-15 ~-0.95559559959999970-15 DILOG = 1-64493406684822643D@ - DX*DLOG(Y) + 


0. 10000000000000000-29 
—Ge 190000000000000000- 29 
VERY LARGE ARGUMENTS 
x 


0.10000000000600000-29 
-0.1€000000000C00000-29 


DILOG(X)-CILCG(-X) 


0.16000000000000000-29 
~021000060C00000000000-29 


CrECK 


* CY*C4.DG456¢75DO*XY)+3-DG*( 1 eDB+Y) *DX*X) / BY 


oGO TO 8B 


C DILOG COMPUTED FROM DESCRIPTION OF THIS ALGORITHMs 


0.10000000000000000403 0449148015783144580401 0.49148019783144570+01 C EQ¢4) 
0.10000000000000000406 0.4934782200544664D#01  044934782200544675D+01 50 Y¥ =X 
G- LOOGDODDCNCODODODE 11 «—-06 493.48022003446200401  0449348022003446750401 BY = 1¢D@ + Y*(4-DO+Y) 
0.10000000000000000+16 0.49348022005445960401  0.49348022005446770+01 DILOG = CY*C4.DO+5e75DG*Y) +3.DO*( 1 eDO+Y) * 
0-10000000000000V0D+31 0.49348022005445390401 0.49348022005446790+01 * (€1eDO-Y)*DLOG(1-DB-Y)>/BY 

GO TO 8@ 


C DILOG COMPUTED FROM REF. 


NO. ls 


P.245, 


EQC12) WITH 


C X=-Xs AND DESCRIPTION OF THIS ALGORITHM», EQC(4). 
68 Y = 1.4D8/(€1-D8-xX) 
ae DX = DLOG(-X) 
Author Ginsberg would like to acknowledge the hospitality of DY = DLOGCY) 
BY = 1-D0 + Y*C4-D@G+Y) 


the Center for Theoretical Physics at M.I.T. Both authors are 
indebted to W.J. Cody of Argonne National Laboratory for sug- 
gesting many improvements to the original program. 


DILOG = -1+64493406684822643D6 + 
* eSDO*DY*(DY+2.DO*DX) + CY*C4.-DG+5~-75D0*Y) 
* +36D0*C1-DO0+Y)*C1-D@-Y) *(DX+DY) 0 / BY 

IF (DILOG+4-D@*Y.+EQ.DILOG) RETURN 


GO TO 8 
C DILOG COMPUTED FROM REF. NO. ls Pe24ds EQ(8)s 
References C AND DESCRIPTION OF THIS ALGORITHM, EQ¢4). 
L i L. Dil “tf i Ace aleel Rune id 78 Y = X/(X-1.D@) 
‘1. Lewin, L. Dilogarithms and Associated Functions. MacDonald, DX = DLOGCI+DB-x) 
London, 1958. BY = -1.D@ - Y*(4.D+Y) 
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DILOG = (Y¥*(4.D0+5«75D0*Y)-3.D0*C1-DO+Y)* Y = X/XOB - 1-DO8 
* (1-D@-Y)*DX)/BY - »«S5DQ*DX*DX Z = 16D8/11-5951763698458184D9 
88 B = 4-DO*Y*Y/BY Wo= Yx*xZ 

DO 98 N¥1,38 Cl = €3.D@*X8-2.D08)/6-D28 
B = Bey C2 = €C11eD8*XB~15-DG) *X8+6-D8)/24-D6 
A = B/C{N) C3 = €CC5eD1*XB-104-DB)*XG+B4.D8) *XG-24-DB)/ 
TEST = DILOG * l2eD1 
DILOG = DILOG + A C4 = €€66274¢DO*¥XG-77-D1)*KO4+94eD1)*XGO-54-D1)* 
IF (DILOG-EQeTEST) RETURN * XG+12.D1)/72-D1 


96 CONTINUE 
RETURN 
C DILOG COMPUTED FROM TAYLOR SERIES ABOUT: ZERO OF 


C DILOGs 


XGeo 


DILOG = Y*C1eDQ-Y*(-SDO-Y*(C1-D8/3-D0-Y* 
* (€e25DG-Y*Ce2D8-Y/6+D9)))))*DLOG(Z) ~ 


RETURN 


108 X® = 12-5951783698450184DG END 
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The Dilogarithm Function of a Real Argument [S22] 
[E.S. Ginsberg and D. Zaborowski, Comm. ACM 18, 4 (April 1975), 200-202 ] 


Robert Morris [Recd 11 July 1975] 

Bell Laboratories, Murray Hill, NJ 07974 

The necessary value for the zero of the dilogarithm function is 
12.5951703698450161286398965. . . 


to 25 decimal places, all correct. The value given in Algorithm 490 is in error in 
the last two digits. 


The identity stated for values of x less than —1 is incorrect and should read 


Ing(z) = — 2?/6 — 4n (1 — 2)[2 X In (—2) — In (1 — 2) ) + Ze(i/(1 — 2)). 


* WAXO*Y* (Ce SDO-W*CC 1-We (CO-We (C3 W#C4) > 
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Algorithm 491 
Basic Cycle Generation [H] 


Norman E. Gibbs [Reed 13 July 1971] 
Department of Mathematics, College of William and 
Mary, Williamsburg, VA 23185 


This work was partially supported by NASA under Grant 
NGL-47-006-058. 


Key Words and Phrases: Graph, basic cycle, fundamental 
cycle, spanning tree, vertex adjacency matrix 

CR Categories: 5.32, 3.24 

Language: PL/I 


Description 

The PL/I procedure BASIC_.GENERATOR is an implementa- 
tion of Paton’s algorithm [1] for finding a set of basic (fundamen- 
tal) cycles of a finite undirected graph from its vertex adjacency 
matrix. 

The input parameters to the procedure are: 

(1) A modified form of the vertex adjacency matrix, called A 
(see assumption 3 below). 

(2) The number of vertices of the graph, called N. 

(3) The number of edges of the graph, called EDGES. 

The output of the procedure is an array of bit strings, called B. 
The jth bit of B; is 1 if and only if the ith basic cycle contains the 
edge labeled /. 

The following assumptions are made by the procedure: 

(1) The graph is finite, connected, undirected, and without 
loops or multiple edges. 

(2) The vertices are labeled 1, 2,..., N. 

(3) The vertex adjacency matrix A has an edge table coded into 
its lower triangular part. The following PL/I code could be used 
to generate the table: 


E =0; 
DOI =2TON; 
DOJ =1TOI-—1; 
IF Ad, J) = 0 THEN 


DO; 
E=E+1; 
A(I, J) = E; 

END; 


(4) A is not the vertex adjacency matrix of a tree. 
The algorithm is: 


Step 1. Let vertex 1 be the root of the spanning tree. Start forming 
the spanning tree by placing all edges of the form {1, W} into 
the tree. At the same time, place all vertices W into a push- 
down list called STACK. 

Step 2. Let Z be the last vertex added to STACK (i.e., the top of 
the stack). If STACK is empty, then stop. If STACK is not 
empty, then remove Z from STACK and go to step 3. 

Step 3. Consider all edges |Z, W} which have not been examined. 
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If all edges have been examined, go to step 2. Otherwise, for 
each edge {Z, W} do the following: 

(a) If W is in the tree, generate the basic cycle formed by 
adding {Z, W} to the tree and repeat step 3. 

(b) If W is not in the tree, add {Z, W} to the tree, Wto STACK, 
and repeat step 3. 


For details on the algorithm and the production of the basic 
cycles, Paton’s original paper should be consulted. This paper also 
discusses two other algorithms for basic cycle generation and con- 
tains performance statistics. 

BASIC_GENERATOR has been implemented using the 
IBM PL/I F-level compiler (version 5.1) and has been tested on 
approximately 200 graphs. 


Reference 

1. Paton, K. An algorithm for finding a fundamental set of cycles 
of a graph. Comm. ACM 12,9 (Sept. 1969), 514-518. 

Algorithm 


BAS IC_GENERATOR: 

PROCEDURE (4,N,FDGES, 8); 

7* BASIC_GENERATOR GENERATES A SEF ME RASTC CFUNDAMENTAL ) 
CYCLES FROM THE VERTEX ADJACENCY MATREX OF A CONNECTED 
UNOTRECTED GRAPH WITHOUT LOCPS OR MULTIPLE EDGFS. THF 
PROCEDURE IS A PL/I EMPLEMENTATION OF KEITH PATIN'S 
ALGORITHM DESCRIBED IN CACM 12, 9 CSEPTRFMSER 1959), 
914-518. a] 

DECLARE 
(AC#,*®)yNeEDGES ) BINARY FIXED (1540), 
B(*) SIT CENGES), 
RASIC AINARY FLEXED (1540) ENITIAL (0), 
T BIT (NP TNITEAL (*°O°B), 
STACK CONTROLLEO BINARY FIXFD (1590), 
(ZeWeJ) BINAQY FIXED (1590), 
PREV(N) BINARY FIXED (15,0) INITIAL COND 9S 
/* A 1S AN N RY N VERTEX ADJACENCY MATRIX OF THE GPAPH, 
THE LOWER TRIANGULAR PORTION CONTAINS AN ENGE 
TABLE. FF J>K AND ACJyKI=M, THEN EDGE M JOINS 
VERTICES J AND K EN THF GRAPH. TF A€SyKI7=9 AND 
J>K THEN A(KyJI=l. THE UPPER TRIANGULAR PART NF A 
1S DESTROYED IN THE PROCESS, AUT CAN RE EASILY 
RECOVERED FROM THE LOWFR TRIANGULAR PART. CINPUT) 
N 1S THE NUMBER OF VERTICES IN THF GRAPH. CENPUT) 
EDGES 1S THE NUMBER OF FOGES IN THE GRAPH. (TNPUT) 
3 WILL BF THE SFT OF BASIC CYCLES GENERATED. THE 
K TH 81T DOF BIS) TS L TF AND ONLY ETF THE JS TH 
BASIC CYCLE CONTAINS THE ENGE LASELED K (NYUTEUT). 

BASIC [TS USFD TO ENDFX THE BASTC CYCLES AS THEY ARE 
GENERATED. 

T IS USEN TO KEFP TRACK OF THE VERTICES CURRENTLY 
IN THF SPANNING TREE. 

STACK IS A PUSHDOWN LIST USED TO HOLD THE VERTICES OF 
THE SPANNING TREE WHICH HAVE NOT YFT BREEN 
EXAMINED. 

Z tS THE VERTEX AF THE SPANNING TREE CURRENTLY BEING 
FXAMINED. 
w TS USED TO FIND EOGES WHICH CONNECT TO Z. 
PREV TS AN APRAY USED IN THE PRODUCTION OF THE BASIC 


CYCLFS. TE PRFV(K)=J THEN (K,J) ITS AN EOGE SF THE 
TREE WITH J NEARER THE RONT. */ 
/* INITIALIZATION SECTION--NOTE THAT VERTEX 1 IS ALWAYS 
THE ROOT. xf 
B="0'B; 


SUBSTRIT ys Le lLdP=*1* Bs 
ALLOCATE STACK; 
STACK=05 
ALLOCATE STACK; 
STACK=15 
NEw _Z: 
Z=STACK 
ITF 27=0 THEN RFTURN 
ELSE 
00; 
FREE STACK; 
N0 W=2 TO NG 
IF ACMIN(ZyW) sMAX(ZysW))D=1 THEN 
on: 
IF SUASTR(T Weld THEN 
7* THE FOGE CONNECTING 2 AND W CREATES A BASIC CYCLE. */ 
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DO; 

BASIC=BASIC415 

SUBSTRIBCRASTC),ACMAXOW»PREVOW) Ds 
MIN(WePREV(W))) eLI=tL'BS 

SUB STRIBI BASIC) sACMAX(Z pW) sMING ZW) De 
Lb= FL Bs 

ACUMINGZ Wd) MAX CZ ow) =05 

J=73 

DO WHILE (J~=PREVOW)); 
SUBSTR (BUI BASTC) 5 ACMAX (PREVI J), JS), 

MINCPREV(S) odd LI=PLO RG 

JSPREV(SD: 


END$ 
END; 
ELSE 
7* THE FOGE CONNECTING 2 AND W SHOULD BE PLACED IN THE 
TREE. x/ 
DNs 
PREV W)I=Z;5 
SUBSTR(T We LIT 1L BS 
ALLOCATE STACK3 
STACK=WS 
A(MINGZ 9H) ¢ MAX(Z,W) )=03 
END; 
END; 
END; 
GO TO NEwiZ3 
END; 


END BASIC _GENERATOR 
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Algorithm 492 
Generation of All the 


Cycles of a Graph from a 
Set of Basic Cycles [H] 


Norman E. Gibbs [Recd 13 July 1971] 
Department of Mathematics, College of William and 
Mary, Williamsburg, VA 23185 


This work was partially supported by NASA under Grant 
NGL-47-006-058. 


Key Words and Phrases: basic cycle, cycle, graph 
CR Categories: 5.32, 3.24 Language: PL/I 


Description 

The PL/I procedure CYCLE_GENERATOR is an implemen- 
tation of Gibbs’ algorithm [1] for finding all the cycles in a graph 
from a set of basic cycles. 

The input parameters are: 

(1) An array of bit strings B, where the /th bit of B; is 1 if and 
only if the ith basic cycle includes the edge labeled /. 

(2) The number of basic cycles, called BASIC. 

(3) The number of edges in the graph, called EDGES. 

The output from the procedure consists of: 

(1) An array of bit strings Q, where the /th bit of Q; is 1 if and 
only if the ith cycle contains the edge labeled /. 

(2) The number of cycles, called CYCLES. 

The algorithm is: 


Step 1. Set C = {Bi}, Q = C,D = R=96,i = 2. If BASIC = 1, 
stop. 

Step 2. For all T € Q, if TMB; = 9, then set D = D U{T®@ Bi, 
otherwise set R = R U{T@B}.(A®B= AUB-— 
AN B). 

Step 3. For all U, V€ R, if UC V, set D = D U{V} and R = 
R— {V}. 

Step 4. SetC = C UR U{B},Q@=CUD,R=$%,i=i+1. 

Step 5. If i > BASIC, stop. C is the set of all cycles. If i < BASIC, 
go to step 2. 

In CYCLE_GENERATOR, C = {Q(D): QFLAG(D = '0'B}, 

D = Q — C, and R = {Q(LOWER), Q(LOWER+1),..., 

Q(UPPER)}. The procedure assumes that BASIC > 0 and that the 

dimension of Q is 22487¢ — |. CYCLE.GENERATOR has been 

implemented using the IBM PL/I F-level compiler (version 5.1) 

and has been tested on approximately 200 graphs. 


Reference 
1. Gibbs, N.E. A cycle generation algorithm for finite 
undirected linear graphs. J. ACM 16, 4 (Oct. 1969), 564-568. 


Algorithm 
CYCLE_GENERATOR: 
PROCEDURE (By BASIC 4Q eC YCLES, EDGES); 
/* CYCLE_GENERATOR GENERATES ALL THE CYCLES %F A GRAPH 
FROM A SET OF BASIC (FUNDAMENTAL? CYCLES. THIS 
PROCEDURE IS A PL/I [IMPLEMENTATION OF NORM GIRBS! 
ALGORITHM FOR GENERATING ALL THE CYCLES OF A GRAPH 
WHICH APPEARED IN JACM lby 4 (UCTOBER 1969), 564-568. */ 
DECLARE 
(BC*),O0#)) BIT LEDGES), 
(BAST C,CYCLES,EDGES) BINARY FIXED (15,0)¢ 
QFL AGU 2**BASTC-1) BIT C1) INT TEALC(2®*BASIC-1LI(1L)'O'BD, 
(QINDEXs IyJ¢K,pUPPERsLGWER} BINARY FIXED (15,013 
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/* B IS THE SET OF BASIC CYCLES WHERE THE K TH BIT OF 
B(J) 1S 1 FF AND ONLY IF EDGE K TS AN ELEMENT OF 
THE J TH BASIC CYCLE. (INPUT). 
Q 1S THE SET OF ALL CYCLES GENERATED. THE K TH BIT 
OF Q(J) £8 1 TF AND ONLY IF €D0GE K IS AN ELEMENT 
OF THE J TH CYCLE. (OUTPUT). 
BASIC 1S THE NUMBER OF BASIC CYCLES IN 8B. 
(INPUT). 

CYCLES IS THE NUMBER OF CYCLES GENERATED. (OUTPUT). 
EO0GES 1S THE NUMBER OF ENGES IN THE GRAPH. (INPUT). 
QFLAG IS A LOGICAL ARRAY USED TO MARK EDGE-DISJIINT 

UNIONS OF CYCLES. 

OTHER IDENTIFIERS ARE USED AS COUNTERS OR POINTERS. */ 

7* INITIALIZATION STEP. THE PROCEDURE ASSUMES THAT 
BASIC>0. */ 

QCLI=BC1)5 
IF BASIC=1 THEN 
DO; 
CYCLES=BASIC; 
RETURN; 
END; 
/7* FORM ALL LINEAR COMBINATIONS OF THE BASIC CYCLES IN Q.¥*/ 
00 f=2 TO BASIC; 
LOWER=2%* (1-195 
UPPER=24**1-1; 

/* [TF BCI) INTERSECT Q(QINDEX) IS EMPTY, THEN THE SYMMETRIC 
DIFFERENCE OF B{I) AND Q(QINDEX) IS THE UNION OF NIS- 
JOINT CYCLES AND THE APPROPRIATE ELEMENT OF QFLAG IS 
SET TO '1'B. OTHERWISE THE SYMMETRIC DIFFERENCE IS 
PLACED INTO A SET (INDEXED BY LOWER AND UPPER) FOR 
FURTHER TESTING. */ 

OO QINDEX=1 TO LOWER-135 
IF BCT) & QCQINDEX) THEN 
003 
Q(UPPERD=( BCT) 1Q(QINDEX) E(B CE) | 7QCQENDEXD); 
UPPER=UPPER-1L3 
END; 
ELSE 
DO; 
QCLOWER) = (BUT) [QC QUNDEX PH (>B CTP 1 -QCQINDEX) 3 
QFLAG (LOWER )="1°B; 
LOWER=LOWER+15 
END; 
Q(LOWERI=B(1); 
END; 

/* WE NOW TEST THE SET Q(LOWER), QULOWER#+1) 4.-.,Q(UPPER) 

TO SEE IF ANY ELEMENT OF THIS SET PROPERLY CONTAINS 
ANY OTHER ELEMENT. IF SO», THE CONTAINING ELEMENT IS 
MARKED AS THE EOGE-OISJOINT UNION OF CYCLES AND THE 
APPROPRIATE ELEMENT OF QFLAG IS SET TO '1'B. */ 
OO JeLOWERt] TO 2**1-2;5 

DO K=J+1 TO 2**1-13 
TF QFLAG(J) THEN GO TO NEXT_J; 
ELSE IF QFLAG(K) THEN GO TO NEXT_K3 
TF (QCJDLQCK)D=QC I) THEN QFLAG(JI=91°BS 
ELSE IF (QC SP FQCKID=Q(K) THEN QFLAG(K)I="1'BS 


(BASIC > 0). 


NEXT_Ks 
ENO; 
NEXT_J: 
END; 
END; 

/* BEFORE RETURNING, WE WANT TO MOVE ALL THE CYCLES (THOSE 
ELEMENTS OF Q FOR WHICH QFLAG IS *0'B) TO QEL), Ql2), 
eoes QICYCLES) AND SET CYCLES EQUAL TO THE NUMBER OF 
CYCLES IN Q. */ 

CYCLE S=03 

HOUSEKEEPING: 

DO I=1 TO 2**BASIC-13 
IF QFLAG(I) THEN GO TO NEXT_I$ 
ELSE 

00; 
Q(CYCLES# 1) =Q01)3 
CYCLES=CYCLES +1; 
END; 

NEXT_I: 

ENO HOUSEKEEPING; 

END CYCLE_GENERATOR; 


